Tuesday, November 24, 2009

Charming Python: Easy Web data collection with mechanize and Beautiful Soup

Charming Python: Easy Web data collection with mechanize and Beautiful Soup: "For collecting data from Web pages, the mechanize library automates scraping and interaction with Web sites. Mechanize lets you fill in forms and set and save cookies, and it offers miscellaneous other tools to make a Python script look like a genuine Web browser to an interactive Web site. A frequently used companion tool called Beautiful Soup helps a Python program makes sense of the messy 'almost-HTML' that Web sites tend to contain."