Web Development

Reverse Templating for Scraping websites with Zope, TAL, Beautyful soup

AkaSig wrote about a solution for web scraping that looks pretty elegant to me… The idea is to reverse the use of ZPTs (Zope Page Templates).

The idea is to reverse the templating process. You take an example of an existing webpage and use zpt-like instructions to get data out of similar html pages into data.

He has several examples in his article – if I need to do webscraping, I'll try and grab his .zip first.

He also pointed me to BeautifulSoup which is the python parser basis for it.

Similar entries