Sumana Harihareswara on Nostr: http://www.crummy.com/2004/05/20/1 "But other parsers know too much about HTML. They ...
http://www.crummy.com/2004/05/20/1
"But other parsers know too much about HTML. They choke on or try to rewrite bad markup. They assume you care about the whole document. A pirate might make you walk the plank, but only a parser would make you walk the whole tree."
Happy 20th birthday to the #Python screen-scraping library Beautiful Soup by Leonard Richardson (npub14hs…swac) .
"But other parsers know too much about HTML. They choke on or try to rewrite bad markup. They assume you care about the whole document. A pirate might make you walk the plank, but only a parser would make you walk the whole tree."
Happy 20th birthday to the #Python screen-scraping library Beautiful Soup by Leonard Richardson (npub14hs…swac) .