I'm looking for a script that will parse a chunk of HTML (typically a web page) and strip out the advertising.
It seems quite simple to do, and I can't imagine it hasn't already been done. All one has to do is:
- compile a list of ad-exchange domains
- strip out every HTML element on the page (from opening tag to closing tag) that points to one of those domains.
$myPageClean = stripAds($myPageOriginal);
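To make the idea concrete, here is a rough sketch of the two steps in Python, using only the standard library. Everything named here is an assumption for illustration: `AD_DOMAINS` is a tiny hypothetical blocklist (a real one would come from a maintained filter list), `strip_ads` stands in for the `stripAds` call above, and "matches a domain" is taken to mean that the element's `src` or `href` attribute points at a listed domain or one of its subdomains. It is not how Adblock does it, and it will miss inline scripts and ads served from the page's own domain.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse

# Hypothetical blocklist, for illustration only.
AD_DOMAINS = {"doubleclick.net", "googlesyndication.com"}

# Elements that never have a closing tag, so nothing follows them to skip.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


def is_ad_url(url, domains=AD_DOMAINS):
    """True if the URL's host is a listed domain or a subdomain of one."""
    try:
        host = urlparse(url).hostname or ""
    except ValueError:  # malformed URL: treat as not an ad
        return False
    return any(host == d or host.endswith("." + d) for d in domains)


class AdStripper(HTMLParser):
    """Re-emits the page, dropping elements whose src/href is an ad URL."""

    def __init__(self, domains):
        super().__init__(convert_charrefs=False)
        self.domains = domains
        self.out = []
        self.skip_tag = None   # name of the element currently being dropped
        self.skip_depth = 0    # nesting depth of same-named tags inside it

    def _is_ad(self, attrs):
        return any(name in ("src", "href") and value
                   and is_ad_url(value, self.domains)
                   for name, value in attrs)

    def handle_starttag(self, tag, attrs):
        if self.skip_tag:
            if tag == self.skip_tag:
                self.skip_depth += 1
            return
        if self._is_ad(attrs):
            if tag not in VOID_TAGS:  # drop everything up to the closing tag
                self.skip_tag, self.skip_depth = tag, 1
            return
        self.out.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        if not self.skip_tag and not self._is_ad(attrs):
            self.out.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if self.skip_tag:
            if tag == self.skip_tag:
                self.skip_depth -= 1
                if self.skip_depth == 0:
                    self.skip_tag = None
            return
        self.out.append(f"</{tag}>")

    def _keep(self, text):
        if not self.skip_tag:
            self.out.append(text)

    def handle_data(self, data):
        self._keep(data)

    def handle_entityref(self, name):
        self._keep(f"&{name};")

    def handle_charref(self, name):
        self._keep(f"&#{name};")

    def handle_comment(self, data):
        self._keep(f"<!--{data}-->")

    def handle_decl(self, decl):
        self._keep(f"<!{decl}>")


def strip_ads(html, domains=AD_DOMAINS):
    parser = AdStripper(domains)
    parser.feed(html)
    parser.close()
    return "".join(parser.out)
```

Usage would then be `my_page_clean = strip_ads(my_page_original)`. Matching on the parsed hostname rather than on a raw substring avoids false hits such as a page that merely mentions an ad domain in its text.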
If you have done this before or can do this now, that would be lovely!
I know Adblock is open source and provides this very functionality.