| Commit | Line | Data |
|---|---|---|
| ac4d1142 NL | 1 | title: //*[@class="article"]/h1 |
| | 2 | date: //*[@class="article"]/div[@class="date"] |
| | 3 | |
| | 4 | # strip the title and date from the article text |
| | 5 | strip: //*[@class="article"]/h1 |
| | 6 | strip: //*[@class="article"]/div[@class="date"] |
| | 7 | |
| | 8 | # strip annoying <br> between metadata and article |
| | 9 | strip: //*[@class="article"]/div[@class="date"]/following-sibling::br |
| | 10 | test_url: http://minnesota.publicradio.org/display/web/2012/06/19/health/senators-want-health-care-ruling-on-tv/ |
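
The rules in the table above follow a simple `key: value` layout, with `#` comment lines and blank lines in between. As a rough illustration of how such a file could be read, here is a minimal sketch that groups the values by key. The helper name `parse_site_config` and the `SAMPLE_CONFIG` constant are hypothetical, and the parsing rules (split on the first colon, skip comments and blanks) are an assumption drawn only from the lines shown here, not from any tool's documented behaviour.

```python
from collections import defaultdict

# Sample text copied from the Data column above (assumed to be the whole file).
SAMPLE_CONFIG = '''\
title: //*[@class="article"]/h1
date: //*[@class="article"]/div[@class="date"]

# strip the title and date from the article text
strip: //*[@class="article"]/h1
strip: //*[@class="article"]/div[@class="date"]

# strip annoying <br> between metadata and article
strip: //*[@class="article"]/div[@class="date"]/following-sibling::br
test_url: http://minnesota.publicradio.org/display/web/2012/06/19/health/senators-want-health-care-ruling-on-tv/
'''


def parse_site_config(text):
    """Hypothetical helper: group 'key: value' lines by key, in file order."""
    rules = defaultdict(list)
    for raw_line in text.splitlines():
        # splitlines() handles CRLF endings; strip() removes stray whitespace.
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and comments
        # Split on the FIRST colon only: XPath axes ("following-sibling::br")
        # and URLs ("http://") contain further colons that belong to the value.
        key, sep, value = line.partition(":")
        if not sep:
            continue  # assumption: lines without a colon are ignored
        rules[key.strip()].append(value.strip())
    return dict(rules)


rules = parse_site_config(SAMPLE_CONFIG)
for key, values in rules.items():
    print(key, len(values))
```

Keeping a list per key matters because `strip` appears three times; a plain dict assignment would keep only the last rule.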