]>
Commit | Line | Data |
---|---|---|
ac4d1142 NL |
1 | date: //meta[@name="published"]/@content\r |
2 | date: //div[@class="timeLine"]\r | |
3 | title: //div[@id='contentBody']//h1\r | |
4 | author: //dl[@class="storyBlogByline"]/dd/a\r | |
5 | body: //div[@id='storyMediaBox'] | //div[contains(@class, 'storyText')]\r | |
6 | \r | |
7 | # Content Pruning\r | |
8 | strip: //div[@class="scrollingArrows"]\r | |
9 | strip: //div[@class="timeLine"]\r | |
10 | strip: //dl[@class="storyBlogByline"]\r | |
11 | \r | |
12 | prune: no\r | |
13 | \r | |
14 | test_url: http://www.cbsnews.com/8301-201_162-57366361/rescued-americans-dad-proud-of-the-u.s/ |