]>
Commit | Line | Data |
---|---|---|
ac4d1142 NL |
1 | author: substring-after(substring-before(//div[@id='byline'],'|'),'By')\r |
2 | author: //div[@class='byline']/a\r | |
3 | date: //span[@class='pubdate']\r | |
4 | # print friendly page\r | |
5 | body: //div[@id='text']\r | |
6 | # regular page\r | |
7 | body: //div[@id= 'articlecontent']\r | |
8 | \r | |
9 | strip: //div[@id= 'articlecontent']/h1\r | |
10 | strip: //div[@id='articlecontent']/p[@class='deck']\r | |
11 | strip: //div[@id='articlecontent']/div[@class='byline']\r | |
12 | strip: //div[@id='articlespacer']\r | |
13 | strip: //div[@id='incsharebox']\r | |
14 | strip: //div[@id='articlesidebar']\r | |
15 | \r | |
16 | prune: no\r | |
17 | \r | |
18 | single_page_link: //a[contains(@href, 'Printer_Friendly.html')]\r | |
19 | strip: //a[contains(., 'Dig Deeper')]\r | |
20 | test_url: http://www.inc.com/guides/2010/11/seven-tips-for-lobbying-politicians.html\r | |
21 | test_url: http://www.inc.com/eric-schurenberg/startups-are-we-geting-irrationally-exuberant.html |