]>
Commit | Line | Data |
---|---|---|
ac4d1142 NL |
1 | body: //div[@class='blogbody']\r |
2 | strip: //h3[@class='title']\r | |
3 | date: //h2[@class='date']\r | |
4 | #Should Atwood just be a literal?\r | |
5 | author: substring-before( substring-after(//div[@class='posted'], 'y'), 'V')\r | |
6 | \r | |
7 | # tim.kingman@... 2011-07-26\r | |
8 | # Prune:no to retain all-link ULs that are part of the body content like\r | |
9 | # http://www.codinghorror.com/blog/2011/07/building-a-pc-part-vii-rebooting.html\r | |
10 | # Then explicitly strip the "Posted By" and prev/next links that Prune:yes would have removed.\r | |
11 | \r | |
12 | prune: no\r | |
13 | strip: //div[@class='posted']/following-sibling::*\r | |
14 | strip: //div[@class='posted'] | |
15 | test_url: http://www.codinghorror.com/blog/2011/07/building-a-pc-part-vii-rebooting.html |