]>
Commit | Line | Data |
---|---|---|
ac4d1142 NL |
1 | # metadata\r |
2 | author://div[@class = 'post']/div[@class='meta']/a[1]\r | |
3 | date://div[@id = 'rap']/h2[1]\r | |
4 | body://div[@class = 'post']\r | |
5 | \r | |
6 | # wrapping caption and image\r | |
7 | wrap_in(fieldset)://div[contains(@class, 'wp-caption')]\r | |
8 | \r | |
9 | \r | |
10 | # clean up\r | |
11 | strip://div[@class = 'post']/h3[@class = 'storytitle']\r | |
12 | strip://div[@class = 'post']/div[@class = 'social']\r | |
13 | strip://img[@style = 'display:none;']\r | |
14 | strip://img[@height='0' and @width='0'] | |
15 | test_url: http://blogs.smithsonianmag.com/adventure/2011/10/tips-for-women-traveling-in-turkey/ |