]> git.immae.eu Git - github/wallabag/wallabag.git/blob - inc/3rdparty/site_config/standard/technologyreview.com.txt
minimum of control on server side added
[github/wallabag/wallabag.git] / inc / 3rdparty / site_config / standard / technologyreview.com.txt
1 title: //header[@class='article-meta']/h1
2 title: substring-before(//title, '|')
3
4 body: //section[contains(@class, 'body')]
5
6 # Author & Date for News and Featured Stories
7 author: //ul[@class='byline']/li/a
8 author: substring-before(substring-after(//ul[@class='byline']/li, 'By '), ' on')
9 date: substring-after(//ul[@class='byline']/li, 'on ')
10
11 # Author & Date for "Views"
12 author: //div[@class='view-byline']/div[@class='meta']/h2[1]
13 date: //div[@class='view-byline']/div[@class='meta']/h2[2]
14
15 next_page_link: //section[@class='pagination']/a[contains(@class, 'continue')]
16 test_url: http://www.technologyreview.com/news/427567/facebooks-telescope-on-human-behavior/