]> git.immae.eu Git - github/wallabag/wallabag.git/blame - inc/3rdparty/site_config/standard/abcnews.go.com.txt
updated specific configuration for parsing
[github/wallabag/wallabag.git] / inc / 3rdparty / site_config / standard / abcnews.go.com.txt
CommitLineData
4e067cea
NL
1title: //h1[@class='headline']
2body: //div[@id='storyText']
3# for video entries
4body: //img[@id='ff-img'] | //div[@id='meta']//div[contains(@class, 'overview')]
5author: //div[@class='byline']
6date: //div[@class='date']
7strip: //*[@id='date_partner']
8
9strip: //div[@class='breadcrumb']
10strip: //div[contains(@class,'show_tools')]
11strip: //div[@id='sponsoredByAd']
12strip: //div[contains(@class,'rel_container')]
13strip: //p[a[starts-with(@href, 'http://www.twitter.com')]]
14strip: //p[a[starts-with(@href, 'http://www.facebook.com')]]
15strip: //p[contains(., 'Click here to return to')]
16#strip_id_or_class: media
17strip_id_or_class: mediaplayer
18
19replace_string(<link rel="image_src" href="http): <img id="ff-img" src="http
20
21prune: no
22
23single_page_link: concat(//li[@class='pager']//a/@href, '&singlePage=true')
24
25test_url: http://abcnews.go.com/Politics/newt-gingrich-rocky-rollout-presidential-campaign-recover/story?id=13632744
26# multi-page
ac4d1142 27test_url: http://abcnews.go.com/Blotter/family-freed-american-hostage-somalia-seals-obama/story?id=15439544