git.immae.eu Git - github/wallabag/wallabag.git/blame - inc/3rdparty/site_config/standard/neh.gov.txt
#host configuration should be http://www.neh.gov/news/humanities/


#meta data
#the title and date expressions assume <title> has the form 'prefix:date:headline'
title:substring-after(substring-after(//title,':'),':')
author:substring-after(//h2[@class = 'subHead'],'By')
date:substring-before(substring-after(//title,':'),':')

#img and caption handling
wrap_in(small)://div[@id = 'mainContent']/table/descendant::p/descendant::text()
wrap_in(fieldset)://div[@id = 'mainContent']/table

# clean up
strip: //table[@class = 'marginpaddingTop']
strip: //h2[@class = 'subHead']

test_url: http://www.neh.gov/news/humanities/2011-11/IslamicScholar.html