]> git.immae.eu Git - github/wallabag/wallabag.git/blame - inc/3rdparty/site_config/standard/domusweb.it.txt
merge fix 776
[github/wallabag/wallabag.git] / inc / 3rdparty / site_config / standard / domusweb.it.txt
CommitLineData
4e067cea
NL
1# TODO: clean up the extra junk at the end of articles
2
3# general text formatting
4prune: no
5convert_double_br_tags:yes
6
7# where to find the basic metadata
8author://a[@class='articleauthor']
9date://a[starts-with(@href,'/en/search/published/')]
10title:substring-before(//h2[@class='title'],'—')
11body://div[@id='maincontainer']
12
13dissolve://div[starts-with(@id,'commentableblock')]
14
15# clean up the crap
16strip://div[contains(@class,'domusnetwork')]
17strip://div[contains(@class,'relative_wrapper')]
18
19strip://div[contains(@class,'captionsubimage')]/img[contains(@class,'arrow')]
ac4d1142
NL
20wrap_in(em): //div[contains(@class,'captionsubimage')]/span
21test_url: http://www.domusweb.it/en/design/in-praise-of-lost-time/