diff options
author | Nicolas LÅ“uillet <nicolas@loeuillet.org> | 2014-07-13 10:15:40 +0200 |
---|---|---|
committer | Nicolas LÅ“uillet <nicolas@loeuillet.org> | 2014-07-13 10:15:40 +0200 |
commit | 4e067ceabd705201a16b4c92cf4b23f3b990326c (patch) | |
tree | 939f3a8e5ff3ab9ee414a57a895d3e78e1d46ce3 /inc/3rdparty/site_config/standard/blogs.scientificamerican.com.txt | |
parent | 58dbe103889148def78b0fc8744d3f94c56a1561 (diff) | |
download | wallabag-4e067ceabd705201a16b4c92cf4b23f3b990326c.tar.gz wallabag-4e067ceabd705201a16b4c92cf4b23f3b990326c.tar.zst wallabag-4e067ceabd705201a16b4c92cf4b23f3b990326c.zip |
updated specific configuration for parsing
Diffstat (limited to 'inc/3rdparty/site_config/standard/blogs.scientificamerican.com.txt')
-rwxr-xr-x[-rw-r--r--] | inc/3rdparty/site_config/standard/blogs.scientificamerican.com.txt | 28 |
1 files changed, 14 insertions, 14 deletions
diff --git a/inc/3rdparty/site_config/standard/blogs.scientificamerican.com.txt b/inc/3rdparty/site_config/standard/blogs.scientificamerican.com.txt index a7d15081..2102015d 100644..100755 --- a/inc/3rdparty/site_config/standard/blogs.scientificamerican.com.txt +++ b/inc/3rdparty/site_config/standard/blogs.scientificamerican.com.txt | |||
@@ -1,16 +1,16 @@ | |||
1 | # meta data | 1 | # meta data |
2 | title://h1[@class = 'postTitle'] | 2 | title://h1[@class = 'postTitle'] |
3 | author:substring-before(substring-after(//span[@class = 'byline'],'By '),'|') | 3 | author:substring-before(substring-after(//span[@class = 'byline'],'By '),'|') |
4 | date://span[@class = 'datestamp'] | 4 | date://span[@class = 'datestamp'] |
5 | 5 | ||
6 | #body content | 6 | #body content |
7 | body://div[@id = 'singleBlogPost'] | 7 | body://div[@id = 'singleBlogPost'] |
8 | 8 | ||
9 | #reclaim author info | 9 | #reclaim author info |
10 | move_into(//div[@id = 'singleBlogPost'])://div[@id = 'aboutAuthorDiv'] | 10 | move_into(//div[@id = 'singleBlogPost'])://div[@id = 'aboutAuthorDiv'] |
11 | strip://p[@class = 'moreLink mobileHide'] | 11 | strip://p[@class = 'moreLink mobileHide'] |
12 | 12 | ||
13 | #cleanup comments, there might be some open <div> sections | 13 | #cleanup comments, there might be some open <div> sections |
14 | strip://div[@id = 'comments2'] | 14 | strip://div[@id = 'comments2'] |
15 | strip://h3[a[@href = '#add-comment']] | 15 | strip://h3[a[@href = '#add-comment']] |
16 | test_url: http://blogs.scientificamerican.com/a-blog-around-the-clock/2012/07/10/science-blogs-definition-and-a-history/ \ No newline at end of file | 16 | test_url: http://blogs.scientificamerican.com/a-blog-around-the-clock/2012/07/10/science-blogs-definition-and-a-history/ \ No newline at end of file |