]> git.immae.eu Git - github/wallabag/wallabag.git/blob - inc/3rdparty/site_config/standard/blog.sina.com.cn.txt
Merge branch 'dev' of github.com:wallabag/wallabag into dev
[github/wallabag/wallabag.git] / inc / 3rdparty / site_config / standard / blog.sina.com.cn.txt
1 # Sina blog, the most popular blog host in China.
2 # Its source code is horrible.
3 #
4 # Issue:
5 # Only the first image in the article is displayed.
6 # The rest images are replace by a 1x1 transparent gif by sina blog host.
7 #
8
9 title://*[contains(@class,'titName SG_txta')]
10 author://*[contains(@id,'ownernick')]
11 date://*[contains(@class,'time SG_txtc')]
12 body://div[contains(@class,'articalContent')]
13
14 # Remove redundant content which has span class start with "MASS"
15 # Example <span class="MASSf21674ffeef7"></span>
16 strip://span[contains(@class,'MASS')]
17
18 # Remove comment
19 strip://div[contains(@class,'allComm')]
20
21 # Remove hiden text and link
22 strip://ins
23
24 tidy:no
25 convert_double_br_tags:yes
26 test_url: http://blog.sina.com.cn/s/blog_5054769e0102dtja.html