git.immae.eu Git - github/wallabag/wallabag.git/blame - inc/3rdparty/site_config/standard/blog.sina.com.cn.txt
Merge pull request #391 from Newinx/master
# Sina blog, the most popular blog host in China.
# Its source code is horrible.
#
# Issue:
# Only the first image in the article is displayed.
# The remaining images are replaced with a 1x1 transparent GIF by the Sina blog host.
#

title://*[contains(@class,'titName SG_txta')]
author://*[contains(@id,'ownernick')]
date://*[contains(@class,'time SG_txtc')]
body://div[contains(@class,'articalContent')]

# Remove redundant spans whose class starts with "MASS"
# Example: <span class="MASSf21674ffeef7"></span>
strip://span[contains(@class,'MASS')]

# Remove comments
strip://div[contains(@class,'allComm')]

# Remove hidden text and links
strip://ins

tidy:no
convert_double_br_tags:yes
test_url: http://blog.sina.com.cn/s/blog_5054769e0102dtja.html