aboutsummaryrefslogtreecommitdiffhomepage
path: root/inc/3rdparty/site_config/standard/fnal.gov.txt
diff options
context:
space:
mode:
authorNicolas LÅ“uillet <nicolas.loeuillet@smile.fr>2014-10-10 13:33:54 +0200
committerNicolas LÅ“uillet <nicolas.loeuillet@smile.fr>2014-10-10 13:33:54 +0200
commit44d35257e805856b4913c63fcbed3c0acb64bae8 (patch)
tree11e9d276c34b1b287706cb61182bdc71729661e2 /inc/3rdparty/site_config/standard/fnal.gov.txt
parentaf8292c1de1886cd975d79f0f42df40e0bd1c5bd (diff)
parentcf8a5e1eedbed484dbcb1ddc9f7a13fc19b7a27b (diff)
downloadwallabag-44d35257e805856b4913c63fcbed3c0acb64bae8.tar.gz
wallabag-44d35257e805856b4913c63fcbed3c0acb64bae8.tar.zst
wallabag-44d35257e805856b4913c63fcbed3c0acb64bae8.zip
Merge branch 'dev'1.8.0
Diffstat (limited to 'inc/3rdparty/site_config/standard/fnal.gov.txt')
-rwxr-xr-x[-rw-r--r--]inc/3rdparty/site_config/standard/fnal.gov.txt26
1 files changed, 13 insertions, 13 deletions
diff --git a/inc/3rdparty/site_config/standard/fnal.gov.txt b/inc/3rdparty/site_config/standard/fnal.gov.txt
index 7faa6bfc..e404ccb8 100644..100755
--- a/inc/3rdparty/site_config/standard/fnal.gov.txt
+++ b/inc/3rdparty/site_config/standard/fnal.gov.txt
@@ -1,15 +1,15 @@
1title: normalize(//h1) 1title: normalize(//h1)
2 2
3author: //td/p[position()=last()]/em 3author: //td/p[position()=last()]/em
4 4
5# I swear, this is really the best way to do this 5# I swear, this is really the best way to do this
6date: normalize(//td[contains(@style, "color: #ffffff")]) 6date: normalize(//td[contains(@style, "color: #ffffff")])
7 7
8# my god, it's full of tables 8# my god, it's full of tables
9body: /table/tbody/tr[5]//table/tbody//table/tbody/tr/td 9body: /table/tbody/tr[5]//table/tbody//table/tbody/tr/td
10strip: //h1 10strip: //h1
11 11
12# the following two lines strip the byline at the end of the article (the byline is a <p> that consists of an em dash and then some text in an <em>). I have no idea why I can't just strip //p[position()=last()], but trying to do so includes a bunch of other crap in the output. 12# the following two lines strip the byline at the end of the article (the byline is a <p> that consists of an em dash and then some text in an <em>). I have no idea why I can't just strip //p[position()=last()], but trying to do so includes a bunch of other crap in the output.
13strip: //p[position()=last()]/em 13strip: //p[position()=last()]/em
14strip: //p[position()=last()]/child::text() 14strip: //p[position()=last()]/child::text()
15test_url: http://www.fnal.gov/pub/today/archive_2011/today11-11-09_MuonDepartmentReadMore.html \ No newline at end of file 15test_url: http://www.fnal.gov/pub/today/archive_2011/today11-11-09_MuonDepartmentReadMore.html \ No newline at end of file