| Commit | Line | Data |
|---|---|---|
| ac4d1142 NL | 1 | # need to find a way to eliminate <span> content for "related content" without eliminating important content |
| | 2 | |
| | 3 | convert_double_br_tags: [yes] |
| | 4 | #body: //div[@id='leftside'] |
| | 5 | title: //h1 |
| | 6 | title: //h2 |
| | 7 | Author: substring-after(//h4, 'By ') |
| | 8 | Author: substring-after(//h4, 'By: ') |
| | 9 | #Strip: //span |
| | 10 | strip_id_or_class: morefromcat |
| | 11 | strip_id_or_class: mostpopular |
| | 12 | strip_id_or_class: articlepagination |
| | 13 | strip_id_or_class: toolbar |
| | 14 | body: //div[@id='zmodcontent'] |
| | 15 | single_page_link: //li[@class='onepage'] //a[contains (@href, 'printer.php')] |
| | 16 | test_url: http://www.menshealth.com/mhlists/pursuit_of_happiness/index.php |
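
The config above can be read as a list of `directive: value` lines, where lines starting with `#` are comments and a directive such as `title` or `strip_id_or_class` may repeat. The following is a minimal sketch of that reading, not the parser the consuming tool actually uses: `parse_site_config` and `SAMPLE_CONFIG` are hypothetical names, and lowercasing the directive names (so `Author` and `author` collapse together) is an assumption made for the illustration.

```python
from collections import defaultdict

# The active and commented-out lines from the config shown above.
SAMPLE_CONFIG = """\
convert_double_br_tags: [yes]
#body: //div[@id='leftside']
title: //h1
title: //h2
Author: substring-after(//h4, 'By ')
Author: substring-after(//h4, 'By: ')
#Strip: //span
strip_id_or_class: morefromcat
strip_id_or_class: mostpopular
strip_id_or_class: articlepagination
strip_id_or_class: toolbar
body: //div[@id='zmodcontent']
single_page_link: //li[@class='onepage'] //a[contains (@href, 'printer.php')]
test_url: http://www.menshealth.com/mhlists/pursuit_of_happiness/index.php
"""


def parse_site_config(text):
    """Group directive values by name, keeping the order they appear in.

    Hypothetical helper: assumes '#' starts a comment line and that the
    first ':' on a line separates the directive name from its value.
    """
    directives = defaultdict(list)
    for raw_line in text.splitlines():  # splitlines() also handles CRLF endings
        line = raw_line.strip()
        if not line or line.startswith("#"):
            continue  # skip blank lines and commented-out directives
        name, sep, value = line.partition(":")
        if not sep:
            continue  # not a 'directive: value' line
        # Assumption: directive names are treated case-insensitively.
        directives[name.strip().lower()].append(value.strip())
    return dict(directives)


if __name__ == "__main__":
    for name, values in parse_site_config(SAMPLE_CONFIG).items():
        print(name, values)
```

Splitting on the first colon only keeps values such as the `test_url` and the `'By: '` argument intact, and collecting values into lists preserves the order of repeated directives, which matters if they are tried in sequence as fallbacks.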