]>
Commit | Line | Data |
---|---|---|
ac4d1142 NL |
1 | title: //div[@id='contentheader']/h1\r |
2 | author: //p[@class='attribution']/span[@class='author']/*\r | |
3 | # Is there a way to pull multiple authors? My XPath here is just grabbing the first\r | |
4 | \r | |
5 | date: /html/head/meta[@name="date"]/@content\r | |
6 | body: //div[@class='main-content']\r | |
7 | \r | |
8 | strip: //p[@class='byline']\r | |
9 | strip: //div[@class='img-gallery']\r | |
10 | strip: //div[@class='callout']\r | |
11 | strip: //div[@class='add-your-view']\r | |
12 | convert_double_br_tags: yes | |
13 | test_url: http://www.brookings.edu/opinions/2011/1018_cyberattack_libya_goldsmith.aspx |