Code related to a working paper that was first presented at the AFSP Annual Meeting in Paris, 2013. See Section 1 of this paper and its appendix, or read the HOWTO below for a technical summary.
- June 2014 – Major update
  * Updated working paper
  * Added new appendix
  * Added five media scrapers
  * Updated Google Trends data
- June 2013 – First release
The scraper currently collects slightly over 6,300 articles from
- ecrans.fr (including articles from liberation.fr)
- lemonde.fr (first lines only for paid content)
- lesechos.fr (left-censored to December 2011)
- lefigaro.fr (first lines only for paid content)
- numerama.com (including old articles from ratiatum.com)
- zdnet.fr
The entry point is `make.r`:
- `get_articles` will scrape the news sources (adjust the page counters to the current website search results to update the data)
- `get_corpus` will extract all entities and list the most common ones (set the minimum frequency with `threshold`; defaults to 10)
- `get_ranking` will export the top 15 central nodes of the co-occurrence network to the `tables` folder, in Markdown format
- `get_network` returns the co-occurrence network, optionally trimmed to its top quantile of weighted edges (set with `threshold`; defaults to 0)
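A minimal call sequence, assuming the function names above are defined by `make.r` and that `threshold` is passed as a plain argument (the exact signatures may differ):

```r
# Hypothetical call sequence; argument names and return values are assumptions
# based on the descriptions above, not copied from make.r.
source("make.r")

get_articles()                             # scrape the news sources
corpus <- get_corpus(threshold = 10)       # keep entities appearing at least 10 times
get_ranking(corpus)                        # export top 15 central nodes to tables/
net <- get_network(corpus, threshold = 0)  # full co-occurrence network, untrimmed
```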
The corpus is exported to three CSV files:

- `corpus.terms.csv` – a list of all entities, ordered by their raw counts
- `corpus.freqs.csv` – a list of the entities found in each article
- `corpus.edges.csv` – a list of undirected weighted network ties
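The edge list can be loaded back into an R session to rebuild the network. A short sketch, assuming the column layout of `corpus.edges.csv` (source, target, weight):

```r
# Sketch: rebuild the co-occurrence network from the exported files.
# Column names in the CSV files are assumptions.
library(igraph)

terms <- read.csv("corpus.terms.csv", stringsAsFactors = FALSE)
edges <- read.csv("corpus.edges.csv", stringsAsFactors = FALSE)
net   <- graph_from_data_frame(edges, directed = FALSE)
```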
- The weighting scheme is inversely proportional to the number of entity pairs in each article.
- The weighted degree formula follows Tore Opsahl and uses an alpha parameter of 1 (both points are illustrated in the sketch below).
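A small sketch of both points, with hypothetical column names (`i`, `j`, `weight`). Each entity pair in an article gets a weight of 1 divided by the number of pairs in that article; Opsahl's weighted degree is k_i × (s_i / k_i)^alpha, which reduces to the node strength s_i when alpha = 1:

```r
# Sketch only: column and helper names are hypothetical, not taken from the code.

# Edge weights for one article: each entity pair gets 1 / (number of pairs).
article_edges <- function(entities) {
  pairs <- t(combn(sort(unique(entities)), 2))
  data.frame(i = pairs[, 1], j = pairs[, 2], weight = 1 / nrow(pairs))
}

# Opsahl's weighted degree: k_i * (s_i / k_i)^alpha, where k_i is the degree
# and s_i the strength (sum of edge weights). With alpha = 1 this equals s_i.
weighted_degree <- function(edges, alpha = 1) {
  nodes <- unique(c(edges$i, edges$j))
  sapply(nodes, function(n) {
    at_n <- edges$i == n | edges$j == n
    k <- sum(at_n)                    # degree of node n
    s <- sum(edges$weight[at_n])      # strength of node n
    if (k == 0) 0 else k * (s / k)^alpha
  })
}

weighted_degree(article_edges(c("A", "B", "C")))
```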
