ContentMine Demo

Exploring aggregated counts of facts over dictionaries

This demo uses sample code from a ContentMine-API dump to Zenodo and can be recreated with this source code.

The raw data consists of a dump from the ContentMine-API based on a query for "Zika" on EuropePMC, which can be downloaded from Zenodo. It has been preprocessed and the features for each visualization have been extracted, see here for the source code.

The data consists of 34039 individual facts extracted from 503 publications in the range of April 1998 to September 2016 (as indicated by first publication date by EuropePMC). Facts are extracted by matching the fulltext with our dictionaries, a collection of controlled vocabularies of words interesting for a domain or a specific research question.

The upper time series shows the absolute counts of facts, aggregated by their source dictionary. The lower time series shows the relative share of facts per dictionary, as a fraction of the daily total count.