|Related Article||Torvik VI. MapAffil: A bibliographic tool for mapping author affiliation strings to cities and their geocodes worldwide. D-Lib Magazine 2015; 21 (11/12). 10p|
|Related Article||Palmblad M, Torvik VI. Spatiotemporal analysis of tropical disease research combining Europe PMC and affiliation mapping web services. Tropical medicine and health. 2017 Dec;45(1):33|
|Related Article||Mishra S, Fegley BD, Diesner J, Torvik VI (2018) Self-citation is the hallmark of productive authors, of any gender. PLoS ONE 13(9): e0195773. https://doi.org/10.1371/journal.pone.0195773|
MapAffil 2016 dataset -- PubMed author affiliations mapped to cities and their geocodes worldwide. Prepared by Vetle Torvik 2018-04-05
The dataset comes as a single tab-delimited Latin-1 encoded file (only the City column uses non-ASCII characters), and should be about 3.5GB uncompressed.
• How was the dataset created?
• Affiliations are linked to a particular author on a particular article. Prior to 2014, NLM recorded the affiliation of the first author only.
• Affiliations are pre-processed (e.g., transliterated into ASCII from UTF-8 and html) so they may differ (sometimes a lot; see PMID 27487542) from PubMed records.
• All affiliation strings where processed using the MapAffil procedure, to identify and disambiguate the most specific place-name, as described in:
• Look for Fig. 4 in the following article for coverage statistics over time:
• The code and back-end data is periodically updated and made available for query by PMID at Torvik Research Group
• What is the format of the dataset?
1. PMID: positive non-zero integer; int(10) unsigned
|Keywords||PubMed, MEDLINE, Digital Libraries, Bibliographic Databases; Author Affiliations; Geographic Indexing; Place Name Ambiguity; Geoparsing; Geocoding; Toponym Extraction; Toponym Resolution|
|Funder||U.S. National Institutes of Health (NIH) - Grant: P01AG039347|
|Funder||U.S. National Science Foundation (NSF) - Grant: 1348742|
|Corresponding Creator||Vetle I. Torvik|