AffiNorm: A sample of 1,000 PubMed affiliation strings annotated with ROR, GRID and Wikidata identifiers
Dataset Description |
There are two files in this dataset.
AffiNorm contains 1,001 rows, including one header row, randomly sampled from MapAffil 2018 Dataset ([**https://doi.org/10.13012/B2IDB-2556310_V1**](https://databank.illinois.edu/datasets/IDB-2556310)). Each row in the file corresponds to a particular author on a particular PubMed record, and contains the following 26 columns, comma-delimited. All columns are ASCII, except city which contains Latin-1. COLUMN DESCRIPTION
File 2: insVar.json
In InsVar, the data is saved in a python dictionary format. the key is the GRID identifier, for example: "grid.1001.0" (Australian National University), and the value is a list of redirected aliases strings. {"grid.1001.0": ["ANU", "ANU College", "ANU College of Arts and Social Sciences", "ANU College of Asia and the Pacific", "ANU Union", "ANUSA", "Asia Pacific Week", "Australia National University", "Australian Forestry School", "the Australian National University", ...], "grid.1002.3": ...} |
Subject |
Social Sciences |
Keywords |
PubMed; MEDLINE; Digital Libraries; Bibliographic Databases; Institution Names; Author Affiliations; Institution Name Ambiguity; Authority files |
License |
CC BY |
Corresponding Creator |
Yingjun Guan |
Downloaded |
216 times |
| Version | DOI | Comment | Publication Date |
|---|---|---|---|
| 1 | 10.13012/B2IDB-3221174_V1 | 2025-06-05 |
Contact the Research Data Service for help interpreting this log.
| Dataset | update: {"all_globus"=>[nil, true]} | 2026-01-16T15:40:48Z |
| Dataset | update: {"all_medusa"=>[nil, true]} | 2026-01-16T15:36:32Z |
| Creator | destroy: {"family_name"=>"Torvik", "given_name"=>"Vetle", "identifier"=>"0000-0002-0035-1850", "email"=>"vtorvik@illinois.edu", "is_contact"=>false, "row_position"=>3} | 2025-06-13T19:02:23Z |
| RelatedMaterial | update: {"uri"=>["", "10.13012/B2IDB-2556310_V1"], "uri_type"=>["", "DOI"], "datacite_list"=>["", "IsSupplementedBy"]} | 2025-06-05T19:36:38Z |
| RelatedMaterial | update: {"uri"=>[nil, ""], "uri_type"=>[nil, ""], "datacite_list"=>[nil, ""], "note"=>[nil, ""]} | 2025-06-05T18:07:34Z |
| Dataset | update: {"version_comment"=>[nil, ""], "subject"=>[nil, "Social Sciences"], "external_files_link"=>[nil, ""], "external_files_note"=>[nil, ""]} | 2025-06-05T18:07:34Z |