Illinois Data Bank - Dataset

Version DOI Comment Publication Date
1 10.13012/B2IDB-8114408_V1 2024-12-05

7.34 KB File
28.9 MB File

Contact the Research Data Service for help interpreting this log.

Dataset update: {"description"=>["This project investigates retraction indexing agreement among data sources: BCI, BIOABS, CCC, Compendex, Crossref, GEOBASE, MEDLINE, PubMed, Retraction Watch, Scopus, and Web of Science Core. Post-retraction citation may be partly due to authors’ and publishers' challenges in systematically identifying retracted publications. To investigate retraction indexing quality, we investigate the agreement in indexing retracted publications between 11 database sources, restricting to their coverage, resulting in a union list of 85,392 unique items. We also discuss common errors in indexing retracted publications. Our results reveal low retraction indexing agreement scores, indicating that databases widely disagree on indexing retracted publications they cover, leading to a lack of consistency in what publications are identified as retracted. Our findings highlight the need for clear and standard practices in the curation and management of retracted publications. \r\n\r\nPipeline code to get the result files can be found in the GitHub repository\r\nhttps://github.com/infoqualitylab/retraction-indexing-agreement in the ‘src’ file containing iPython notebooks:\r\n\r\nThe ‘unionlist_completed-ria_2024-07-09.csv’ file has been redacted to remove proprietary data, as noted below in README.txt. Among our sources, data is openly available only for Crossref, PubMed, and Retraction Watch.\r\n\r\nFILE FORMATS:\r\n1) unionlist_completed-ria_2024-07-09.csv - UTF-8 CSV file\r\n21) README.txt - text file", "This project investigates retraction indexing agreement among data sources: BCI, BIOABS, CCC, Compendex, Crossref, GEOBASE, MEDLINE, PubMed, Retraction Watch, Scopus, and Web of Science Core. Post-retraction citation may be partly due to authors’ and publishers' challenges in systematically identifying retracted publications. To investigate retraction indexing quality, we investigate the agreement in indexing retracted publications between 11 database sources, restricting to their coverage, resulting in a union list of 85,392 unique items. We also discuss common errors in indexing retracted publications. Our results reveal low retraction indexing agreement scores, indicating that databases widely disagree on indexing retracted publications they cover, leading to a lack of consistency in what publications are identified as retracted. Our findings highlight the need for clear and standard practices in the curation and management of retracted publications. \r\n\r\nPipeline code to get the result files can be found in the GitHub repository\r\nhttps://github.com/infoqualitylab/retraction-indexing-agreement in the ‘src’ file containing iPython notebooks:\r\n\r\nThe ‘unionlist_completed-ria_2024-07-09.csv’ file has been redacted to remove proprietary data, as noted below in README.txt. Among our sources, data is openly available only for Crossref, PubMed, and Retraction Watch.\r\n\r\nFILE FORMATS:\r\n1) unionlist_completed-ria_2024-07-09.csv - UTF-8 CSV file\r\n2) README.txt - text file"]} 2024-12-06T00:44:36Z