Guides

Mission and Commitment to Data Sharing

As a trustworthy repository, the Illinois Data Bank commits to centralize, preserve, and provide reliable access to research data created by affiliates of the University of Illinois Urbana-Champaign. Managed by the Research Data Service at the University Library, it leverages the Library’s digital preservation system and operates within a robust policy framework that articulates the University's commitment to providing persistent and reliable access to research data.

File Only Publication Delay	Metadata and File Publication Delay
You receive an active DOI. You will receive a DOI, and the link will forward to the Illinois Data Bank page for your dataset.	Your DOI is saved, but the link will fail. You will receive a DOI link to place in your publication, but the link will fail until the release date you selected.
Your dataset record is discoverable. Information for your dataset in the Illinois Data Bank will be publicly visible through several search engines and other sources.	Your dataset record is not discoverable. Your dataset will be stored in the Illinois Data Bank, but is not discoverable or visible until the release date you selected.
Dataset files cannot be accessed or seen. Although the record for your dataset is publicly visible, your data files will not be made available until the release date you selected.	Dataset files cannot be accessed or seen. The record for your dataset is not visible, nor are your data files available until the release date you selected.

File naming best practice	Examples
Use YYYY-MM-DD format	project01_2019-01-01
Use combination of letters, numbers, underscores, and hyphens	project01_raw-data.json
Use standard file extensions to indicate file type	myproject.txt
Use leading zeroes for version	name001.csv name010.csv name101.csv
Keep file name short	not_too_long.xml
Use alphanumericals	data_champaign_il_2019-01-01.csv
Use underscore between words	data_location_time.csv
Use lowercase (some systems are case sensitive)	all_lowercase_would-be-safer.tiff

Criteria	Supplemental Material	Illinois Data Bank
Size limitations on files	Often lower than 100 MB	2 Terabytes
Format limitations on files	Restrictions and requirements are possible. Often PDF only.	Any format is accepted
Digital Object Identifier (DOI)	Unlikely to be available	Automatically provided
Metadata available, exportable, and searchable	Unlikely to be available	Automatically provided
Access restriction / findability	Can be hidden by paywalls or publisher-chosen access controls.	Data will be freely available (after release from any embargoes you choose to assign).
Download statistics	Unlikely to be available	Automatically provided
Storage infrastructure	Stability and suitability for long term storage is usually not be guaranteed	Stable preservation environment that complies with many funder and publisher requirements
Cost to publish	Sometimes fee-based	No cost for University of Illinois researchers
Guaranteed availability	Unlikely to be guaranteed for multiple years or regularly reviewed	Every dataset is guaranteed to be available for a minimum of 5 years, with longer storage likely. Regular review and curation will ensure continued availability and preservation best practices as time passes.
Connections to additional papers that uses the data and other materials	Sometimes available	We link your dataset to articles, code, theses, other data sources, etc.
Standard or custom licensing statements	Unlikely to be available	CC0 and CC BY are standard offerings; you can also upload a customized license statement.

Desirable Characteristics of Repositories	Illinois Data Bank
Free and Easy Access	Supports broad, equitable, and open access Provides free access to dataset and metadata after publication
Clear Use Guidance	Provides clear policy describing access and use Offers three license options: CC0, CC-BY, and custom license
Risk Management	Is intended for unrestricted data
Retention Policy	Issues policy for data retention
Long-term Organizational Sustainability	Describes a plan for long-term management of data in its preservation policy Has contingency plans to ensure data are available and maintained during and after unforeseen events
Unique Persistent Identifiers	Assigns citable, DOI to datasets via DataCite DOI points to persistent landing page
Metadata	Follows DataCite metadata schema Enables discovery, reuse, and citation of datasets
Curation and Quality Assurance	Provides curatorial review of datasets Is a sustaining member of the Data Curation Network that provides curation expertise
Broad and Measured Reuse	Displays data citation and download counts on dataset landing pages Maintains links to other related materials
Common Format	Accepts files in any formats Encourages the use of non-proprietary and widely used format
Provenance	Records changes to metadata and records in a change log Change data files only through a robust versioning process
Authentication	Uses the University’s Shibboleth provider for the authentication of data submitters Facilitates persistent identifiers (DOIs, ORCIDs)
Long-term Technical Sustainability	Describes a plan for long-term management of data in its preservation policy Builds on a stable technical infrastructure and permanent central funding

Guides

Mission and Commitment to Data Sharing

Submission Process

Log in with NetID

Describe Your Dataset

Upload Files

Upload Using Command Line Tools

Overview

What do we mean by a draft dataset?

How do I get started?

Notes:

OPTIONS: Python, cURL, or custom script

Use our sample Python file upload script. (click to expand)

Notes on using our Python script on campus systems. (click to expand)

Institute for Genomic Biology

AWS

Virtual Hosting Group at Technology Services

Execute a cURL command. (click to expand)

Create your own custom script using our API. (click to expand)

Simple Protocol

Complex Protocol

Upload Using Globus

Publish Your Dataset

Curation Process

Delay Publication (Optional)

File Only Publication Delay

Metadata and File Publication Delay

You receive an active DOI.

Your DOI is saved, but the link will fail.

Your dataset record is discoverable.

Your dataset record is not discoverable.

Dataset files cannot be accessed or seen.

Dataset files cannot be accessed or seen.

Good Data Practices

File Formats

File Naming

File naming best practice

Examples

File Grouping

Zip Files

Data Documentation

Data Licenses

CC0

CC BY

Other License

Derivative works are allowed

Attribution a legal requirement

Other CC licenses may create reuse difficulties

Request for attribution

May create reuse difficulties

Custom licensing considerations

What is copyright?

What is a license?

How does copyright law apply to research data?

Why license research data?

For more information

Frequently Asked Questions

When do I use the Illinois Data Bank vs. IDEALS?

Why to use/deposit to the Illinois Data Bank?

Criteria

Supplemental Material

Illinois Data Bank

How does Illinois Data Bank help you meet the data sharing requirements?

Desirable Characteristics of Repositories

Illinois Data Bank

References

When do I publish a dataset in the Illinois Data Bank?

What is a DOI?

What is metadata?

What is private sharing link?

How to write a data availability statement

What metrics does the Illinois Data Bank collect?

Download metrics

Other metrics

Can I change, update, or add files?

How do I download large files using Globus?

Related Materials vs. Cited By

Definitions of Terms in the Illinois Data Bank

References

Information for Developers