Selecting a Data Repository
A data repository is a place to archive research datasets and make them publicly available. To select an appropriate data repository, follow these steps:
| Step | Description |
|---|---|
| 1. Are you required to deposit in a certain repository? | Some funders and journals require or recommend datasets be deposited in their repositories. Check their specific requirements or contact the Library for assistance in making this determination. |
| 2. Is there a discipline-specific repository? | If you have a choice, look for commonly used repositories in your discipline. Some repositories are geared towards groups of disciplines, while others are designed for a specific kind of research. Examples: NIH Open Domain-Specific Data Sharing Repositories, Other NIH-Supported Domain-Specific Resources with limitations on submitting and/or accessing data. |
| 3. If there is no discipline-specific repository, select a generalist repository | There are several general purpose repositories that can fulfill funder and journal sharing requirements. The choice often comes down to personal preference. Generalist Repositories |
Tools for Finding Data Repositories: Databases of Data Repositories
| Index | Discipline | Description |
|---|---|---|
| Re3Data | All | Registry of Worldwide Research Data Repositories |
| Fairsharing | All | Database of data repositories and related metadata standards and polices. Recommended for identifying metadata standards when writing a Data Management and Sharing Plan (DMSP) |
| Google Dataset Search | All | A comprehensive search engine across many general and discipline repositories and government websites |
| Awesome Public Datasets | All | High-quality, curated collection of public datasets, most free, some not. |
| Data.world | All | A searchable list of open datasets from around the world |
| NIH Data Sharing Resources | Many | Three lists curated by the National Library of Medicine and highlighted in steps 2 and 3 above |
Other Resources
| Resource | Description |
|---|---|
| DataCite | Has an extensive list of subject repositories |
| Scientific Data’s Recommended Data Repositories | List of recommended data repositories by Nature’s Scientific Data journal. Developed to instruct authors where to deposit their datasets. |
| NIH Data Sharing Repositories | Three lists curated by the National Library of Medicine and highlighted in steps 2 and 3 above |
| American Heart Association Data Repositories | A list of approved data repositories in support of the recently released AHA Open Science Policy. |
| Data Repositories | Open Access Directory |
| E-Commons | For non-data needs (manuscripts, theses, dissertations, books, etc.) |
For more information about Data Repositories, see the Samuel J. Wood Library Data Preservation, Access, and Associated Timelines site.
