CENTER FOR CLINICAL OBSERVATIONAL INVESTIGATIONS

The National Library of Medicine (NLM) provides information about clinical datasets. Inclusion on an NLM Web page does not imply endorsement of, or agreement with, the contents by NLM or the National Institutes of Health. Learn more: NLM Web Policies (nih.gov).

Datasets

Dataset List

Last updated Dec 12, 2024

All of Us

The All of Us Research Program is a large-scale United States based research program that began nationwide enrollment in May 2018 and intends to recruit more than one million participants. All of Us integrates electronic health records (EHR) data with survey questionnaires, genomic data and wearable device data to develop a diverse, information rich database that serves as a central point for many secondary research studies and reduce the need for developing individual single use study specific data collection protocols. The program includes two tiers of data access the Registered tier and the more restricted Controlled tier.

CMS VRDC

The Centers for Medicare & Medicaid Services (CMS) Virtual Research Data Center (VRDC) collection contains populated claim forms and administrative meta data describing individual providers, facilities, patients, care plans and transactions known to CMS. The data is sourced from Medicare, Medicaid, Child Health Insurance Program (CHIP) and Social Security Disability Insurance (SSDI) encounters among others.

CPRD AURUM

Clinical Practice Research Datalink (CPRD) is a research service provided by the Medicines and Healthcare products Regulatory Agency with support from the National Institute for Health and Care Research (NIHR), as part of the United Kingdom (UK) Department of Health and Social Care, CPRD includes de-identified electronic health record patient-level data from a network of general practitioners from the UK.

N3C COVID

The National Clinical Cohort Collaborative (N3C) COVID Enclave developed by National Center for Advancing Translational Sciences (NCATS) systematically and regularly collects data derived from the electronic health records of people who tested positive for COVID-19 or had related symptoms.

NEMSIS

The National Emergency Medical Services Information System (NEMSIS) is a US-based national EMS database recognized as the universal resource for patient-care information resulting from 911 emergency calls.

PEDSnet

PEDSnet is a learning health system (LHS) network comprised of multiple children's hospitals with a goal to accelerate the study of child health.

UK Biobank

The UK Biobank program is a large health and biomedical database that serves multiple retrospective, observational studies. UK Biobank includes data from over half a million participants between the ages of 40 and 69 from the United Kingdom. UK Biobank contains a combination of health, questionnaire and genetic data that is regularly updated and enriched with new data fields.