openFDA
FDA has some of the world's most valuable data stores about human health and medicine publicly available on its web site. The data are relevant and important to a wide range of groups. But the hurdles for a developer or researcher to use FDA data can be exceedingly high. For example, someone wanting to use a structured XML document of known adverse event reports for a particular drug would have to go through many complex steps to be able to obtain accurate results.
Now, openFDA, an FDA Health Informatics initiative launched in June 2014, is making it easier for researchers, scientists, web developers, and other FDA regulatory stakeholders to access and use those datasets in an open standard format.
The project aims to create easy access to public data and a new level of openness and accountability; ensure the privacy and security of public FDA data; educate the public; and save lives.
Members of the scientific community can use openFDA to have their applications automatically query the data through application program interfaces (APIs). OpenFDA increases the efficiency and speed of accessing datasets by using cutting-edge, open-source code modules in a cloud-based environment.
Nimble and user-centered product development methodologies are being used to expand and improve the platform and ensure user friendliness of its functions. The current APIs include:
For Human Drugs*
- labeling (more than 67,000 for drugs currently on the market)
- reports on negative side effects (4.9 million since 2003)
- enforcement reports (4,000 records since 2012)
* over-the-counter and prescription drugs in the United States, including biological therapeutics and generic drugs
For Devices
- classification of 1,700 distinct types of devices organized into 16 medical specialties (6,000 records)
- registration and listing
* companies and establishments registrations (24,000 records)
* companies product listings (100,000 records)
- premarket approvals (PMAs) and approval supplements (30,000 records since 1977)
- clearances through premarket notifications (510(k)s) and granted de novo requests (141,000 records since 1976)
- recalls (9,500 records since 2002)
- adverse event reports (4.4 million records since 1991)
For Foods
- enforcement reports (8,600 records since 2012)
To date, there have been more than 20 million API calls, half of which were from outside the United States, more than 6,000 registered users, 21,000 connected systems worldwide, and dozens of new software applications that the community has built. Also, FDA scientists developed a statistics application for analyzing drug and device adverse events and drug labeling, using an open statistics package, Shiny R, that is being added to openFDA.
FDA's Scientific Computing Day Posters
September 9-10, 2019
- HetEnc: A Deep Learning Predictive Model for Multi-type Biological DatasetUsing Neural Network to Identify CDER Manufacturing Sites with Non-compliant Good Manufacturing Practice (GMP) Inspection Outcomes
- Integrating Adverse Outcome Pathways (AOPs) and High Throughput in vitro Assays for Better Risk Evaluations, a Study with Drug Induced Liver Injury (DILI)
- Utilizing Neural Network to Identify CDER Manufacturing Sites with Non-compliant Good Manufacturing Practice(GMP) Inspection Outcomes
- R Shiny Based Mobile Tools To Aid Regulatory Decisions And Program Evaluation
- Rapid Geographic Risk Assessment and Model Development Using GREAT and BRisk Applications
- HYDRA: An Open-Source Platform for Automated Mining and Visualization of Regulatory Data Using Natural Language Processing and Machine Learning
- An NLP-Based Approach for Metadata Extraction and Analysis fo FDA Guidance Documents
- Using Machine Learning on ICD-10 data to Enhance an Expert Anaphylaxis Case Definition
- Developing Methods for Image Acquisition and Image Analysis for Species-Level Identifying Food Contaminating Beetles
- A Data Anomaly Detection Tool for Site Selection at FDA
- Application Scaling Techniques on HPC Clusters
- Open Format for ION Channel Datasets from Cardiac Electrophysiology In Vitro Assays under CiPA
- Data Visualization Tool to Map CDER's Priorities and Deliverables: Role of Public-Private Partnerships
- Implementation of Galaxy to expand the use of command-line based bioinformatics analysis of genome-editing experiments
- Peridot: A Novel, Prototype, Web-based Toolbox to Conduct Comprehensive Drug Safety and Pharmacovigilance Activities
- Bloat-Ectomy: A Method for the Identification and Removal of Duplicate Text in the Bloated Notes of Electronic Health Records and Other Documents
- Advancing the Regulatory Science of Omics via Crowdsourcing on the precisionFDA Platform
- FDA’s Virtual Assistant: Utilizing Machine Learning for Automated Customer Service
- Terminology and Relationships for a Smokeless Tobacco Product Ontology
- CTP Integrated Research and Data System (CIRDS)
More information about the 2019 Scientific Computing Day Symposium
Resources For You
- OpenFDA: Innovative Initiative Opens Door to Wealth of FDA's Publicly Available Data
- OpenFDA Provides Ready Access to Recall Data
- FDA Leverages Big Data Via Cloud Computing
- Providing Easy Access to Medical Device Reports Submitted to FDA Since the Early 1990s
- OpenFDA Makes Medical Device-Related Data Easier to Access and Use