About the ScienceIO API

An Advanced API Platform

The ScienceIO API transforms unstructured healthcare text into structured data in real time, ready for search and analysis. With a single line of code, it identifies and extracts over 9 million healthcare concepts, clinical variables, and medical codes from text and links them to 20+ industry-standard ontologies, including UMLS, SNOMED, ICD-10, RxNorm, ChEMBL, and dbSNP.


The ScienceIO API currently accepts one input type (text) and returns one output type (JSON). Because enormous amounts of unstructured healthcare data exist only as text, the API is already powerful at this early stage. Its healthcare models are built using billions of data points.
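To make the text-in, JSON-out flow concrete, here is a minimal sketch of turning a JSON response into flat rows for search or analysis. It does not call the API. The response body and its field names (`text`, `spans`, `start`, `end`, `label`) are illustrative assumptions, not the documented ScienceIO schema; check the endpoint reference for the real field names.

```python
import json

# Hypothetical response body. The field names here are assumptions made
# for illustration only; they are not the documented ScienceIO schema.
SAMPLE_RESPONSE = """
{
  "text": "Patient was prescribed metformin for type 2 diabetes.",
  "spans": [
    {"start": 23, "end": 32, "label": "medication"},
    {"start": 37, "end": 52, "label": "condition"}
  ]
}
"""


def spans_to_rows(response_json):
    """Convert a JSON response string into a list of flat row dicts."""
    payload = json.loads(response_json)
    text = payload["text"]
    return [
        # Slice the original text with each span's character offsets.
        {"mention": text[span["start"]:span["end"]], "label": span["label"]}
        for span in payload["spans"]
    ]


for row in spans_to_rows(SAMPLE_RESPONSE):
    print(row["label"], "->", row["mention"])
```

Rows in this shape can be loaded directly into a table or a search index, which is the kind of downstream use the structured output is intended for.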

Some of the ways you can use the API include:

  • Identifying or redacting protected health information (PHI)
  • Finding complex patient attributes such as genetic biomarkers
  • Analyzing clinical trial records and protocols to understand trial recruitment strategies
  • Extracting data from electronic health records (EHRs) for segmentation and outcomes analyses
  • Searching for and monitoring new inventions in patent databases
  • Mapping the progress of novel vaccine development
  • Tracking the scientific, clinical, and/or commercial development of therapeutics
  • Connecting data to perform powerful queries, such as finding potential drug targets for specific disease settings


  • The API is HITRUST certified, SOC 2-compliant, and HIPAA-compliant
  • Customer data is never compromised, re-used, or exposed


ScienceIO's API currently has the following endpoints available:

Web App

ScienceIO's Analyze - Web App makes the identify_phi and redact_phi endpoints accessible to anyone. Use the app to make API calls to these endpoints without writing any code. Support for the structure endpoint is coming soon.
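The difference between identifying and redacting PHI can be illustrated with a short, generic sketch: identification yields character spans, and redaction replaces those spans with a mask. The code below does not call identify_phi or redact_phi and makes no claim about their actual request or response formats. The `redact` helper, the span tuples, and the `[REDACTED]` mask are all hypothetical.

```python
def redact(text, spans, mask="[REDACTED]"):
    """Replace each (start, end) character span in text with a mask.

    Hypothetical helper for illustration; spans are end-exclusive.
    """
    pieces = []
    cursor = 0
    for start, end in sorted(spans):
        if start < cursor:
            raise ValueError("overlapping spans are not handled in this sketch")
        pieces.append(text[cursor:start])  # keep text before the span
        pieces.append(mask)                # replace the span itself
        cursor = end
    pieces.append(text[cursor:])           # keep any trailing text
    return "".join(pieces)


note = "Jane Doe was seen on 2021-03-04."
phi_spans = [(0, 8), (21, 31)]  # made-up spans covering the name and the date
print(redact(note, phi_spans))
```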

Minimum Requirements

  • Python v3.7 or above
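A quick way to confirm that an environment meets this requirement is to compare the interpreter version against the minimum. This is a minimal sketch using only the standard library; the `meets_minimum` helper name is introduced here for illustration.

```python
import sys

MIN_PYTHON = (3, 7)  # minimum version stated in the requirements above


def meets_minimum(version_info=sys.version_info, minimum=MIN_PYTHON):
    """Return True if the (major, minor) version is at least the minimum."""
    return tuple(version_info[:2]) >= minimum


if not meets_minimum():
    print("Python 3.7 or above is required; found", sys.version.split()[0])
```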
