View the data and services available from the Institute of Informatics, Data Science and Biostatistics (I2DB), then review the process to submit a service request.

Data brokerage and warehouse

Clinical research data

I2DB’s ad-hoc chart extraction services provide data for use in patient cohort identification, or for prospective/retrospective study data capture.

Clinical research data warehouse

I2DB data brokers can also fulfill requests for large amounts of data, data registries or data from third-party data vendors.

Administrative data

Administrative Data Core Services provides data and services to Washington University faculty, staff, and students. Non-WashU members of the Institute of Clinical and Translational Sciences (ICTS) who are interested in working with administrative data are encouraged to collaborate with WashU investigators.

Administrative data and services include: 

  • Housing and maintaining a repository of administrative data and information about the data sources 
  • Training investigators on the ethical and secure use of administrative data 
  • Providing leadership on study design and analysis of administrative data 
  • Facilitating collaboration among interdisciplinary investigators 
  • Advancing the effective use of data to inform researchers, policymakers and the public about health and healthcare 

Learn more about Administrative Data Core Services »

Please contact before completing the intake form. 

Data registries

Work with I2DB to create a data registry where you can collect a standard set of variables for large patient cohorts.

Feasibility counts

Need to know how many people meet your research criteria? Feasibility counts can help you answer specific questions for your research planning (i.e. how many diabetics are on a particular medication who are also over 18 years of age).

Biostatistics consultation

I2DB’s Center for Biostatistics and Data Science provides biostatistical consultation in a wide range of areas including: 

  • Statistical design of experiments and clinical trials  
  • Protocol development 
  • Database management 
  • Analysis of data and interpretation of results 

Biostatisticians can aid in preparing grant applications, including discussions and written sections on study design, sample size calculations, randomization schemes, and computer resources. This includes support for the Biostatistics, Epidemiology, and Research Design (BERD) function of the Institute of Clinical and Translational Sciences.

Learn more about the Biostatistics Consulting Service » 

REDCap surveys and databases

Research Electronic Data Capture (REDCap)

REDCap is a secure web application for building and managing online surveys and databases that can be used to collect any type of research data. Consultants are available for REDCap build or assistance. REDCap training is offered by trained scientists at Becker Medical Library. 

Smart on FHIR via REDCap 

This feature allows structured clinical data to be pulled in real-time from the EHR to REDCap. In order to use the Clinical Data Pull functionality, a REDCap project must first have the Clinical Data Pull module enabled on the Project Setup page. Only a REDCap administrator may enable the module. Once enabled, any project user with CDP Mapping privileges will be able to navigate to the CDP Mapping page in the project to begin to map EHR data fields to fields in the REDCap project.  

Learn more about REDCap »

EPIC, third party vendor collaboration

Some projects may require collaboration with the Epic1 research team or other third party vendors. Examples include data from smartforms that are built in Epic, or building a link and putting it in Epic to route them to an appropriate website.

Synthetic data via MDClone

MDClone is a free, secure, self-service platform for building queries and downloading computationally derived (“synthetic”) data from the institute’s research data core. Since the data do not contain protected health information, their use is not classified as human subject research. MDClone workshops and virtual office hours are offered by trained scientists at Becker Medical Library.

Find more MDClone training materials or request access.

Learn more about MDClone »

TriNetX for planning clinical trials

TriNetX is a federated research data network used to assess recruitment potential at a clinical trial site. The web-based application supports local research study cohort estimation as well as enables collaboration with peer institutions for clinical trials and multi-site studies.

Becker Medical Library
Data Management and Sharing Service

In collaboration with I2DB, Bernard Becker Medical Library provides data management and sharing training and consultations to the School of Medicine community.


Informatics Core Services

Albert M. Lai, PhD, FACMI, FAMIA

Albert M. Lai, PhD, FACMI, FAMIA

Deputy Director, Institute for Informatics, Data Science and Biostatistics (I2DB)
Chief Research Information Officer, School of Medicine
Professor of Medicine, Division of General Medical Sciences
Professor of Computer Science and Engineering, School of Engineering and Applied Science

Mackenzie Hofford, MD

Mackenzie Hofford, MD

Associate Chief Research Information Officer, School of Medicine
Assistant Professor of Medicine Division of General Medicine School of Medicine

Sherry Lassa-Claxton, MS

Sherry Lassa-Claxton, MS

Director of Research IT for the Chief Research Information Officer

Biostatistics Consulting Service

Karen Steger-May

Karen Steger-May

Director, Biostatistics Consulting Service
Medical Informaticist III

Administrative Data Core Services

Jing Li, MD, DrPH, MS

Jing Li, MD, DrPH, MS

Director, Administrative Data Core Services
Associate Professor of Medicine, Division of General Medical Sciences