Skip to content
@Medical-Event-Data-Standard

Medical Event Data Standard

A minimal, ML-oriented data standard for medical record data to improve reproducibility, robustness, and computational performance.

Medical Event Data Standard

This organization contains GitHub Repositories for the Medical Event Data Standard (MEDS), a simple dataset schema for machine learning over electronic health record (EHR) data. Unlike existing tools, pipelines, or common data models, MEDS is a minimal standard designed for maximum interoperability across datasets, existing tools, and model architectures. By providing a simple standardization layer between datasets and model-specific code, MEDS can help make machine learning research for EHR data dramatically more reproducible, robust, computationally performant, and collaborative. Alongside this report, we also release several existing integrations with models, datasets, and tools, and will work actively with the community going forward for further adoption and use. See our draft proposal for more details, and please leave comments or questions via github issues to help us improve this effort!

Software Ecosystem

Project Type Documentation URL Repository URL Paper URL Description
Core MEDS Core GitHub GitHub OpenReview A data standard and community for building and sharing EHR machine learning tools
MEDS-Reader Package Docs GitHub arXiv An optimized Python package for efficient EHR data processing achieving 10-100x improvements in memory, speed, and disk usage
MEDS-Transforms Package GitHub A set of functions and scripts for extraction to and transformation/pre-processing of MEDS-formatted data.
MEDS-Tab Package Docs GitHub A library designed for automated tabularization, data preparation with aggregations and time windowing.
ACES Package Docs GitHub arXiv A package and configuration language for reproducible extraction of task cohorts for machine learning over event-stream datasets
MEDS-Torch Package Docs GitHub Advancing healthcare machine learning through flexible, robust, and scalable sequence modeling tools.
MEDS-Evaluation Package GitHub Evaluation pipeline for MEDS.
MEDS-ETL Package GitHub Efficient ETL that supports OMOP, MIMIC, eICU, PyHealth.
FEMR Package GitHub A Python package for manipulating longitudinal EHR data for machine learning, with a focus on supporting the creation of foundation models and verifying their presumed benefits in healthcare.
MEDS-DEV Benchmark GitHub A benchmark for evaluating the performance of machine learning models on MEDS-formatted data.

Pretrained Models

Datasets / Benchmarks

Coming Soon...

Tools that are planned to be compatible with MEDS:

Pinned Loading

  1. meds meds Public

    Schema definitions and Python types for Medical Event Data Standard, a standard for medical event data such as EHR and claims data

    Python 49 4

  2. MEDS-DEV MEDS-DEV Public

    The MEDS Decentralized Extensible Validation (MEDS-DEV) Benchmark: Establishing Reproducibility and Comparability in ML for Health

    Python 24 7

  3. MIMIC_IV_MEDS MIMIC_IV_MEDS Public

    The MIMIC-IV MEDS ETL

    Python 2 2

  4. meds_testing_helpers meds_testing_helpers Public

    Testing, benchmarking, and synthetic data generation helpers for MEDS tools, pipelines, and models.

    Python

  5. ETL_MEDS_Template ETL_MEDS_Template Public template

    A template repository for a MEDS-Transforms powered extraction pipeline for a custom dataset.

    Python 2 1

  6. meds_etl meds_etl Public

    A collection of ETLs from common data formats to Medical Event Data Standard

    Python 28 7

Repositories

Showing 10 of 13 repositories
  • MIMIC_IV_MEDS Public

    The MIMIC-IV MEDS ETL

    Medical-Event-Data-Standard/MIMIC_IV_MEDS’s past year of commit activity
    Python 2 MIT 2 3 (1 issue needs help) 0 Updated Mar 26, 2025
  • eICU_MEDS Public

    An eICU Extraction template for MEDS

    Medical-Event-Data-Standard/eICU_MEDS’s past year of commit activity
    Python 3 MIT 2 1 0 Updated Mar 26, 2025
  • ETL_MEDS_Template Public template

    A template repository for a MEDS-Transforms powered extraction pipeline for a custom dataset.

    Medical-Event-Data-Standard/ETL_MEDS_Template’s past year of commit activity
    Python 2 MIT 1 6 1 Updated Mar 26, 2025
  • medical-event-data-standard.github.io Public

    GitHub Pages site for MEDS

    Medical-Event-Data-Standard/medical-event-data-standard.github.io’s past year of commit activity
    TypeScript 2 0 47 (3 issues need help) 0 Updated Mar 25, 2025
  • meds_etl Public

    A collection of ETLs from common data formats to Medical Event Data Standard

    Medical-Event-Data-Standard/meds_etl’s past year of commit activity
    Python 28 Apache-2.0 7 6 1 Updated Mar 21, 2025
  • meds_etl_cpp Public

    Optimized algorithms for meds_etl implemented in C++

    Medical-Event-Data-Standard/meds_etl_cpp’s past year of commit activity
    C++ 0 Apache-2.0 0 0 0 Updated Mar 21, 2025
  • meds_testing_helpers Public

    Testing, benchmarking, and synthetic data generation helpers for MEDS tools, pipelines, and models.

    Medical-Event-Data-Standard/meds_testing_helpers’s past year of commit activity
    Python 0 MIT 0 13 (1 issue needs help) 0 Updated Mar 19, 2025
  • meds Public

    Schema definitions and Python types for Medical Event Data Standard, a standard for medical event data such as EHR and claims data

    Medical-Event-Data-Standard/meds’s past year of commit activity
    Python 49 Apache-2.0 4 10 0 Updated Mar 15, 2025
  • MEDS-DEV Public

    The MEDS Decentralized Extensible Validation (MEDS-DEV) Benchmark: Establishing Reproducibility and Comparability in ML for Health

    Medical-Event-Data-Standard/MEDS-DEV’s past year of commit activity
    Python 24 MIT 7 41 (7 issues need help) 1 Updated Mar 13, 2025
  • .github Public
    Medical-Event-Data-Standard/.github’s past year of commit activity
    0 3 0 0 Updated Mar 10, 2025

Most used topics

Loading…