ARISE

MAST

Resources

Tools, datasets, and guides to help you evaluate and benchmark medical AI systems using the MAST framework.

Training Data & Datasets

Access curated clinical datasets used in the MAST benchmark suite. All datasets are de-identified, IRB-approved, and formatted for direct use in evaluation pipelines.

Browse Datasets

Evaluation Harness

The MAST evaluation harness provides a standardized framework for running benchmarks against medical AI models. Clone the repository, configure your model endpoint, and generate reproducible evaluation results.

View Setup Guide
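
As a rough illustration of what "configure your model endpoint" and "reproducible" could mean in practice, the sketch below pins every run setting in one immutable config and derives a stable fingerprint from it. This is not the MAST harness itself: `EvalConfig`, its fields, `config_fingerprint`, and the example endpoint URL are all hypothetical names chosen for illustration. Refer to the setup guide for the real configuration format.

```python
import hashlib
import json
from dataclasses import asdict, dataclass


# Hypothetical config object; the real harness may use different fields.
@dataclass(frozen=True)
class EvalConfig:
    model_endpoint: str        # URL of the model under evaluation (assumed)
    dataset: str               # name of the benchmark dataset (assumed)
    seed: int = 0              # fixed seed so sampling is repeatable
    temperature: float = 0.0   # deterministic decoding by default


def config_fingerprint(cfg: EvalConfig) -> str:
    """Return a short, stable hash of the config for tagging result files."""
    # sort_keys makes the serialization independent of field order.
    payload = json.dumps(asdict(cfg), sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()[:12]


if __name__ == "__main__":
    cfg = EvalConfig(
        model_endpoint="https://example.invalid/v1/generate",  # placeholder
        dataset="example-cases",                                # placeholder
    )
    print(config_fingerprint(cfg))
```

Storing the fingerprint alongside each result file makes it easy to confirm that two runs used identical settings before comparing their scores.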

Clinical Case Libraries

Curated case sets spanning multiple medical specialties, designed for benchmarking clinical reasoning, diagnostic accuracy, and safety. Each case is authored and validated by board-certified physicians.

Explore Cases

API & Integration

Integrate MAST evaluations into your CI/CD pipelines, model training workflows, or internal quality dashboards. Our API supports batch evaluation, webhook notifications, and structured result export.

View API Docs
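
To make batch evaluation and structured result export more concrete, here is a minimal client-side sketch: it splits a list of cases into fixed-size batches and serializes result records as JSON Lines. It makes no calls to the MAST API. The helper names (`batched`, `to_jsonl`) and the record fields (`case_id`, `score`) are assumptions for illustration, so consult the API docs for the actual request and response schemas.

```python
import json
from typing import Any, Dict, Iterator, List


def batched(items: List[Any], size: int) -> Iterator[List[Any]]:
    """Yield consecutive slices of `items`, each with at most `size` elements."""
    if size < 1:
        raise ValueError("size must be at least 1")
    for start in range(0, len(items), size):
        yield items[start:start + size]


def to_jsonl(records: List[Dict[str, Any]]) -> str:
    """Serialize records as JSON Lines: one JSON object per line."""
    # sort_keys keeps the output stable across runs, which helps diffing.
    return "\n".join(json.dumps(record, sort_keys=True) for record in records)


if __name__ == "__main__":
    case_ids = ["case-1", "case-2", "case-3"]  # hypothetical identifiers
    for batch in batched(case_ids, 2):
        # A real pipeline would submit each batch to the evaluation API here.
        results = [{"case_id": cid, "score": None} for cid in batch]
        print(to_jsonl(results))
```

JSON Lines is used here only because it appends cleanly from a CI job and loads easily into a dashboard. The export formats the API actually offers may differ.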

Join us in shaping the future of healthcare with AI
