CellBench-LS

Rigorous framework for evaluating generalization of single-cell foundation models in low-supervision scenarios across clustering, batch correction, annotation, expression reconstruction, and perturbation prediction.

Composite

65.2

Experimental validation

Retrospective

Stages

Virtual CellDisease Modeling

Modalities

small-moleculebiologic

Task types

classificationregressiongeneration

Size

tasks: 5
datasets: multiple general + multi-batch + perturbation scRNA-seq
models_evaluated: 10

License

Other

First release

2026-04-01

Last updated

2026-04-01

Official site

→ project page

Leaderboard

→ leaderboard

Dataset

→ dataset

Code / GitHub

→ repository

HuggingFace

→ HF

Paper

CellBench-LS: Benchmarking Single-Cell Foundation Models in Low-Supervision Scenarios · Unknown — biorxiv preprint · 2026 · paper · doi:10.64898/2026.04.01.714123 · 0 citations

Flags

none

Experts

—

Groups

—

Hosted by

—

Related benchmarks

scPerturBench, Open Problems: Perturbation Prediction, PerturbArena

Rubric (7-criterion)

rigor

coverage

maintenance

adoption

quality

accessibility

industry_relevance

Notes

Addresses critical gap: most scFM benchmarks use full supervision. Low-supervision evaluation is more realistic for clinical/translational settings. Finds scFMs don't consistently outperform simpler baselines.

← Back to all benchmarks

Compare:

Open comparison →