S4CMDR

ChronoMedKG

A Temporally-Grounded, Evidence-Graded Biomedical Knowledge Graph and Benchmark for Temporal Clinical Reasoning

ChronoMedKG is a temporally grounded, evidence-graded biomedical knowledge graph built by running a four-agent, disease-autonomous pipeline across 13,431 diseases. The pipeline yields 460,497 validated consensus triples out of 13 million extracted triples; 10,852 diseases produce surviving triples after multi-LLM consensus and Quality Controller filtering. Every edge carries temporal metadata (per-phenotype onset windows, progression stages, clinical milestones), PMID-traceable evidence text, and a six-signal credibility score.

ChronoMedKG ships paired with ChronoTQA, the first temporal biomedical QA benchmark: 3,341 questions across eight reported task types plus a 12-question supplementary HPOA negative-temporal MCQ probe.

For documentation and source code visit: https://gitlab.sdu.dk/screen4care/chronomedkg

Publications

(Ahmed et al., 2026)
  1. Md Shamim Ahmed, Farhad Firoozbakht, Lukas Galke Poech, Jan Baumbach and Richard Röttger. ChronoMedKG: A Temporally-Grounded, Evidence-Graded Biomedical Knowledge Graph and Benchmark for Temporal Clinical Reasoning. (2026). Link.