Commentary
Can LLMs assess risk of bias in medical research?
We tested two LLMs on risk-of-bias assessment of 14 medical studies under four prompt conditions. Overall agreement improved substantially with better prompts, but the criterion-level results tell a more nuanced story. Preprint coming soon.
April 13, 2026
Practical Guide
How to design, run, and report an AI evaluation step-by-step
March 15, 2026
Brief Review
Reporting standards for LLM evaluations: a brief review
March 13, 2026
Commentary
From benchmarks to science: the case for eval reporting standards
March 12, 2026
Commentary
Autoresearch meets biology: curing diseases autonomously
March 12, 2026
Brief Review
LLM evals in epidemiology, biostatistics, and health research
March 9, 2026
Science Spotlight
What would it take to reverse biological age? A brief review of current candidates
February 12, 2026