Research

Reliability, fairness, and interpretability of NLP evaluation. Statistical methods for robust conclusions.

Explore publications →

Talks

Invited talks and conference presentations.

See talks →