Publications

PutnamBench: Evaluating Neural Theorem-Provers on the Putnam Mathematical Competition

Published in NeurIPS 2024 Datasets and Benchmarks Track, 2024

We present PutnamBench, a new multi-language benchmark for evaluating the ability of neural theorem-provers to solve competition mathematics problems.

Recommended citation: George Tsoukalas, Jasper Lee, John Jennings, Jimmy Xin, Michelle Ding, Michael Jennings, Amitayush Thakur, and Swarat Chaudhuri. Putnambench: Evaluating neural theorem-provers on the putnam mathematical competition. In The Thirty-eighth Conference on Neural Information Processing Systems Datasets and Benchmarks Track, 2024a. URL https://arxiv.org/abs/2407.11214.
Download Paper

PutnamBench: A Multilingual Competition-Mathematics Benchmark for Formal Theorem-Proving

Published in AI for Math Workshop @ ICML 2024, 2024

We present PutnamBench, a new multilingual evaluation benchmark for formal theorem-proving.

Recommended citation: George Tsoukalas, Jasper Lee, John Jennings, Jimmy Xin, Michelle Ding, Michael Jennings, Amitayush Thakur, and Swarat Chaudhuri. Putnambench: A multilingual competition-mathematics benchmark for formal theorem-proving. In AI for Math Workshop @ ICML 2024, 2024b. URL https://openreview.net/forum?id=vqW1VRFeVP.
Download Paper