VeriBench: End-to-End Formal Verification Benchmark for AI Coding Agents in Lean 4

Published in ICML 2026 Workshop on Deep Learning for Code (DL4C); ICML 2026 AI for Math Workshop (AI4Math), 2026

Third-author paper, accepted to the ICML 2026 Workshop on Deep Learning for Code (DL4C) and the ICML 2026 AI for Math Workshop (AI4Math). Under review at NeurIPS 2026. Led the accompanying technical blog post.