Publications

You can also find my articles on my Google Scholar profile.

Conference Papers


[1] Certifying the Judge: Falsifiable Properties for LLM-Based Evaluation of Formal Code

Accepted at ICML 2026 Workshop on Deep Learning for Code (DL4C); ICML 2026 AI for Math Workshop (AI4Math), 2026

First-author paper on falsifiable properties for LLM-based evaluation of formal code.

[2] VeriBench: End-to-End Formal Verification Benchmark for AI Coding Agents in Lean 4

Accepted at ICML 2026 Workshop on Deep Learning for Code (DL4C); ICML 2026 AI for Math Workshop (AI4Math)., 2026

Third-author paper on an end-to-end formal verification benchmark for AI coding agents in Lean 4.

Technical Blog Post