Papers

Research publications and works in progress.

On the Reasoning Gaps of Large Language Models: A Formal Characterization

Pre-Print

NeurIPS 2026

We present a formal framework connecting LLM reasoning failures to computational complexity classes. Through 176,000 evaluations across 12 model families and 9 diagnostic tasks, we show that reasoning gaps cluster into six types predictable from problem structure. Chain-of-thought closes gaps for serial composition (+35 pp) but not for problems requiring parallel search or architectural capabilities (+9 pp).

reasoningcomputational complexitychain-of-thoughtbenchmarksformal methods

The Computational Complexity of Verifying LLM Outputs

In Progress

ICLR 2027

We develop a formal framework for the computational complexity of verifying language model outputs. We characterize verification difficulty across output types and prove that for several natural problem classes, verification is strictly easier than generation — but this gap varies systematically with output structure.

verificationcomplexity theoryLLM outputsformal verification

A Taxonomy of Failure Modes in LLM-Based Autonomous Agents

In Progress

ACL 2027

Drawing on 50+ deployment incidents from autonomous agent systems, we construct a structured taxonomy of failure modes. Categories span planning failures, tool-use errors, context management breakdowns, and goal drift. We connect each category to architectural mitigations and evaluate their effectiveness.

autonomous agentsfailure modestaxonomydeploymentsafety

Impossibility Results for Unsupervised Self-Improvement in Language Models

Early Stage

ICLR 2027

We establish theoretical bounds on unsupervised self-improvement in language models. Under standard complexity assumptions, we show that certain capability gains require external signal — no amount of self-play or self-evaluation can substitute for ground truth on specific problem classes.

self-improvementimpossibility resultstheoretical boundsself-play