Skip to main content
UsedBy.ai
All articles
Trend Analysis3 min read
Published: May 21, 2026

GPT-5 Pro Disproves 80-Year-Old Erdős Conjecture via Formal Verification

The model disproved the $n^{1+o(1)}$ upper bound for the unit distance problem, identifying point sets with more than $n^{1+\delta}$ distances (source: OpenAI Index, May 20, 2026). Verification was co

Marcus Webb
Marcus Webb
Senior Backend Analyst

The Pitch

OpenAI has released GPT-5 Pro, which autonomously disproved the 80-year-old Erdős planar unit distance conjecture using cross-disciplinary algebraic number theory. Verified via the Lean language, this result marks a shift from conversational mimicry to formal mathematical discovery. See OpenAI profile

Under the Hood

The model disproved the $n^{1+o(1)}$ upper bound for the unit distance problem, identifying point sets with more than $n^{1+\delta}$ distances (source: OpenAI Index, May 20, 2026). Verification was completed using the Lean formal proof language and a peer-review panel including Noga Alon and Melanie Wood (source: Dataconomy/KuCoin).

In terms of benchmarks, GPT-5 Pro reached 94.6% on AIME 2025 and 88.4% on GPQA Diamond (source: Stark Insider). While these numbers are statistically high, the system remains a black box. OpenAI has not disclosed the total compute budget or the number of failed attempts required to generate the successful proof (source: Reddit r/MachineLearning).

Current pricing for GPT-5 sits at $1.25 per 1M input tokens and $10 per 1M output tokens as of May 2026 (source: Vellum.ai). Critics argue the system acts as a "Stockfish for math," discovering counterexamples through massive search rather than fundamental conceptual understanding (source: HN Comment 4).

There are significant technical gaps in the public documentation. We don't know the exact model architecture—speculated to be a "Strawberry/o3" successor—and the specific "non-trivial tweaks" mentioned by postdocs in the Companion Remarks remain undisclosed (source: HN Comment 1).

OpenAI is also contending with a credibility gap following the April 2026 resignation of VP Kevin Weil. Weil departed after it surfaced that the company had falsely claimed GPT-5 solved 10 Erdős problems in late 2025 (source: AutoGPT/Dataconomy). This history of misrepresentation necessitates a cautious approach to their latest claims.

Marcus's Take

Do not mistake a brute-force search success for general reasoning capabilities in your production environment. While the Erdős proof is a genuine academic milestone, the high output costs and lack of transparency regarding failed attempts suggest GPT-5 is a high-compute specialist. Use it for complex verification where accuracy is non-negotiable, but stick to Claude 4.5 Opus or Claude 4 Sonnet for standard backend orchestration.


Ship clean code,
Marcus.

Marcus Webb
Marcus Webb

Marcus Webb - Senior Backend Analyst at UsedBy.ai

Related Articles

Stay Ahead of AI Adoption Trends

Get our latest reports and insights delivered to your inbox. No spam, just data.