First-Proof

Abstract

"To assess the ability of current AI systems to correctly answer research-level mathe matics questions, we share a set of ten math questions which have arisen naturally in the research process of the authors. The questions had not been shared publicly until now; the answers are known to the authors of the questions but will remain encrypted for a short time."

Models based on large languages (such as Gemini) are completely insufficient in a single generation without any prompts, while models combined with formal languages (such as lean) lack similar policy training cannot obtain valid result.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
Gemini 3 Pro		Gemini 3 Pro
Problems		Problems
First_Proof.tex		First_Proof.tex
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

First-Proof

About

Uh oh!

Releases

Packages

Languages

RuoranXu/First-Proof

Folders and files

Latest commit

History

Repository files navigation

First-Proof

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages