On the use of large language and reasoning models in mathematical research.

in #tech2 months ago

1000051680.jpg

Mathematicians used the language models Claude-3.5-Sonnet, Gemini-1.5-pro, GPT-4o, and the reasoning model o1-mini to collaborate on a paper on network information flows and lattice theory. AI-assisted in the initial conjectures, some proofs, and most applications.

In summary, although many incorrect proofs were generated, Claude-3.5/GPT-4o conjectured a new theorem, while o1-mini came up with an entirely new, clever, correct proof, more elegant than a human proof.

Thread: https://x.com/robertghrist/status/1841462507543949581
Paper: https://arxiv.org/abs/2410.00315