The AI revolution in math has arrived

quantamagazine.org

57 points by sonabinu 5 hours ago


sm0ss117 - 29 minutes ago

Mathematics seems like the ideal candidate for AIs to achieve absurd results. It's a purely abstract grammar with true auto-verifiability. Even SWE has the requirement of interacting with real physical things. In math there's no external feedback required, you're solely bounded by the rate and quality of token generation.

bgirard - 16 minutes ago

Last week I got together with my math alumni friend. We cracked some beers, we chatted with voice mode ChatGPT and toyed around with Collatz Conjecture and we sent some prompt to a coding agent to build visualizations and simulation. It was a lot of fun directing these agents while we bounced off ideas and the models could explore them.

I think with the right problem and the right agentic loop it’s clear to me improvements will speed up.

dogscatstrees - an hour ago

> As they did so, they also learned how to improve the prompts they gave AlphaEvolve. One key takeaway: The model seemed to benefit from encouragement. It worked better “when we were prompting with some positive reinforcement to the LLM,” Gómez-Serrano said. “Like saying ‘You can do this’ — this seemed to help. This is interesting. We don’t know why.”

Four top logical people in the world are acknowledging this. It is mind-blowing and we don't know why.

yabutlivnWoods - 12 minutes ago

We can define a Dyson Sphere in math.

We cannot build one.

AI outputting axiomatically valid syntax isn't going to be all that useful. It's possible to generate all axiomatically correct math with a for loop until the machine OOMs

Physics is not math and math is not physics.

claysmithr - 2 hours ago

I wonder when AI will be able to discern the passage of time

norejisace - an hour ago

Interesting development. It feels like AI is getting much better at symbolic reasoning, not just pattern recognition.

themafia - 2 hours ago

There are several high value prizes for mathematical research. Let me know when an "AI" has earned one of them. Otherwise:

> When Ryu asked ChatGPT, “it kept giving me incorrect proofs,” [...] he would check its answers, keep the correct parts, and feed them back into the model

So you had a conversational calculator being operated by an actual domain expert.

> With ChatGPT, I felt like I was covering a lot of ground very rapidly

There's no way to convert that feeling into a measurement of any actual value and we happen to know that domain experts are surprisingly easy to fool when outside of their own domains.

sparin9 - 38 minutes ago

[dead]