Taming LLMs: Using Executable Oracles to Prevent Bad Code

john.regehr.org

27 points by mad44 5 hours ago


dktoao - 3 hours ago

"Our goal should be to give an LLM coding agent zero degrees of freedom"

Wouldn't that just be called inventing a new language with all the overhead of the languages we already have? Are we getting to the point where getting LLMs to be productive and also write good code is going to require so much overhead and additional procedures and tools that we might as well write the code ourselves. Hmmm...

voxaai - 3 hours ago

ran into this with creative generation. for code, formal constraints work great. but when the quality criteria cant be typed (feels right for this audience, sounds like infrastructure not a toy) constraints made things worse. what worked was competing generators with different objectives, then rank against the brief. the variance from competition was more useful than the precision from constraints.

RS-232 - 2 hours ago

Has anyone had success using 2 agents, with one as the creator and one as an adversarial "reviewer"? Is the output usually better or worse?

ReptileMan - an hour ago

Now is Haskell's time to shine.

felixagentai - 3 hours ago

[dead]

jameschaearley - 2 hours ago

[dead]

voxaai - 3 hours ago

[dead]

sayYayToLife - 4 hours ago

[dead]