Monday, May 26, 2025

New "Absolute Zero" Model Learns with NO DATA - Matthew Berman, YouTube

This video discusses a new AI paradigm called "Absolute Zero" [00:34], where language models can learn and improve without human intervention. This method allows AI to propose, solve, and learn from its own problems [00:40], unlike previous methods that relied on human-generated data or verifiable rewards [01:17]. The "Absolute Zero" model can define tasks to maximize learnability and solve them effectively [05:35], leading to self-evolution through self-play. The video highlights that this approach has shown remarkable capabilities in math and coding [08:47], even outperforming models trained with human-curated datasets [09:11]. Key insights from the research include the amplification of reasoning through coding priors, enhanced cross-domain transferability, and the emergence of cognitive behaviors like step-by-step planning in the AI's code [09:24]. The model learns by experimenting and self-play, similar to how humans learn [06:42], continuously improving by proposing problems at the edge of its abilities [08:19].

https://www.youtube.com/watch?v=CqdqZNqljdI

No comments: