Scaffolding to Superhuman: Curriculum Learning Beats 2048 and Tetris
TL;DR An individual researcher used PufferLib, extensive hyperparameter sweeps and curated curricula to train agents that outperform a terabyte-scale search…
Wow News on Tech and AI
TL;DR An individual researcher used PufferLib, extensive hyperparameter sweeps and curated curricula to train agents that outperform a terabyte-scale search…