Tech NewsFebruary 6, 2025These researchers used NPR Sunday Puzzle questions to benchmark AI ‘reasoning’ models
Tech NewsJanuary 11, 2025Researchers improved AI agent performance on unfamiliar tasks using ‘Dungeons and Dragons’
Tech NewsJanuary 11, 2025Google DeepMind researchers introduce new benchmark to improve LLM factuality, reduce hallucinations
Tech NewsNovember 1, 2024Meta AI researchers give robots a sense of touch and we’re getting all the creepy feels
Tech NewsJuly 13, 2024Meta researchers distill System 2 thinking into LLMs, improving performance on complex reasoning