GPT-5 launched yesterday. 94.6% on AIME 2025. 74.9% on SWE-bench.
As we approach the upper bounds of these benchmarks, they die.
What makes GPT-5 and the next generation of models revolutionary isn’t their knowledge. It’s knowing how to act. For GPT-5 this happens at two levels. First, deciding which model to use. But second, and more importantly, through tool calling.
We’ve been living in an era where LLMs mastered knowledge retrieval & reassembly.
Recent Stories
AI helps reveal global surge in floating algae
For the first time and with help from artificial intelligence, researchers have conducted a comprehensive study of global floating algae and found that blooms are expanding across the ocean. These trends ...
Jan 19, 2026Agent Lightning: Train ANY AI Agents with Reinforcement Learning
We present Agent Lightning, a flexible and extensible framework that enables Reinforcement Learning (RL)-based training of Large Language Models (LLMs) for any AI agent. Unlike existing methods...
Jan 19, 2026NEURA Robotics joins forces with Bosch to deploy German humanoid creations
The current CTO of NEURA formerly served in a leading position at Bosch