A benchmark called OSWorld-Verified, designed to monitor AI's ability to navigate desktop environments, found that GPT 5.4 scored 75%, up from 47.3% with its GPT 5.2 model. That also beats the average ...
GPT-5.4 is out now on ChatGPT (where it goes by the name GPT-5.4 Thinking) as well as on the OpenAI API and OpenAI’s coding ...
GPT-5.4 is also more reliable, producing 18% fewer errors and 33% fewer false claims than GPT-5.2, according to OpenAI.
The latest model comes with native computer use capabilities, allowing it to take on jobs across your device and applications.
OpenAI released GPT-5.4 today with native computer use, a 1M-token context window, and new professional benchmarks. Find what ...
OpenAI launches GPT-5.4, calling it its most capable and efficient AI model yet, with AI agents, computer control, improved reasoning, and a 1M-token context.
OpenAI, Google, and Alibaba unveil faster, cheaper AI models built for real-time apps and local devices, signaling a shift from AI power to speed and efficiency.
Choosing an AI assistant for your business in 2026 isn’t as simple as picking the most popular one. Claude, ChatGPT, and Gemini have each matured into serious ...
Lite, its fastest and most cost-efficient AI model, at $0.25 per million tokens and 2.5x faster than Gemini 2.5 Flash.
Microsoft has announced that the Microsoft Agent Framework has reached Release Candidate status for both .NET and Python. This milestone indicates that the API surface is stable and feature-complete ...
Familiarity with basic networking concepts, configurations, and Python is helpful, but no prior AI or advanced programming ...
OpenAI wants to retire the leading AI coding benchmark—and the reasons reveal a deeper problem with how the whole industry measures itself.
Some results have been hidden because they may be inaccessible to you
Show inaccessible results