Model Training in Python

NVIDIA Diffusion LLM Hits 2.42x Throughput Without Retraining: Nemotron TwoTower Released

NVIDIA diffusion language model Nemotron TwoTower achieves 2.42x LLM inference throughput without a full retraining run, ...

Center for Strategic and International Studies

What to Know About Chinese AI Models

Chinese AI models are rapidly closing the gap with U.S. frontier systems. This analysis examines what their growing ...

Decrypt

LongCat-2.0: The Stealth AI Model That Was Quietly Topping OpenRouter All Along

Chinese tech company Meituan officially unveiled LongCat-2.0 on June 30, confirming the open-license, 1.6-trillion-parameter mixture-of-experts AI model is the same system that sp ...

Decrypt

Ornith Is the Open-Source Coding Model Built for Agents, Not Humans

Ornith 1.0 by DeepReinforce is meant for developers who want AI that finishes the job, not just autocompletes the next line.

Tech.eu

Robotics has a data problem. Macrodata Labs wants to solve it

After helping build some of the world's most widely used open AI datasets at Hugging Face, Guilherme Penedo and Hynek ...

IEEE

A Pricing Game for Federated Learning Supporting Lightweight Local Model Training

Abstract: The pervasive distribution of data across clients with privacy concerns and heterogeneous performance in edge networks presents a significant opportunity to enhance AI model performance.

Tech Times

Open-Source Coding Model Ornith-1.0 Writes Its Own Training Scaffold in Reinforcement Learning

Open-source agentic coding model Ornith-1.0, released today under the MIT license, uses a self-improving reinforcement ...

Hollywood Workers Are Training AI Models as Job Prospects Grow Slim

Some TV and film vets are taking gigs in the world of Reinforcement Learning from Human Feedback, helping smooth out Gen AI ...

IEEE

Training Large Models on Heterogeneous and Geo-Distributed Resource with Constricted Networks

Abstract: As the computational demands driven by large model technologies continue to grow rapidly, leveraging GPU hardware to expedite parallel training processes has emerged as a commonly-used ...

Journal of Medical Internet Research

Evaluation Framework of Large Language Models in Medical Documentation: Development and Usability Study

Background: The advancement of large language models (LLMs) offers significant opportunities for health care, particularly in the generation of medical documentation. However, challenges related to ...

14d

Own It Or Rent It? A CIO's Framework For AI Deployment

"Own or rent" has become the pivotal AI question for every CIO. In the rush of the last two years, the default was to ...

16d

Why Weibo’s tiny VibeThinker-3B has the AI world arguing over benchmarks again

B, a 3-billion-parameter AI model, is challenging OpenAI, Google and DeepSeek on math and coding benchmarks while reigniting the debate over AI scaling, benchmark gaming and small-model reasoning.

Some results have been hidden because they may be inaccessible to you

Show inaccessible results