Model Token - Search News

Interesting Engineering on MSN

GPT-5.3-Codex-Spark delivers ultra-fast real-time AI coding at 1,000 tokens per second

OpenAI has launched GPT-5.3-Codex-Spark, its first AI model built specifically for real-time coding, capable ...

Anthropic launches Claude Sonnet 4.6 with 1M token context window

Anthropic launches Claude Sonnet 4.6 with a 1M token context window, delivering near Opus level performance at lower tier pricing.

MiniMax M2.5 Uses 10B Active Parameters per Token, Aiming for Cheaper Always-On Agents

MiniMax M2.5 hits about 80% on Sweetbench and runs near 100 tokens per second, helping teams deploy faster models on tighter budgets.

Geeky Gadgets

New Llama 4 AI Model 10 Million Token Context Window

Meta has unveiled Llama 4, its latest artificial intelligence model, designed to redefine the boundaries of AI technology. This advanced model comes in two distinct variants—Maverick and Scout—each ...

4don MSN

OpenAI dishes out its first model on a plate of Cerebras silicon

GPT-5.3-Codex-Spark may be a mouthfull, but it's certainly fast at 1,000 Tok/s running on Nvidia rival's CS3 accelerators Nvidia and AMD can take a seat. On Thursday, OpenAI unveiled ...

OpenAI sidesteps Nvidia with unusually fast coding model on plate-sized chips

OpenAI has spent the past year systematically reducing its dependence on Nvidia. The company signed a massive multi-year deal with AMD in October 2025, struck a $38 billion cloud computing agreement ...

VentureBeat

Google unveils Gemini 1.5, a next-gen AI model with million-token context window

Google today unveiled Gemini 1.5, the latest iteration of its conversational AI system, touting major advances in efficiency, performance and long-form reasoning capabilities. The new system, detailed ...

SiliconANGLE

Cerebras delivers blazing speed for OpenAI’s new open-model with 3,000 tokens per second

Cerebras Systems Inc., a startup providing ultra-fast artificial intelligence inference, today announced support for OpenAI’s newly released 120 billion-parameter open-weight reasoning model, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results