Interesting Engineering on MSN
GPT-5.3-Codex-Spark delivers ultra-fast real-time AI coding at 1,000 tokens per second
OpenAI has launched GPT-5.3-Codex-Spark, its first AI model built specifically for real-time coding, capable ...
Anthropic launches Claude Sonnet 4.6 with a 1M token context window, delivering near Opus level performance at lower tier pricing.
MiniMax M2.5 hits about 80% on Sweetbench and runs near 100 tokens per second, helping teams deploy faster models on tighter budgets.
Meta has unveiled Llama 4, its latest artificial intelligence model, designed to redefine the boundaries of AI technology. This advanced model comes in two distinct variants—Maverick and Scout—each ...
GPT-5.3-Codex-Spark may be a mouthfull, but it's certainly fast at 1,000 Tok/s running on Nvidia rival's CS3 accelerators Nvidia and AMD can take a seat. On Thursday, OpenAI unveiled ...
OpenAI has spent the past year systematically reducing its dependence on Nvidia. The company signed a massive multi-year deal with AMD in October 2025, struck a $38 billion cloud computing agreement ...
Google today unveiled Gemini 1.5, the latest iteration of its conversational AI system, touting major advances in efficiency, performance and long-form reasoning capabilities. The new system, detailed ...
Cerebras Systems Inc., a startup providing ultra-fast artificial intelligence inference, today announced support for OpenAI’s newly released 120 billion-parameter open-weight reasoning model, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results