Language Model - Search News

Open source Mamba 3 arrives to surpass Transformer architecture with nearly 4% improved language modeling, reduced latency

This release is good for developers building long-context applications, real-time reasoning agents, or those seeking to reduce GPU costs in high-volume production environments.

The Economist

Top AI models underperform in languages other than English

This illustrates a widespread problem affecting large language models (LLMs): even when an English-language version passes a safety test, it can still hallucinate dangerous misinformation in other ...

Nvidia shrinks LLM memory 20x without changing model weights

Nvidia's KV Cache Transform Coding (KVTC) compresses LLM key-value cache by 20x without model changes, cutting GPU memory costs and time-to-first-token by up to 8x for multi-turn AI applications.

Tech Xplore on MSN

A better method for identifying overconfident large language models

Large language models (LLMs) can generate credible but inaccurate responses, so researchers have developed uncertainty quantification methods to check the reliability of predictions. One popular ...

IndustryWeek

Fluke Reliability Puts Large Language Models to the Test

Last year, I participated in a roundtable discussion on artificial intelligence at Fluke Reliability’s Thought Leadership Day ...

OpenAI, Mistral AI release new hardware-efficient language models

OpenAI Group PBC and Mistral AI SAS today introduced new artificial intelligence models optimized for cost-sensitive use cases. OpenAI is rolling out two algorithms called GPT-5.4 mini and GPT 5.4 ...

Tech Xplore

AI model trained on 14,000 Urdu news stories spots misinformation with 96% accuracy

A deep learning model trained on more than 14,000 Pakistani news articles can spot misinformation with 96% accuracy, ...

Communications of the ACM

Measuring What Matters in Large Language Model Performance

As large language models (LLMs) gain momentum worldwide, there’s a growing need for reliable ways to measure their performance. Benchmarks that evaluate LLM outputs allow developers to track ...

7mon

Sam Altman launches GPT-oss, OpenAI's first open-weight AI language model in over 5 years

Sam Altman said that OpenAI's new GPT-oss, comprising a 120b and 20b version, is the "best and most usable open model in the ...

The Conversation

Large language models: how the AI behind the likes of ChatGPT actually works

Mark Stevenson has previously received funding from Google. The arrival of AI systems called large language models (LLMs), like OpenAI’s ChatGPT chatbot, has been heralded as the start of a new ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results