A team of Apple researchers has found that advanced AI models’ alleged ability to “reason” isn’t all it’s cracked up to be. But marketing aside, there’s no agreed-upon industrywide definition for what ...
Over the weekend, Apple released new research accusing the most advanced generative AI models from the likes of OpenAI, Google, and Anthropic of failing to handle tough logical-reasoning problems.
Foundation models address a fundamental flaw in bespoke AI, but foundation models and large language models have limitations of their own. GPT-3, BERT, and DALL·E 2 garnered gushing headlines, but models like these ...
A day after Google announced its first model capable of reasoning over problems, OpenAI has upped the stakes with an improved version of its own. OpenAI’s new model, called o3, replaces o1, which the ...
We set out to test LLM reasoning capabilities using Einstein's puzzle, a complex logic problem involving 5 houses with different characteristics and 15 clues to determine who owns a fish. Our initial ...
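The puzzle described above is a classic constraint-satisfaction problem. As a minimal sketch of that structure (not the original 5-house, 15-clue puzzle, and not the test harness used here), the same idea can be shown with a shrunken, illustrative 3-house variant solved by brute force:

```python
from itertools import permutations

# Illustrative 3-house version of an Einstein-style puzzle.
# The nationalities, pets, and clues below are invented for the sketch.
NATIONALITIES = ("Brit", "Swede", "Dane")
PETS = ("dog", "bird", "fish")

def solve():
    """Return the nationality of the fish owner, or None if no assignment fits."""
    # Try every way of assigning nationalities and pets to houses 0..2.
    for nats in permutations(NATIONALITIES):
        for pets in permutations(PETS):
            # Clue 1: the Brit lives in the first house.
            if nats[0] != "Brit":
                continue
            # Clue 2: the Swede keeps the dog.
            if pets[nats.index("Swede")] != "dog":
                continue
            # Clue 3: the bird lives next door to the Brit.
            if abs(pets.index("bird") - nats.index("Brit")) != 1:
                continue
            # All clues satisfied: report who ends up with the fish.
            return nats[pets.index("fish")]
    return None

print(solve())  # the Brit is the only consistent fish owner here
```

The real puzzle works the same way, just with 5 houses and 5 attribute categories, which is why pure enumeration balloons to (5!)^5 candidate assignments and why it makes a useful stress test for step-by-step reasoning.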
Using two newly developed types of reasoning tests, a team of researchers at UCL and UCLH has identified key brain regions that are essential for logical thinking and problem-solving. The results will ...
Leaked OpenAI GPT-5.4 details include Extreme Reasoning Mode and 6,000 lines per prompt, aimed at complex coding work.