ai benchmark - Search News

14hon MSN

These researchers used NPR Sunday Puzzle questions to benchmark AI ‘reasoning’ models

Researchers used questions from the NPR Sunday Puzzle challenge to build a benchmark to test AI 'reasoning' models.

Nvidia counters AMD DeepSeek AI benchmarks, claims RTX 4090 is nearly 50% faster than 7900 XTX

Nvidia published RTX 5090, RTX 4090 DeepSeek benchmarks against the RX 7900 XTX, countering AMD's performance claims that the ...

Alibaba’s Qwen2.5-Max challenges U.S. tech giants, reshapes enterprise AI

Alibaba's Qwen2.5-Max AI model sets new performance benchmarks in enterprise-ready artificial intelligence, promising reduced ...

6don MSN

DeepSeek: Everything you need to know about the AI chatbot app

DeepSeek has gone viral. Chinese AI lab DeepSeek broke into the mainstream consciousness this week after its chatbot app rose ...

3don MSN

Humanity’s Last Exam Explained – The ultimate AI benchmark that sets the tone of our AI future

Humanity's Last Exam”, an evaluation is being hailed as the definitive test to determine whether AI can match – or surpass – ...

Alleged Xiaomi 15 Ultra spotted on Geekbench AI benchmark with Snapdragon 8 Elite

A Xiaomi device with model number 25010PN30G surfaced on the Geekbench AI benchmark platform yesterday, which is expected to be the global version of the Xiaomi 15 Ultra. The phone comes with Android ...

Yahoo13d

Could you pass 'Humanity’s Last Exam'? Probably not, but neither can AI

Did you know some of the smartest people on the planet create benchmarks to test AI’s capabilities at replicating human intelligence? Well, scarily enough most AI benchmarks are easily completed ...

1hon MSN

TRAIT Explained – How AI chatbots are evolving with distinct personalities?

A study titled Do LLMs Have Distinct and Consistent Personality?, detailed in a paper from Yonsei University and Seoul National University, introduces TRAIT.

Alibaba’s Qwen 2.5 Max scores big in AI benchmarks, claims world-leading performance

DeepSeek, a 20-month-old startup founded in Alibaba’s home city, Hangzhou, became a global sensation this week and figures prominently as the first benchmark that Alibaba appears to measure itself ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results