Programming Language Benchmarks

Most powerful programming language of the future isn’t C++ or Python, it’s..., says Nvidia CEO Jensen Huang

Nvidia CEO Jensen Huang says English could become the most powerful programming language as AI reduces the need for traditional coding and shifts focus toward intent-driven human-machine interaction.

Quesma Releases OTelBench: Independent Benchmark Reveals Frontier LLMs Struggle with Real-World SRE Tasks

New benchmark shows top LLMs achieve only 29% pass rate on OpenTelemetry instrumentation, exposing the gap between ...

Sarvam AI launches 30B and 105B models, says 105B outperforms DeepSeek R1 and Gemini Flash on key benchmarks

Bengaluru-based AI startup Sarvam AI on February 18 announced the launch of two new large language models, a 30-billion-parameter model and a 105-billion-parameter model, both trained from scratch, ...

10d

Sarvam AI claims edge over larger global models on Indic benchmarks

Capable of reasoning, designed for voice, and fluent in Indian languages, the model would be ready for population-scale deployment ...

15d

OpenAI introduces Frontier agent management platform and new GPT-5.3-Codex model

OpenAI Group PBC today introduced a platform called Frontier that companies can use to build and manage artificial ...

New agent framework matches human-engineered AI systems — and adds zero inference cost to deploy

A new group-evolving agent framework from UC Santa Barbara matches human-engineered AI systems on SWE-bench — and adds zero ...

14don MSN

OpenAI’s GPT-5.3-Codex thinks deeper and wider about coding work

The company says its latest model’s agentic skills also apply to a broader set of knowledge work such as presentations and spreadsheets. On Thursday, OpenAI released GPT-5.3-Codex, a new model that ...

Morning Overview on MSN

AI cracks 'impossible' math problems, but can it intimidate top geniuses?

Google DeepMind’s AlphaProof system scored at a silver-medal level when tested against the 2024 International Mathematical ...

Sarvam AI unveils indigenously-built 30B and 105B LLM models

Sarvam AI launches two advanced LLM models, 30B and 105B, outperforming competitors in key benchmarks, focusing on Indian language support.

43m

Shopsense AI and Bell Media Expand Content-to-Commerce Partnership in Canada

Partnership extends nation’s only shoppable TV offering at scale, unlocking incremental living room engagement and new ...

1dOpinion

Is China About To "BYD" Tesla’s Humanoid Dreams?

During China's latest New Year festivities, humanoid robots from firms such as Unitree, AgiBot, and LimX Dynamics showcased ...

17d

Qwen3-Coder-Next offers vibe coders a powerful open source, ultra-sparse model with 10x higher throughput for repo tasks

On SWE-Bench Verified, the model achieved a score of 70.6%. This performance is notably competitive when placed alongside significantly larger models; it outpaces DeepSeek-V3.2, which scores 70.2%, ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results