Alibaba launches new AI model targeting rival DeepSeek, China’s hottest start-up

In a statement posted on WeChat, Alibaba Cloud, the e-commerce giant’s cloud computing and AI arm, said its new Qwen 2.5-Max model also outperformed OpenAI’s GPT-4o and Meta Platforms’ Llama-3.1-405B on the LLM benchmark platforms Arena-Hard and LiveBench. Alibaba owns the South China Morning Post. The benchmark performance of Qwen 2.5-Max, part of Alibaba’s Tongyi …

Anthropic’s Claude can now control computers like people do

Anthropic’s already impressive Claude 3.5 Sonnet gains a significant performance boost on Tuesday as the generative AI startup rolls out an enhanced and updated version of the model alongside the new, lightweight Claude 3.5 Haiku. The Sonnet update includes a public beta feature that gives the AI basic control over the computer it’s running …

Maths test stumps AI models: which number is bigger, 9.90 or 9.11?

The wave of artificial intelligence (AI) chatbots allowed for public use in mainland China enables many users to create new content – including audio, code, images, simulations, videos and grammatically correct text – to entertain and help with everyday tasks. That demand has led to the local development of more than 200 large language models …

Alibaba’s AI model outperforms Chinese rivals, ranks just behind OpenAI, Anthropic

Qwen2-72B-Instruct – the most advanced version of the Hangzhou-based e-commerce giant’s Qwen family of large language models (LLMs), the open-source version of Tongyi Qianwen – came in just behind OpenAI’s GPT-4o and Anthropic’s Claude 3.5 Sonnet in a ranking from SuperClue, a benchmarking platform that evaluates models based on metrics such as calculations, logic …

Claude 3.5 Sonnet vs. GPT-4o: here’s how they stack up

In the ever-growing large language model (LLM) landscape, two front-runners stand out from the rest of the field: Anthropic’s Claude 3.5 Sonnet and OpenAI’s GPT-4o (the “o” stands for “Omni”). Both models boast impressive capabilities, but which reigns supreme? This guide dives deep into Claude 3.5 Sonnet and GPT-4o, dissecting their strengths and weaknesses across various …