ChatGPT’s latest model may be a regression in performance

ChatGPT’s latest model may be a regression in performance

According to a new report from Artificial Analysis, OpenAI’s flagship large language model for ChatGPT, GPT-4o, has significantly regressed in recent weeks, putting the state-of-the-art model’s performance on par with the far smaller, and notably less capable, GPT-4o-mini model. This analysis comes less than 24 hours after the company announced an upgrade for the GPT-4o … Read more

Alibaba launches maths-specific AI models said to outperform LLMs from OpenAI, Google

Alibaba launches maths-specific AI models said to outperform LLMs from OpenAI, Google

“Over the past year, we have dedicated significant efforts to researching and enhancing the reasoning capabilities of large language models, with a particular focus on their ability to solve arithmetic and mathematical problems,” the Qwen team, part of Alibaba’s cloud computing unit, said in a post published on developer platform GitHub on Thursday. Alibaba owns … Read more

Maths test stumps AI models: which number is bigger, 9.90 or 9.11?

Maths test stumps AI models: which number is bigger, 9.90 or 9.11?

The wave of artificial intelligence (AI) chatbots allowed for public use in mainland China enables many users to create new content – including audio, code, images, simulations, videos and grammatically correct text – to entertain and help with everyday tasks. That demand has led to the local development of more than 200 large language models … Read more

OpenAI partners with Los Alamos to test AI’s value for lab work

OpenAI partners with Los Alamos to test AI’s value for lab work

OpenAI is teaming up with Los Alamos National Laboratory, best known for developing the world’s first atomic bomb, to study the opportunities and risks for using artificial intelligence systems to assist with scientific research. The Microsoft-backed start-up said Wednesday that it is working with Los Alamos to evaluate how its latest AI model, GPT-4o, can … Read more

Claude 3.5 Sonnet vs. GPT-4o: here’s how they stack up

Claude 3.5 Sonnet vs. GPT-4o: here’s how they stack up

In the ever-growing large language model (LLMs) landscape, two front-runners stand out from the rest of the race: Anthropic’s Claude 3.5 Sonnet and OpenAI’s GPT-4o (the “o” stands for “Omni”). Both AIs boast impressive capabilities, but which reigns supreme? This guide dives deep into Claude 3.5 Sonnet and GPT-4o, dissecting their strengths and weaknesses across various … Read more

Alibaba’s large language model tops global ranking of AI developer platform Hugging Face

Alibaba’s large language model tops global ranking of AI developer platform Hugging Face

Three of the four top 10-ranked Chinese LLMs were from the Tongyi Qianwen series, also known as Qwen, developed by e-commerce and cloud computing giant Alibaba, according to AI and machine-learning developer platform Hugging Face, which released its updated leaderboard with new metrics on Wednesday. Alibaba owns the South China Morning Post. Hangzhou-based Alibaba’s Qwen-72B-Instruct … Read more

You’re Making Me Blush!’: Voice-Activated ChatGPT ‘Shy’ When Told How Amazing ‘She’ Is

You’re Making Me Blush!’: Voice-Activated ChatGPT ‘Shy’ When Told How Amazing ‘She’ Is

OpenAI has launched GPT-4o, the latest iteration in its GPT series, marking a significant leap in AI. GPT-4o, with “o” signifying “omnimodal,” ushers in a new era of human-computer interaction by accepting and generating combinations of text, audio, and image inputs and outputs. The large language model (LLM) race is heating up. While OpenAI is … Read more