Добавить новость
ru24.net
Game24.pro
Январь
2025
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30
31

Forget DeepSeek R1, apparently it's now Alibaba that has the most powerful, the cheapest, the most everything-est chatbot

0

As Ferris astutely observed, life moves pretty fast in chatbot land. So, forget that ancient news about DeepSeek's cheap and powerful LLM tanking Nvidia's share price. Because here comes another Chinese tech giant, Alibaba, with its own new AI model that surpasses the lot. Well, it does according to Alibaba.

Qwen 2.5-Max, for it is he (she? they? take your pick), was released today according to Reuters and with it some pretty bombastic claims.

"Qwen 2.5-Max outperforms...almost across the board GPT-4o, DeepSeek V3 and Llama-3.1-405B," Alibaba says. Notably, that's DeepSeek V3, not DeepSeek R1, which is the updated model that helped wipe $600 billion from Nvidia's share price in a day.

Still, those are OpenAI and Meta's most advanced open-source models. So, if Alibaba's claims are true, Qwen 2.5-Max is no slouch.

Reuters notes that the Alibaba release plays into a wider price war operating in China for access to AI models. It was an earlier DeepSeek model that moved Alibaba to announce massive 97% price cuts for access to its AI models.

For now, it's unclear how resource intensive the Qwen 2.5-Max is or is not. The thing that really rattled the markets when it comes to DeepSeek R1 arguably isn't its outright performance, but rather claims that it was trained on just $6 million dollars' worth of slightly hobbled Nvidia H800 GPUs, a small fraction of the cost associated with the huge GPU arrays used by the likes of OpenAI and Meta to train their models.

It's also emerged that the full-precision DeepSeek R1 model can run on just $6,000 of PC hardware, that cost mostly being eaten up by lots of memory but without the need for a megabucks Nvidia GPU.

So, the fact alone that Alibaba has a competitive model isn't earth shattering news. But the questions of the hardware used and the costs involved are intriguing.

There may also be an extent to which this first species of chatbot tech is maturing and converging. Anyone familiar with ChatGPT 4o can't help, for instance, to note the eerie similarities with, say, DeepSeek R1.

Your next upgrade

(Image credit: Future)

Best CPU for gaming: The top chips from Intel and AMD.
Best gaming motherboard: The right boards.
Best graphics card: Your perfect pixel-pusher awaits.
Best SSD for gaming: Get into the game ahead of the rest.

The style and tone of the text, the delivery, the length of responses, the actual content, the tendency to hallucinate falsehoods, the sense of very comparable invisible guard rails and, for want of a better term, social adjustment—everything seems virtually identical, for better and indeed also for worse.

Where this all leads is anyone's guess. Maybe we'll be stuck with a load of samey, semi-useful, oft-hallucinating bots for the foreseeable. Or maybe the AGI or artificial general intelligence explosion is just around the corner. Either way, I for one...no, actually, I don't welcome them. But I am fascinated in what happens next.




Moscow.media
Частные объявления сегодня





Rss.plus




Спорт в России и мире

Новости спорта


Новости тенниса
ATP

Российский теннисист Медведев покинул топ-5 рейтинга ATP






Человек труда - герой нашего времени

Максим Ликсутов: В Митине появится промтехнопарк в ...

День рождения киностудии «Мосфильм»

ФАС России оштрафовала ВТБ за вводящую в заблуждение рекламу карт с бонусами