Добавить новость
ru24.net
News in English
Декабрь
2024
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31

Are LLMs capable of non-verbal reasoning?

0

Large language models have found great success so far by using their transformer architecture to effectively predict the next words (i.e., language tokens) needed to respond to queries. When it comes to complex reasoning tasks that require abstract logic, though, some researchers have found that interpreting everything through this kind of "language space" can start to cause some problems, even for modern "reasoning" models.

Now, researchers are trying to work around these problems by crafting models that can work out potential logical solutions completely in "latent space"—the hidden computational layer just before the transformer generates language. While this approach doesn't cause a sea change in an LLM's reasoning capabilities, it does show distinct improvements in accuracy for certain types of logical problems and shows some interesting directions for new research.

Wait, what space?

Modern reasoning models like ChatGPT's o1 tend to work by generating a "chain of thought." Each step of the logical process in these models is expressed as a sequence of natural language word tokens which are fed back through the model.

Read full article

Comments




Moscow.media
Частные объявления сегодня





Rss.plus




Спорт в России и мире

Новости спорта


Новости тенниса
WTA

Екатерина Александрова уступила в первом круге турнира WTA-125 в Лиможе






В "Бетисе" сообщили, что не интересуются игроком "Локомотива" Тикнизяном

Умер актер сериала «Интерны» Николай Ледовских

Мэр Москвы рассказал о выставках и концертах на станциях столичного метро

Глава Минэкономразвития: прогноз по инфляции на 2025 год уточним к апрелю