Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

The LLM is still a "normal algorithm", just one with a fairly large dataset to use. Assigning the algorithm part of LLMs magical properties hinders understanding. The work needed to pick the next output token is very much a classical algorithm.

Algorithms can be based on training and/or use data just fine, too. https://arxiv.org/abs/1712.01208

(Now, the weights used, those we kinda really don't understand the same way we understand the processing, and the approach to looking for structures in weights sometimes looks more like archeology or anthropology than computer science.)

It sounds like you're trying to express some kind of "but LLMs are so much more" thought. Yes, very much, they are. It's because of the size of the data, there's interesting emergence there. They're still a normal algorithm. (And our brains aren't quite like that; biological things are much more random/chaotic and generally non-reproducible. And the data and algorithm aren't separate.)

 help



Anthropic has a blog article on how they analyse their LLM to understand better how the LLM is doing math and estimates.

For this they needed extra tools to do so.

This 'algorithm' of how the lLM does that, was unknown before their research.

Our brains are not that chaotic though. They have even more complexity to size for sure and the issue that its hard to look into a humans brain.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: