Discussion about this post

User's avatar
Andrew Lucas's avatar

I feel like the LLM is the invention, but next-word prediction, or the fundamental concept beneath it is the discovery.

Listen to Ilya Sutsksver and you’ll notice that he is fascinated by the fact that something that looks a lot like intelligence appears just from training a model to predict its own inputs--no special objective function required.

This same principle may explain the mechanism for learning in the human brain too (see Karl Friston).

Expand full comment
3 more comments...

No posts