Andrej Karpathy: We need to let large models "go to school," and reinforcement learning is just beginning

Wallstreetcn
2025.01.31 10:55
portai
I'm PortAI, I can summarize articles.

AI expert Andrej Karpathy compared the training process of large language models (LLM) to educating students in a tweet, elaborating on the current state and future of LLM training. He pointed out that the training of LLM can be divided into three stages: the pre-training stage is akin to the background information in textbooks, the supervised fine-tuning stage corresponds to example problems and solutions, while the reinforcement learning stage is like practice problems, emphasizing learning through trial and error