Scaling Law is in a dilemma, is reinforcement learning the only hope for the whole village?

Huxiu
2024.09.12 06:08
portai
I'm PortAI, I can summarize articles.

Facing bottlenecks, Scaling Law is seen as the key to AI breakthroughs. Recently, the Q3 summary of the AI industry pointed out that pre-training Scaling Law is no longer effective, and 80% of companies may abandon this strategy. Instead, Self-play RL is considered the future hope, especially in terms of coding ability, where Claude Sonnet 3.5 has outperformed GPT-4o, demonstrating the potential of RL. Meanwhile, OpenAI is about to release a new model, and the ChatGPT Pro subscription plan has also been launched, priced at $200 per month