DeepSeek releases OCR 2 with new visual encoding architecture, targeting more human-like machine vision

Source: TechNode (English edition)
2026.01.28 02:34

Chinese AI startup DeepSeek has released DeepSeek-OCR 2, its latest optical character recognition model. The model is built on a new visual encoding architecture, DeepEncoder V2, which dynamically rearranges image components based on context and is aimed at making machine interpretation of visual information more human-like. The model also improves data compression efficiency and reduces computational costs, and it achieves a benchmark score of 91.09%, 3.73% higher than its predecessor. The release reflects growing competition among Chinese AI developers in foundational models and open-source capabilities.