<p>By Eduardo Baptista</p>
<p>BEIJING, March 18 (Reuters) - A powerful artificial intelligence model that appeared anonymously on a developer platform last week has sparked speculation that Chinese startup DeepSeek may be quietly testing its next-generation system ahead of an official launch.</p>
<p>The free model, called Hunter Alpha, surfaced on the AI gateway platform OpenRouter on March 11 without any developer attribution and was later described by the platform as a “stealth model.”</p>
<p>During tests conducted by Reuters, the Hunter Alpha chatbot described itself as “a Chinese AI model primarily trained in Chinese” and said its training data extended to May 2025, the same knowledge cutoff point reported by DeepSeek’s own chatbot.</p>
<p>When asked about its creator, however, the system declined to identify its developer.</p>
<p>“I only know my name, my parameter scale and my context window length,” the chatbot said.</p>
<p>Neither DeepSeek nor OpenRouter has identified the model’s creator, and neither responded to requests for comment.</p>
<p>Hunter Alpha’s profile page describes it as a 1-trillion-parameter model, meaning it contains roughly one trillion adjustable values, set during training, that determine how the system processes language and generates responses. Models with more parameters generally require significantly more computing power to operate.</p>
<p>The system also advertises a context window of up to one million tokens, a measure of how much text an AI model can process or remember during a single interaction. A token roughly corresponds to a short piece of text, such as part of a word.</p>
<p>“The combination that stood out was Hunter Alpha’s 1 million token context paired with reasoning capability and free access,” said Nabil Haouam, an engineer who builds AI agent systems.</p>
<p>“Most frontier models with that context window come with real cost at scale,” he added.</p>
<p>Those specifications resemble expectations in local media for DeepSeek’s next-generation V4 model, which Chinese outlets have reported could launch as early as April. DeepSeek, like many of its Chinese competitors, is well-funded, though it has an unusual structure given that its parent company is a quantitative hedge fund rather than a tech conglomerate.</p>
<p>While the overlap does not establish a direct connection, it has intensified speculation among developers that the anonymous system could be an early test version of the upcoming release by DeepSeek.</p>
<p>“The chain-of-thought pattern is probably the strongest signal,” said Daniel Dewhurst, an AI engineer who analysed the model after its release, referring to how the AI model reasons.</p>
<p>“Reasoning style is hard to disguise and tends to reflect how a model was trained.”</p>
<p>Hunter Alpha’s scale and memory capacity also match specifications that have circulated for DeepSeek V4 since early this year, he said.</p>
<p>Still, some developers cautioned that the evidence linking the model to DeepSeek was inconclusive.</p>
<p>“My analysis suggests Hunter Alpha is likely not DeepSeek V4,” said Umur Ozkul, who runs independent AI benchmark tests, citing differences in token-related behaviour and architectural patterns when compared with DeepSeek’s existing systems.</p>
<p>He said speculation connecting the model to DeepSeek was understandable given the timing and capabilities advertised.</p>
<h3>DEVELOPER TESTING</h3>
<p>Anonymous model launches are not unusual, as platforms like OpenRouter allow developers to send queries to dozens of AI models through a single interface, making them a popular testing ground for new systems.</p>
<p>An anonymous model called Pony Alpha appeared on OpenRouter in February; five days later, Chinese firm Zhipu AI confirmed it was part of its GLM-5 system.</p>
<p>A notice on Hunter Alpha’s profile page said all prompts and completions for the model “are logged by the provider and may be used to improve the model,” underscoring the industry-wide practice of using stealth model launches for unbiased feedback.</p>
<p>The model was adopted rapidly after appearing on the platform and had processed more than 160 billion tokens as of Sunday, according to OpenRouter statistics.</p>
<p>Much of the activity came from software development tools and AI agent frameworks like OpenClaw, which allow AI systems to autonomously plan tasks and interact with external software.</p>

A mysterious AI model has developers buzzing: could it be DeepSeek's latest work?