<p>作者：愛德華多·巴普蒂斯塔</p>
<p>北京，3 月 18 日（路透社）——上週在一個開發者平台上匿名出現的強大人工智能模型引發了猜測，認為中國初創公司 DeepSeek 可能正在悄然測試其下一代系統，以便在正式發佈之前做好準備。</p>
<p>這個名為 Hunter Alpha 的免費模型於 3 月 11 日在 AI 網關平台 OpenRouter 上出現，沒有任何開發者歸屬，後來該平台將其描述為 “隱形模型”。</p>
<p>在路透社進行的測試中，Hunter Alpha 聊天機器人自稱為 “一個主要在中文環境下訓練的中國 AI 模型”，並表示其訓練數據延續到 2025 年 5 月，這與 DeepSeek 自己的聊天機器人報告的知識截止點相同。</p>
<p>然而，當被問及其創造者時，該系統拒絕透露其開發者的身份。</p>
<p>“我只知道我的名字、我的參數規模和我的上下文窗口長度，” 聊天機器人説。</p>
<p>DeepSeek 和 OpenRouter 都沒有確認該模型的創造者，並且沒有回應評論請求。</p>
<p>Hunter Alpha 的個人資料頁面將其描述為一個擁有 1 萬億參數的模型，這意味着它是使用大約 1 萬億個可調值進行訓練的，這些值決定了系統如何處理語言和生成響應。參數更多的模型通常需要顯著更多的計算能力來運行。</p>
<p>該系統還宣傳了高達一百萬個標記的上下文窗口，這是衡量 AI 模型在單次交互中可以處理或記住多少文本的指標。一個標記大致對應於一小段文本，例如一個單詞的一部分。</p>
<p>“最引人注目的組合是 Hunter Alpha 的 100 萬個標記上下文與推理能力和免費訪問的結合，” 構建 AI 代理系統的工程師納比爾·哈烏阿姆説。</p>
<p>“大多數具有該上下文窗口的前沿模型在規模上都需要真實的成本，” 他補充道。</p>
<p>這些規格與當地媒體對 DeepSeek 下一代 V4 模型的預期相似，中國媒體報道稱該模型可能最早在 4 月發佈。DeepSeek 與許多中國競爭對手一樣資金充足，但由於其母公司是量化對沖基金而非科技集團，其結構較為特殊。</p>
<p>儘管這種重疊並未建立直接聯繫，但它加劇了開發者之間的猜測，認為這個匿名系統可能是 DeepSeek 即將發佈的早期測試版本。</p>
<p>“思維鏈模式可能是最強的信號，” AI 工程師丹尼爾·德赫斯特在模型發佈後分析時表示，指的是 AI 模型的推理方式。</p>
<p>“推理風格很難偽裝，往往反映了模型的訓練方式。”</p>
<p>他説，Hunter Alpha 的規模和記憶容量也與今年早些時候流傳的 DeepSeek V4 的規格相匹配。</p>
<p>不過，一些開發者警告説，將該模型與 DeepSeek 聯繫起來的證據並不確鑿。</p>
<p>“我的分析表明，Hunter Alpha 可能不是 DeepSeek V4，” 獨立 AI 基準測試負責人烏穆爾·奧茲庫爾説，他提到與 DeepSeek 現有系統相比，標記相關行為和架構模式的差異。</p>
<p>他説，考慮到時間和宣傳的能力，將該模型與 DeepSeek 聯繫起來的猜測是可以理解的。</p>
<h3>開發者測試</h3>
<p>匿名模型的發佈並不罕見，因為像 OpenRouter 這樣的平台允許開發者通過單一接口向數十個 AI 模型發送查詢，使其成為新系統的熱門測試場。</p>
<p>一個名為 Pony Alpha 的匿名模型於 2 月份出現在 OpenRouter 上，五天後中國公司 Zhipu AI 確認它是其 GLM-5 系統的一部分。</p>
<p>Hunter Alpha 個人資料頁面上的通知表示，所有模型的提示和完成 “都由提供者記錄，並可能用於改進模型”，強調了行業普遍採用隱形模型發佈以獲取無偏見反饋的做法。</p>
<p>該模型在平台上出現後迅速被採用，截至週日，根據 OpenRouter 的統計數據，處理了超過 1600 億個標記。</p>
<p>大部分活動來自軟件開發工具和 AI 代理框架，如 OpenClaw，這些工具允許 AI 系統自主規劃任務並與外部軟件互動。</p>

深度求索

<p>By Eduardo Baptista</p>
<div class="lb-trans"><p>作者：愛德華多·巴普蒂斯塔</p>
</div><p>BEIJING, March 18 (Reuters) - A powerful artificial intelligence model that appeared anonymously on a developer platform last week has sparked speculation that Chinese startup DeepSeek may be quietly testing its next-generation system ahead of an official launch.</p>
<div class="lb-trans"><p>北京，3 月 18 日（路透社）——上週在一個開發者平台上匿名出現的強大人工智能模型引發了猜測，認為中國初創公司 DeepSeek 可能正在悄然測試其下一代系統，以便在正式發佈之前做好準備。</p>
</div><p>The free model, called Hunter Alpha, surfaced on the AI gateway platform OpenRouter on March 11 without any developer attribution and was later described by the platform as a “stealth model.”</p>
<div class="lb-trans"><p>這個名為 Hunter Alpha 的免費模型於 3 月 11 日在 AI 網關平台 OpenRouter 上出現，沒有任何開發者歸屬，後來該平台將其描述為 “隱形模型”。</p>
</div><p>During tests conducted by Reuters, the Hunter Alpha chatbot described itself as “a Chinese AI model primarily trained in Chinese” and said its training data extended to May 2025, the same knowledge cutoff point reported by DeepSeek’s own chatbot.</p>
<div class="lb-trans"><p>在路透社進行的測試中，Hunter Alpha 聊天機器人自稱為 “一個主要在中文環境下訓練的中國 AI 模型”，並表示其訓練數據延續到 2025 年 5 月，這與 DeepSeek 自己的聊天機器人報告的知識截止點相同。</p>
</div><p>When asked about its creator, however, the system declined to identify its developer.</p>
<div class="lb-trans"><p>然而，當被問及其創造者時，該系統拒絕透露其開發者的身份。</p>
</div><p>“I only know my name, my parameter scale and my context window length,” the chatbot said.</p>
<div class="lb-trans"><p>“我只知道我的名字、我的參數規模和我的上下文窗口長度，” 聊天機器人説。</p>
</div><p>Neither DeepSeek nor OpenRouter has identified the model’s creator and they did not respond to requests for comment.</p>
<div class="lb-trans"><p>DeepSeek 和 OpenRouter 都沒有確認該模型的創造者，並且沒有回應評論請求。</p>
</div><p>Hunter Alpha’s profile page describes it as a 1-trillion-parameter model, meaning it was trained using roughly one trillion adjustable values that determine how the system processes language and generates responses. Models with more parameters generally require significantly more computing power to operate.</p>
<div class="lb-trans"><p>Hunter Alpha 的個人資料頁面將其描述為一個擁有 1 萬億參數的模型，這意味着它是使用大約 1 萬億個可調值進行訓練的，這些值決定了系統如何處理語言和生成響應。參數更多的模型通常需要顯著更多的計算能力來運行。</p>
</div><p>The system also advertises a context window of up to one million tokens, a measure of how much text an AI model can process or remember during a single interaction. A token roughly corresponds to a short piece of text, such as part of a word.</p>
<div class="lb-trans"><p>該系統還宣傳了高達一百萬個標記的上下文窗口，這是衡量 AI 模型在單次交互中可以處理或記住多少文本的指標。一個標記大致對應於一小段文本，例如一個單詞的一部分。</p>
</div><p>“The combination that stood out was Hunter Alpha’s 1 million token context paired with reasoning capability and free access,” said Nabil Haouam, an engineer who builds AI agent systems.</p>
<div class="lb-trans"><p>“最引人注目的組合是 Hunter Alpha 的 100 萬個標記上下文與推理能力和免費訪問的結合，” 構建 AI 代理系統的工程師納比爾·哈烏阿姆説。</p>
</div><p>“Most frontier models with that context window come with real cost at scale,” he added.</p>
<div class="lb-trans"><p>“大多數具有該上下文窗口的前沿模型在規模上都需要真實的成本，” 他補充道。</p>
</div><p>Those specifications resemble expectations in local media for DeepSeek’s next-generation V4 model, which Chinese outlets have reported could launch as early as April. DeepSeek, like many of its Chinese competitors, is well-funded, though it has an unusual structure given its parent company is a quantitative hedge fund rather than a tech conglomerate.</p>
<div class="lb-trans"><p>這些規格與當地媒體對 DeepSeek 下一代 V4 模型的預期相似，中國媒體報道稱該模型可能最早在 4 月發佈。DeepSeek 與許多中國競爭對手一樣資金充足，但由於其母公司是量化對沖基金而非科技集團，其結構較為特殊。</p>
</div><p>While the overlap does not establish a direct connection, it has intensified speculation among developers that the anonymous system could be an early test version of the upcoming release by DeepSeek.</p>
<div class="lb-trans"><p>儘管這種重疊並未建立直接聯繫，但它加劇了開發者之間的猜測，認為這個匿名系統可能是 DeepSeek 即將發佈的早期測試版本。</p>
</div><p>“The chain-of-thought pattern is probably the strongest signal,” said Daniel Dewhurst, an AI engineer who analysed the model after its release, referring to how the AI model reasons.</p>
<div class="lb-trans"><p>“思維鏈模式可能是最強的信號，” AI 工程師丹尼爾·德赫斯特在模型發佈後分析時表示，指的是 AI 模型的推理方式。</p>
</div><p>“Reasoning style is hard to disguise and tends to reflect how a model was trained.”</p>
<div class="lb-trans"><p>“推理風格很難偽裝，往往反映了模型的訓練方式。”</p>
</div><p>Hunter Alpha’s scale and memory capacity also match specifications that have circulated for DeepSeek V4 since early this year, he said.</p>
<div class="lb-trans"><p>他説，Hunter Alpha 的規模和記憶容量也與今年早些時候流傳的 DeepSeek V4 的規格相匹配。</p>
</div><p>Still, some developers cautioned that the evidence linking the model to DeepSeek was inconclusive.</p>
<div class="lb-trans"><p>不過，一些開發者警告説，將該模型與 DeepSeek 聯繫起來的證據並不確鑿。</p>
</div><p>“My analysis suggests Hunter Alpha is likely not DeepSeek V4,” said Umur Ozkul, who runs independent AI benchmark tests, citing differences in token-related behaviour and architectural patterns when compared with DeepSeek’s existing systems.</p>
<div class="lb-trans"><p>“我的分析表明，Hunter Alpha 可能不是 DeepSeek V4，” 獨立 AI 基準測試負責人烏穆爾·奧茲庫爾説，他提到與 DeepSeek 現有系統相比，標記相關行為和架構模式的差異。</p>
</div><p>He said speculation connecting the model to DeepSeek was understandable given the timing and capabilities advertised.</p>
<div class="lb-trans"><p>他説，考慮到時間和宣傳的能力，將該模型與 DeepSeek 聯繫起來的猜測是可以理解的。</p>
</div><h3>DEVELOPER TESTING</h3>
<div class="lb-trans"><h3>開發者測試</h3>
</div><p>Anonymous model launches are not unusual, as platforms like OpenRouter allow developers to send queries to dozens of AI models through a single interface, making them a popular testing ground for new systems.</p>
<div class="lb-trans"><p>匿名模型的發佈並不罕見，因為像 OpenRouter 這樣的平台允許開發者通過單一接口向數十個 AI 模型發送查詢，使其成為新系統的熱門測試場。</p>
</div><p>An anonymous model called Pony Alpha appeared on OpenRouter in February before Chinese firm Zhipu AI confirmed it was part of its GLM-5 system five days later.</p>
<div class="lb-trans"><p>一個名為 Pony Alpha 的匿名模型於 2 月份出現在 OpenRouter 上，五天後中國公司 Zhipu AI 確認它是其 GLM-5 系統的一部分。</p>
</div><p>A notice on Hunter Alpha’s profile page said all prompts and completions for the model “are logged by the provider and may be used to improve the model,” underscoring the industry-wide practice of using stealth model launches for unbiased feedback.</p>
<div class="lb-trans"><p>Hunter Alpha 個人資料頁面上的通知表示，所有模型的提示和完成 “都由提供者記錄，並可能用於改進模型”，強調了行業普遍採用隱形模型發佈以獲取無偏見反饋的做法。</p>
</div><p>The model was adopted rapidly after appearing on the platform and processed more than 160 billion tokens as of Sunday, according to OpenRouter statistics.</p>
<div class="lb-trans"><p>該模型在平台上出現後迅速被採用，截至週日，根據 OpenRouter 的統計數據，處理了超過 1600 億個標記。</p>
</div><p>Much of the activity came from software development tools and AI agent frameworks like OpenClaw, which allow AI systems to autonomously plan tasks and interact with external software.</p>
<div class="lb-trans"><p>大部分活動來自軟件開發工具和 AI 代理框架，如 OpenClaw，這些工具允許 AI 系統自主規劃任務並與外部軟件互動。</p>
</div>

一個神秘的 AI 模型引發了開發者們的熱議：這會是深度求索的最新力作嗎？