---
title: "OpenAI releases GPT-5.4 mini and nano, approaching flagship model performance at a lower cost"
type: "News"
locale: "en"
url: "https://longbridge.com/en/news/279483630.md"
description: "OpenAI stated that as smaller models become faster and more powerful, developers no longer need to use a single model to handle all tasks, but can build systems where large models are responsible for decision-making, while smaller models execute tasks quickly and at scale. \"GPT-5.4 mini is the most powerful small model we have developed for this workflow so far.\""
datetime: "2026-03-18T03:08:24.000Z"
locales:
  - [zh-CN](https://longbridge.com/zh-CN/news/279483630.md)
  - [en](https://longbridge.com/en/news/279483630.md)
  - [zh-HK](https://longbridge.com/zh-HK/news/279483630.md)
---

# OpenAI releases GPT-5.4 mini and nano, approaching flagship model performance at a lower cost

OpenAI launched its two most powerful small models to date, GPT-5.4 mini and GPT-5.4 nano, on Tuesday, significantly narrowing the performance gap with flagship models at lower latency and cost.

**GPT-5.4 mini surpasses the previous generation GPT-5 mini across core dimensions such as programming, reasoning, multimodal understanding, and tool invocation, with a speed increase of over 2 times, and approaches the performance of the larger GPT-5.4 in benchmark tests like SWE-Bench Pro.**

**GPT-5.4 nano is positioned as the lowest-cost, lowest-latency lightweight option, available to developers solely through API, designed specifically for data classification, extraction, and simple programming sub-tasks.**

The launch of these two models aims to fill the gap in real-time interaction scenarios where large models struggle to be implemented due to high latency, directly impacting the rapidly growing commercial markets covering programming assistants, AI agent systems, and multimodal applications.

## mini for consumer end, nano exclusive to API

GPT-5.4 mini is now available across three channels: OpenAI API, Codex platform, and ChatGPT.

**The API pricing for GPT-5.4 mini is $0.75 per million input tokens and $4.50 per million output tokens**, supporting text and image input, tool invocation, function calls, web searches, file retrieval, computer control, and skill expansion, with a context window of up to 400,000 tokens.

**On the Codex platform, GPT-5.4 mini consumes only 30% of the GPT-5.4 quota, reducing the cost for developers handling simple programming tasks to about one-third of the flagship model.** Codex also supports delegating workloads to sub-agents running on GPT-5.4 mini, allowing tasks with lower reasoning density to automatically fall to the cheaper model.

**On the ChatGPT side, Free and Go users can select the "Thinking" feature to use GPT-5.4 mini through the "+" menu; other paid users will have this model activated as an automatic downgrade option once the GPT-5.4 Thinking throughput rate limit is reached.**

GPT-5.4 nano is currently available only through API for developers, priced at $0.20 per million input tokens and $1.25 per million output tokens, making it the lowest-priced among the two new models. OpenAI states that nano is suitable for scenarios where higher-level models coordinate and manage sub-agents responsible for handling secondary support tasks.

## mini approaches flagship, nano surpasses previous generation

From the evaluation data released by OpenAI, the performance of GPT-5.4 mini in programming and multimodal tasks is particularly outstanding.

**In the programming benchmark SWE-bench Pro, mini scored 54.4%, narrowing the gap with GPT-5.4's 57.7% to 3.3 percentage points, significantly higher than GPT-5 mini's 45.7%.**

****

**In the computer control benchmark** OSWorld-Verified, mini approached GPT-5.4's 75.0% with a score of 72.1%, significantly ahead of GPT-5 mini's 42.0%.

**In terms of tool invocation capability**, GPT-5.4 mini scored 93.4% in the τ2-bench telecommunications test, a significant improvement over GPT-5 mini's 74.1%. In the general intelligence test GPQA Diamond, mini scored 88.0%, while nano also reached 82.8%, both surpassing GPT-5 mini's 81.6%.

It is noteworthy that GPT-5.4 nano performed worse than GPT-5 mini in some visual tasks, with an OSWorld-Verified score of 39.0%, lower than the latter's 42.0%. However, in programming and tool invocation tasks, nano still achieved significant improvements over the previous generation.

**OpenAI stated that the design priority of nano is low latency and low cost, rather than overall performance, and developers need to weigh trade-offs based on specific tasks when making selections.**

## Sub-agent architecture, multi-model collaboration as a new paradigm for product design

OpenAI emphasized the position of the two new models in a multi-model hierarchical system in the release materials.

**Taking its self-developed programming assistant Codex as an example, GPT-5.4 is responsible for planning, coordinating, and final judgment, while the GPT-5.4 mini sub-agent processes finer-grained sub-tasks such as codebase retrieval, large file review, and document assistance in parallel.****OpenAI stated that with smaller models being faster and more powerful, developers no longer need to use a single model to handle all tasks, but can build systems where large models are responsible for decision-making while smaller models execute tasks quickly and at scale.** OpenAI said:

> **GPT-5.4 mini is the most powerful small model we have developed for this workflow to date.**

This architecture is particularly critical for high-concurrency work, where response latency directly affects product experience in scenarios such as programming assistance, screenshot analysis, and real-time image understanding. The optimal choice is often not the most capable model, but rather the one that achieves the best balance between speed, tool reliability, and task performance.

For developers, the release of GPT-5.4 mini and nano means a clearer path to significantly reducing inference costs without sacrificing the overall intelligence level of the system

### Related Stocks

- [OpenAI.NA](https://longbridge.com/en/quote/OpenAI.NA.md)

## Related News & Research

- [Ashton Kutcher Invested In OpenAI Early: $30 Million Bet Could Be Worth Billions At $1.5 Trillion IPO](https://longbridge.com/en/news/287437429.md)
- [OpenAI "made a dollar but lost a dollar and two ounces," while Anthropic has started making money.](https://longbridge.com/en/news/287320908.md)
- [ChatIPO: Deutsche Breaks Down What To Expect From OpenAI's Record-Breaking Public Plans](https://longbridge.com/en/news/287258845.md)
- [Five things to know about OpenAI’s potentially record-breaking IPO plans](https://longbridge.com/en/news/287381107.md)
- [Cheap AI could derail OpenAI and Anthropic's IPOs](https://longbridge.com/en/news/287102333.md)