--- title: "OpenAI targets customer service with new audio models" description: "OpenAI has launched new audio models aimed at enhancing customer service through voice agents. These models, including speech-to-text and text-to-speech capabilities, are designed for enterprise use a" type: "news" locale: "en" url: "https://longbridge.com/en/news/233026731.md" published_at: "2025-03-24T23:52:27.000Z" --- # OpenAI targets customer service with new audio models > OpenAI has launched new audio models aimed at enhancing customer service through voice agents. These models, including speech-to-text and text-to-speech capabilities, are designed for enterprise use and allow developers to customize speech tones. Analysts note that these innovations could reduce the need for human agents and improve automation in customer interactions. However, OpenAI faces competition from specialized AI vendors and existing contact center solutions. Challenges remain in handling specific speech nuances, such as acronyms, and the need for business integration in utilizing these APIs effectively. OpenAI introduced a new suite of audio models that power voice agents in specific enterprise settings, such as customer service. The models include speech-to-text and text-to-speech audio models in OpenAI’s Realtime API. The AI vendor also introduced gpt-4o-transcribe and gpt-4o-mini-transcribe. Gpt-4o-transcribe has an improved word error rate performance over OpenAI's open source speech-to-text model OpenAI said. The new models capture nuances of speech, reduce misrecognitions and increase transcription reality. OpenAI also introduced gpt-4o-mini TTS, a text-to-speech model that allows developers to "instruct" the model on what to say and how. The models build on GPT-4o and GPT-4o-mini architectures. ## Tone and audience According to OpenAI, developers can instruct the models to speak in a specific way. For example, users can tell the models to speak like a "sympathetic customer service agent. " The new audio models target both OpenAI's consumer audience and a small portion of the enterprise market, said Gartner analyst Arun Chandrasekaran. Many consumers use ChatGPT, so those audiences would be interested in some of the tones introduced in the audio API, such as Medieval Knight, True Crime Buff and Bedtime Story, he said. At the same time, tones like Professional and Calm will be useful in customer service settings in which the agent is dealing with an angry customer, Chandrasekaran said. "Customer service is one of the fastest growing use cases we are starting to see in the enterprise, and I'm not very surprised that all of these companies are trying to gravitate toward where the money is," he said. The new models will reduce the number of human agents needed to handle every interaction and allow for more automated interactive voice response  systems, said Forrester Research analyst William McKeon-White. "We've been seeing these already actually coming online, working with several other second-order consumers of these services who are vendors themselves," he said. "They've already been seeing strong successes with these capabilities." McKeon-White said users should benefit from OpenAI's voice models because of the level of automation and delivery that the vendor provides. "The fact that it's just natively part of what open AI is providing now is quite helpful to a lot of enterprises who are seeing a lot of different models at this point," he said. OpenAI's breakdown of the error rate of the new models shows that the models are effective across widely used languages like French and Spanish. ## Some challenges However, McKeon-White said it would be good to see how well the models handle acronyms since speech models find them challenging. Moreover, because of the competitiveness of customer service applications, OpenAI faces some challenges. One is that the vendor competes with vendors that approach customer service from a narrow perspective. For example, Sierra AI is an AI startup that focuses solely on customer service. Chandrasekaran said this differs from OpenAI, which has multiple models and multiple applications for its models. Another challenge is that many contact center vendors such as Genesys are already embedding AI technology into their products. "They're all starting to embed AI into it and, of course, are competitive to what OpenAI is doing," Chandrasekaran continued. Moreover, while the APIs are helpful for teams looking to build applications, they are not beneficial for those without teams, McKeon-White said. "Most organizations we talk with are not ready just to go consume raw APIs to go and build out a net new system," he said. "It needs business logic, it needs business understanding, and it needs like business integrations to make everything work." *Esther Shittu is an Informa TechTarget news writer and podcast host covering artificial intelligence software and systems.* ### Related Stocks - [OpenAI.NA - OpenAI](https://longbridge.com/en/quote/OpenAI.NA.md) ## Related News & Research | Title | Description | URL | |-------|-------------|-----| | India’s top telco tackles AI with $110 billion build plan and proven fast market dominance playbook | India’s top telco, Reliance Jio, plans to invest $110 billion in AI infrastructure over seven years to enhance its servi | [Link](https://longbridge.com/en/news/276406785.md) | | 10:27 ETAmigos For Kids Selected as OpenAI Ready Award Recipient Through People-First AI Fund | Amigos For Kids has been selected as a recipient of the OpenAI Ready Award through the People-First AI Fund, which suppo | [Link](https://longbridge.com/en/news/276142228.md) | | More than 20,000 sign a petition for OpenAI to resurrect GPT-4o | More than 20,000 sign a petition for OpenAI to resurrect GPT-4o | [Link](https://longbridge.com/en/news/276155147.md) | | Bartronics India Denies Collaboration With OpenAI | Bartronics India Ltd :COMPANY DENIES LAUNCHING 500 MW DATA CENTRECOMPANY DENIES COLLABORATION WITH OPENAI | [Link](https://longbridge.com/en/news/276327596.md) | | OpenAI Predicts $112 Billion More Cash Burn Through 2030 - The Information | OPENAI PREDICTS $112 BILLION MORE CASH BURN THROUGH 2030 - THE INFORMATION | [Link](https://longbridge.com/en/news/276493575.md) | --- > **Disclaimer**: This article is for reference only and does not constitute any investment advice.