--- title: "Microsoft and Google Release New AI Models on the Same Day: Featuring Voice, Image, and Local Open-Source Capabilities" type: "News" locale: "zh-HK" url: "https://longbridge.com/zh-HK/news/281581173.md" description: "Microsoft and Google announced new AI models on the same day. Microsoft launched the MAI foundational model, including MAI-Transcribe-1, MAI-Voice-1, and MAI-Image-2, primarily available through Azure Foundry. Google, on the other hand, released the Gemma 4 open-source model under an Apache 2.0 license, featuring advanced reasoning and generation capabilities optimized for local execution. Both differ significantly in features and delivery methods" datetime: "2026-04-03T01:13:51.000Z" locales: - [zh-CN](https://longbridge.com/zh-CN/news/281581173.md) - [en](https://longbridge.com/en/news/281581173.md) - [zh-HK](https://longbridge.com/zh-HK/news/281581173.md) --- > 支持的語言: [简体中文](https://longbridge.com/zh-CN/news/281581173.md) | [English](https://longbridge.com/en/news/281581173.md) # Microsoft and Google Release New AI Models on the Same Day: Featuring Voice, Image, and Local Open-Source Capabilities Microsoft and Google both announced new AI models on Thursday, but with notable differences: Microsoft released its new foundational model, MAI, available only through its Azure Foundry and the US-only MAI Playground platform; whereas Google launched its entirely new Gemma 4 open-source model, capable of running locally. Furthermore, Google has changed the license for these new open-source models to Apache 2.0. ## Three "World-Class" In-House MAI Models Microsoft's "world-class" in-house MAI models include three in total: First is MAI-Transcribe-1, an "state-of-the-art" speech-to-text model that can understand the 25 most widely spoken languages globally, boasting a transcription speed 2.5 times faster than Microsoft's existing Azure Fast solution for batch transcription. Second is MAI-Voice-1, a new speech generation model capable of producing 60 seconds of audio in just 1 second. It also supports creating custom voices from short audio samples within Microsoft Foundry. Finally, MAI-Image-2 is a faster text-to-image model that has already begun rolling out in Copilot and will be progressively applied to Bing and PowerPoint. Microsoft stated: > "We are rapidly deploying these top-tier models to power our own consumer and commercial products. You will soon see more models in Foundry and across Microsoft's various products and experiences." ## Google's Gemma 4 Open-Source Model Google's Gemma 4 open-source model uses the Apache 2.0 license, moving away from its previous custom Gemma license. Google stated that these models possess advanced reasoning capabilities, agentic workflows, code generation, and visual and audio generation features, offered in four different versions optimized for local execution, even on "billions of Android devices." Google commented: > "Gemma 4 is based on the same world-class research and technology as Gemini 3 and represents the most capable set of models you can run on local hardware today. They complement our Gemini models, offering developers the industry's most powerful combination of open-source and proprietary tools." The larger 26B and 31B versions of Gemma 4 models are designed to run on consumer GPUs and can be used to power IDEs, programming assistants, and agentic workflows. The lighter E2B and E4B versions focus more on multimodal capabilities and low-latency processing, suitable for mobile and IoT devices (including Raspberry Pi). These models also support fully offline operation. Google's Gemma 4 open-source models are available for download on multiple platforms, including Hugging Face, Kaggle, and Ollama. Google emphasized: > "These models adhere to the same stringent safety protocols for infrastructure security as our proprietary models." More news, continuously updated Risk Disclosure and Disclaimer Markets involve risks; investment requires caution. This article does not constitute personal investment advice, nor has it considered individual users' specific investment objectives, financial situation, or needs. Users should consider whether any opinion, view, or conclusion in this article is appropriate for their specific circumstances. Investment based on this is at your own risk. ### 相關股票 - [Microsoft (MSFT.US)](https://longbridge.com/zh-HK/quote/MSFT.US.md) - [Alphabet - C (GOOG.US)](https://longbridge.com/zh-HK/quote/GOOG.US.md) - [Alphabet (GOOGL.US)](https://longbridge.com/zh-HK/quote/GOOGL.US.md) - [ISHRS S&P Glb It (IXN.US)](https://longbridge.com/zh-HK/quote/IXN.US.md) - [Direxion Daily MSFT Bull 2X Shares (MSFU.US)](https://longbridge.com/zh-HK/quote/MSFU.US.md) - [Direxion Daily GOOGL Bull 2X Shares (GGLL.US)](https://longbridge.com/zh-HK/quote/GGLL.US.md) - [T-Rex 2X Long Microsoft Daily Target ETF (MSFX.US)](https://longbridge.com/zh-HK/quote/MSFX.US.md) - [SPDR S&P Semicon (XSD.US)](https://longbridge.com/zh-HK/quote/XSD.US.md) - [iShares Semiconductor ETF (SOXX.US)](https://longbridge.com/zh-HK/quote/SOXX.US.md) - [Roundhill GOOGL WeeklyPay ETF (GOOW.US)](https://longbridge.com/zh-HK/quote/GOOW.US.md) - [Global X Internet (SNSR.US)](https://longbridge.com/zh-HK/quote/SNSR.US.md) - [GraniteShares 2x Long MSFT Daily ETF (MSFL.US)](https://longbridge.com/zh-HK/quote/MSFL.US.md) - [SPDR S&P Software (XSW.US)](https://longbridge.com/zh-HK/quote/XSW.US.md) - [iShares Expanded Tech Software Sector ETF (IGV.US)](https://longbridge.com/zh-HK/quote/IGV.US.md) - [First Trust Index NextG ETF (NXTG.US)](https://longbridge.com/zh-HK/quote/NXTG.US.md) - [Invesco Semiconductors ETF (PSI.US)](https://longbridge.com/zh-HK/quote/PSI.US.md) - [YieldMax MSFT Option Income Strategy ETF (MSFO.US)](https://longbridge.com/zh-HK/quote/MSFO.US.md) - [Global X Data Center & Dgtl Infrs ETF (DTCR.US)](https://longbridge.com/zh-HK/quote/DTCR.US.md) - [Global X Cloud Computing ETF (CLOU.US)](https://longbridge.com/zh-HK/quote/CLOU.US.md) - [VanEck Semiconductor ETF (SMH.US)](https://longbridge.com/zh-HK/quote/SMH.US.md) - [Direxion Semicon Bull 3X (SOXL.US)](https://longbridge.com/zh-HK/quote/SOXL.US.md) - [Direxion Daily MSFT Bear 1x Shares (MSFD.US)](https://longbridge.com/zh-HK/quote/MSFD.US.md) - [Kurv Yield Premium Strategy Microsoft MSFT ETF (MSFY.US)](https://longbridge.com/zh-HK/quote/MSFY.US.md) - [Direxion Daily Googl Bear 1x Shares (GGLS.US)](https://longbridge.com/zh-HK/quote/GGLS.US.md) ## 相關資訊與研究 - [Google in Talks With Poolside to Revive Data Center Project](https://longbridge.com/zh-HK/news/281507610.md) - [Microsoft Challenges OpenAI With Faster, In-House 'MAI' Models](https://longbridge.com/zh-HK/news/281555368.md) - [Microsoft Releases AI Models for Transcription, Voice and Image Generation](https://longbridge.com/zh-HK/news/281557276.md) - [Microsoft Rolls Out New AI Models to Take On Rivals](https://longbridge.com/zh-HK/news/281563234.md) - [Microsoft takes on AI rivals with three new foundational models](https://longbridge.com/zh-HK/news/281556605.md)