---
title: "ZAWYA-PRESSR: VAST Data redesigns AI inference architecture for the agentic era with NVIDIA"
type: "News"
locale: "en"
url: "https://longbridge.com/en/news/271621226.md"
description: "VAST Data has unveiled a new AI inference architecture designed for the NVIDIA Inference Context Memory Storage Platform, aimed at enhancing long-lived, agentic AI deployments. This architecture utilizes NVIDIA BlueField-4 DPUs and Spectrum-X Ethernet to improve AI-native key-value cache access and context sharing across nodes, significantly boosting power efficiency. VAST's AI Operating System integrates critical data services directly into GPU servers, optimizing performance and resource management. The company emphasizes that effective context management is crucial for AI performance as it transitions from experimentation to regulated services. VAST will showcase its innovations at the VAST Forward conference in February 2026."
datetime: "2026-01-06T08:12:23.000Z"
locales:
  - [zh-CN](https://longbridge.com/zh-CN/news/271621226.md)
  - [en](https://longbridge.com/en/news/271621226.md)
  - [zh-HK](https://longbridge.com/zh-HK/news/271621226.md)
---

> Supported Languages: [简体中文](https://longbridge.com/zh-CN/news/271621226.md) | [繁體中文](https://longbridge.com/zh-HK/news/271621226.md)


# ZAWYA-PRESSR: VAST Data redesigns AI inference architecture for the agentic era with NVIDIA

**Dubai, UAE –** **VAST Data**, the AI Operating System company, today announced a new inference architecture that enables the NVIDIA Inference Context Memory Storage Platform – deployments for the era of long-lived, agentic AI. The platform is a new class of AI-native storage infrastructure for gigascale inference. Built on NVIDIA BlueField-4 DPUs and Spectrum-X Ethernet networking, it accelerates AI-native key-value (KV) cache access, enables high-speed inference context sharing across nodes, and delivers a major leap in power efficiency.

As inference evolves from single prompts into persistent, multi-turn reasoning across agents, the notion that context stays local breaks down. Performance is increasingly governed by how efficiently inference history (KV cache) can be stored, restored, reused, extended, and shared under sustained load – not simply by how fast GPUs can compute.

VAST is rebuilding the inference data path by running VAST AI Operating System (AI OS) software natively on NVIDIA BlueField-4 DPUs, embedding critical data services directly into the GPU server where inference executes, as well as in a dedicated data node architecture. This design removes classic client-server contention and eliminates unnecessary copies and hops that inflate time-to-first-token (TTFT) as concurrency rises. Combined with VAST’s parallel Disaggregated Shared-Everything (DASE) architecture, each host can access a shared, globally coherent context namespace without the coordination tax that causes bottlenecks at scale, enabling a streamlined path from GPU memory to persistent NVMe storage over RDMA fabrics.

“Inference is becoming a memory system, not a compute job. The winners won’t be the clusters with the most raw compute – they’ll be the ones that can move, share, and govern context at line rate,” said John Mao, Vice President, Global Technology Alliances at VAST Data “Continuity is the new performance frontier. If context isn’t available on demand, GPUs idle and economics collapse. With the VAST AI Operating System on NVIDIA BlueField-4, we’re turning context into shared infrastructure – fast by default, policy-driven when needed, and built to stay predictable as agentic AI scales.”

Beyond raw performance, VAST gives AI-native organizations and enterprises deploying NVIDIA AI factories a path to production-grade inference coordination with high levels of efficiency and security. As inference moves from experimentation into regulated and revenue-driving services, teams need the ability to manage context with policy, isolation, auditability, lifecycle controls, and optional protection – all while keeping KV cache fast and usable as a shared system resource. VAST delivers those AI-native data services as part of the AI OS, helping customers avoid rebuild storms, reduce idle-GPU resource waste, and improve infrastructure efficiency as context sizes and session concurrency explode.

“Context is the fuel of thinking. Just like humans that write things down to remember them, AI agents need to save their work so they can reuse what they’ve learned," said Kevin Deierling, Senior Vice President of Networking, NVIDIA. "Multi-turn and multi-user inferencing fundamentally transforms how context memory is managed at scale. VAST Data AI OS with NVIDIA BlueField-4 enables the NVIDIA Inference Context Memory Storage Platform and a coherent data plane designed for sustained throughput and predictable performance as agentic workloads scale.”

Experience VAST’s industry-leading approach to AI and data infrastructure at **VAST Forward**, our inaugural user conference, February 24–26, 2026 in Salt Lake City, Utah. Engage with VAST leadership, customers, and partners through deep technical sessions, hands-on labs, and certification programs. **Register here to join.**

Additional Resources:

-   BLOG: More Inference, Less Infrastructure: How Customers Achieve Breakthrough Efficiency with VAST Data and NVIDIA

**About VAST Data**

VAST Data is the AI Operating System company – powering the next generation of intelligent systems with a unified software infrastructure stack that was purpose-built to unlock the full potential of AI. The VAST AI OS consolidates foundational data and compute services and agentic execution into one scalable platform, enabling organizations to deploy and facilitate communication between AI agents, reason over real-time data, and automate complex workflows at global scale. Built on VAST’s breakthrough DASE architecture – the world’s first true parallel distributed system architecture that eliminates tradeoffs between performance, scale, simplicity, and resilience – VAST has transformed its modern infrastructure into a global fabric for reasoning AI. Learn more at vastdata.com and follow VAST Data on LinkedIn, YouTube and X.

Media Contact: Vastdata@activedmc.com

Send us your press releases to pressrelease.zawya@lseg.com

Disclaimer: The contents of this press release was provided from an external third party provider. This website is not responsible for, and does not control, such external content. This content is provided on an “as is” and “as available” basis and has not been edited in any way. Neither this website nor our affiliates guarantee the accuracy of or endorse the views or opinions expressed in this press release.

The press release is provided for informational purposes only. The content does not provide tax, legal or investment advice or opinion regarding the suitability, value or profitability of any particular security, portfolio or investment strategy. Neither this website nor our affiliates shall be liable for any errors or inaccuracies in the content, or for any actions taken by you in reliance thereon. You expressly agree that your use of the information within this article is at your sole risk.

To the fullest extent permitted by applicable law, this website, its parent company, its subsidiaries, its affiliates and the respective shareholders, directors, officers, employees, agents, advertisers, content providers and licensors will not be liable (jointly or severally) to you for any direct, indirect, consequential, special, incidental, punitive or exemplary damages, including without limitation, lost profits, lost savings and lost revenues, whether in negligence, tort, contract or any other theory of liability, even if the parties have been advised of the possibility or could have foreseen any such damages.

### Related Stocks

- [GraniteShares 2x Long NVDA Daily ETF (NVDL.US)](https://longbridge.com/en/quote/NVDL.US.md)
- [XL2CSOPNVDA (07788.HK)](https://longbridge.com/en/quote/07788.HK.md)
- [XI2CSOPNVDA (07388.HK)](https://longbridge.com/en/quote/07388.HK.md)
- [YieldMax NVDA Option Income Strategy ETF (NVDY.US)](https://longbridge.com/en/quote/NVDY.US.md)
- [Direxion Daily NVDA Bear 1X ETF (NVDD.US)](https://longbridge.com/en/quote/NVDD.US.md)
- [T-REX 2X Long NVIDIA Daily Target ETF (NVDX.US)](https://longbridge.com/en/quote/NVDX.US.md)
- [T-REX 2X Inverse NVIDIA Daily Target ETF (NVDQ.US)](https://longbridge.com/en/quote/NVDQ.US.md)
- [Direxion Daily Semicondct Bull 3X ETF (SOXL.US)](https://longbridge.com/en/quote/SOXL.US.md)
- [NVIDIA Corporation (NVDA.US)](https://longbridge.com/en/quote/NVDA.US.md)

## Related News & Research

- [Nvidia’s $2 Billion Investment in Marvell: What Kind of Company Is Marvell?](https://longbridge.com/en/news/281372238.md)
- [Nvidia invests $2 billion in Marvell, launches AI partnership](https://longbridge.com/en/news/281185701.md)
- [Aviz Networks Validates Multi-Tenant Storage Networking for VAST AI Operating System Powering NVIDIA AI Factory Deployments](https://longbridge.com/en/news/281523225.md)
- [10:56 ETAt WMF a new edition of AI Global Summit, an international reference point on Artificial Intelligence](https://longbridge.com/en/news/281647788.md)
- [Nvidia (NVDA) H100 Prices Surge 40% as New GPUs Fail to Meet Insatiable Demand](https://longbridge.com/en/news/281588202.md)