The next "AI shovel seller": Computing power scheduling is the key to inference profitability, and vector databases have become a necessity

Wallstreetcn
2025.12.24 04:14
portai
I'm PortAI, I can summarize articles.

Shenwan Hongyuan stated that as generative AI applications accelerate their penetration, AI infrastructure software (AI Infra) is becoming the key "shovel seller" for application implementation, and the computing power scheduling capability directly determines the profitability of model inference services. According to estimates, with a daily query volume of 1 billion, if using H800 chips, a 10% increase in single-card throughput can improve the gross profit margin by 2-7 percentage points. On the data level, vector databases have become a necessity, and Gartner predicts that by 2025, the adoption rate of enterprise RAG technology will reach 68%