Qualcomm's and CXMT's partnership highlighted by analyst

Qualcomm and China’s CXMT Aim to Build 3D DRAM NPUs, Redefining Smartphones by 2027

Smartphone AI is improving fast, but today’s on-device intelligence is still limited by one big design compromise: the Neural Processing Unit (NPU) is typically just a small block inside the main chipset. Because it shares power and thermal headroom with the CPU and GPU, it can’t always sustain high performance for long, demanding tasks. That’s why a new report suggests a different approach is coming—one that could reshape how mobile AI workloads are handled.

According to analyst Ming-Chi Kuo, Qualcomm is working with Chinese memory makers CXMT and GigaDevice on a standalone NPU solution designed specifically for smartphones. Instead of relying entirely on the integrated NPU inside the system-on-chip, this dedicated AI accelerator would include its own high-speed memory, allowing it to run heavier AI features more efficiently and for longer periods.

The key detail is the memory design. The standalone NPU is said to pair with customized 3D DRAM, featuring 4GB of onboard memory and delivering bandwidth higher than LPDDR5X. That speed boost reportedly comes from advanced packaging methods such as Through-Silicon Via (TSV) and Hybrid Bonding—technologies often associated with stacking memory to improve throughput while keeping power use in check. In practical terms, more bandwidth means the NPU can keep feeding data to AI models without bottlenecks, which is critical for real-time and generative workloads.

Kuo claims the solution is expected to deliver around 40 TOPS (trillion operations per second). While some flagship chips advertise far higher peak AI numbers—such as claims of 100 TOPS—those peaks can depend on ideal conditions and may not be sustained during real-world usage. A dedicated accelerator holding a steady 40 TOPS could significantly lift consistent on-device AI performance, potentially making features like real-time video translation, background image generation, or other always-on AI tasks smoother and more practical on a phone.

Timing-wise, this standalone NPU is reportedly targeting mass shipments in late 2026 or early 2027. The initial focus is said to be Chinese smartphone brands, with the first devices expected in the RMB 4,000–4,500 price range (roughly $585–$660). That puts it squarely in the upper-mid to near-flagship tier—exactly where brands often look for standout features to justify higher pricing.

Still, there are obstacles that could slow adoption. Adding a separate AI chip with 4GB of 3D DRAM won’t be cheap, and memory pricing remains a major concern. Chinese phone makers including Xiaomi, OPPO, Vivo, Huawei, and others are already exploring ways to reduce DRAM and storage costs to protect margins. A dedicated NPU package could raise the bill of materials, forcing brands to decide whether the AI benefits are compelling enough to absorb the added expense or pass it on to buyers.

There’s also a software and consumer-demand challenge. Kuo notes that even if the hardware is ready, the market still lacks truly “must-have” applications that fully exploit strong on-device AI. And without killer apps, shoppers may not be convinced to pay extra for a dedicated smartphone NPU—especially if the improvements feel incremental rather than transformative in daily use.

If this standalone NPU plan becomes reality, it could mark a meaningful step toward more powerful, more consistent on-device AI—less dependent on cloud processing and less constrained by the usual CPU/GPU power-sharing limits. But whether it becomes a mainstream smartphone feature may depend less on raw TOPS and memory bandwidth, and more on cost, software adoption, and whether consumers see AI acceleration as a feature worth paying for.