AIpolis
Product · Jun 10, 2026, 01:49 AM · United States

On-device AI agents hit a hard memory limit. Apple's new architecture routes around it.

Summary

At WWDC26, Apple announced AFM 3, its third generation of foundation models, designed to get past the memory constraints on on-device AI. The 20-billion-parameter AFM 3 Core Advanced stores its weights in NAND flash instead of DRAM and makes routing decisions once per prompt, which avoids swapping weights token by token. The family comprises two on-device models and three server-based models. The server models run on NVIDIA GPUs in Google Cloud, while the on-device architecture is Apple's own.
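To make the difference between per-prompt and per-token routing concrete, here is a minimal Python sketch. It is not Apple's implementation: the summary only says that routing decisions are made per prompt to avoid token-by-token weight swapping. The expert-style weight blocks, the random router, and the names `count_flash_loads`, `num_experts`, and `experts_per_step` are all assumptions made for illustration.

```python
import random

def count_flash_loads(num_tokens, num_experts, experts_per_step, per_prompt, seed=0):
    """Count weight blocks read from simulated flash while decoding.

    Hypothetical model: weights are split into `num_experts` blocks kept in
    flash, and DRAM has room for only `experts_per_step` blocks at a time.
    """
    rng = random.Random(seed)
    resident = set()  # blocks currently held in DRAM
    loads = 0

    if per_prompt:
        # Route once: pick the block subset used for the whole prompt.
        chosen = set(rng.sample(range(num_experts), experts_per_step))

    for _ in range(num_tokens):
        if not per_prompt:
            # Route again for every token (stand-in for a learned router).
            chosen = set(rng.sample(range(num_experts), experts_per_step))
        # Only blocks not already in DRAM must be fetched from flash.
        loads += len(chosen - resident)
        resident = chosen

    return loads

per_prompt_loads = count_flash_loads(100, 16, 4, per_prompt=True)
per_token_loads = count_flash_loads(100, 16, 4, per_prompt=False)
print(per_prompt_loads, per_token_loads)
```

In this toy model, per-prompt routing reads each selected block from flash once, so the load count equals `experts_per_step` no matter how long the response is. Per-token routing can trigger new reads at every step.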

Why it matters

This architecture addresses the core bottleneck of on-device AI, where model size is capped by DRAM capacity, and could reshape edge AI deployment.
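A rough calculation shows why DRAM capacity becomes the limit for a model of this size. The sketch below estimates the raw weight footprint of a 20-billion-parameter model at several precisions. The bit widths are illustrative assumptions, since the summary does not state what precision AFM 3 uses, and the helper `weight_footprint_gb` is hypothetical.

```python
def weight_footprint_gb(num_params, bits_per_weight):
    """Raw weight storage in gigabytes (1 GB = 1e9 bytes), ignoring overhead."""
    return num_params * bits_per_weight / 8 / 1e9

# Bit widths are assumed for illustration, not taken from the announcement.
for bits in (16, 8, 4, 2):
    print(f"{bits:>2}-bit: {weight_footprint_gb(20e9, bits):.1f} GB")
```

At 16 bits per weight the weights alone come to 40 GB, and even at 4 bits they come to 10 GB, before counting activations, caches, or anything else running on the device.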

Content is from official & reputable-media public sources, AI-assisted and auto-published, for information only — not investment advice.

Market reaction

The following is market reference information for related companies, and does not constitute investment advice.

NVIDIA (NASDAQ: NVDA)
Anthropic (private, not listed)