MacRumors

Apple Distills Google Gemini Models for On-Device Apple Intelligence Processing

Apple is using distillation techniques to shrink Google's Gemini models for on-device use, revealing a deeper partnership than previously known as nearly 1 billion iPhones remain unable to run Apple Intelligence.

Rina Chandra, Tech Reporter

Apple is using model distillation to compress Google's Gemini large language models into smaller, more efficient versions that can run directly on iPhones and other Apple devices, according to details that have emerged about the partnership between the two companies. The arrangement gives Apple significantly more freedom to reshape Google's technology than was previously understood, pointing to a collaboration that extends well beyond the surface-level integration of Gemini into Siri's cloud processing pipeline.

Distillation at Scale

Model distillation is a technique in which a large, capable AI model — the "teacher" — is used to train a much smaller "student" model that retains as much of the original's performance as possible while running within tighter computational constraints. Apple is applying this process to Gemini models, producing compact variants optimized for the neural engines in its A-series and M-series chips.
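To make the teacher-student idea concrete, here is a minimal sketch of one common distillation objective: the student is penalized by the KL divergence between its temperature-softened output distribution and the teacher's. The logits, the temperature value, and the function names are illustrative assumptions; nothing here describes the specific method Apple or Google uses.

```python
import math

def softmax(logits, temperature=1.0):
    """Turn raw scores into probabilities; a higher temperature softens the distribution."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def distillation_loss(teacher_logits, student_logits, temperature=2.0):
    """KL(teacher || student) on softened distributions, scaled by temperature squared."""
    p = softmax(teacher_logits, temperature)  # teacher "soft targets"
    q = softmax(student_logits, temperature)  # student predictions
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return kl * temperature ** 2

# Hypothetical logits over three output classes (not real model outputs).
teacher = [4.0, 1.0, 0.5]
close_student = [3.5, 1.2, 0.4]  # roughly agrees with the teacher
far_student = [0.5, 1.0, 4.0]    # disagrees with the teacher

loss_close = distillation_loss(teacher, close_student)
loss_far = distillation_loss(teacher, far_student)
print(f"close student loss: {loss_close:.4f}")
print(f"far student loss:   {loss_far:.4f}")
```

In training, a loss of this kind would be minimized over many examples, often combined with a standard loss on ground-truth labels, so that the smaller model learns to reproduce the teacher's output distribution rather than only its top answer.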

The distilled models are designed to handle a range of on-device Apple Intelligence tasks, including text summarization, writing assistance, image understanding, and contextual suggestions. By running these capabilities locally rather than routing every request to cloud servers, Apple can offer faster response times, better privacy guarantees, and functionality that works without a network connection.

A Deeper Partnership

The distillation arrangement reveals that Apple's relationship with Google on AI goes considerably further than the public-facing deal to integrate Gemini as one of several optional cloud AI providers accessible through Siri. Under the terms of the partnership, Apple has access to Gemini model weights and architectures at a level that allows it to perform its own distillation and optimization work, rather than simply calling Google's API.

This level of access is unusual. Most companies that license large language models from providers like Google or OpenAI are limited to using the models through hosted endpoints, with no ability to modify the underlying architecture. Apple's arrangement suggests it negotiated significant technical latitude, likely leveraging its position as the world's largest consumer device platform.

Google benefits from the deal as well. Having its AI technology running on nearly every new iPhone strengthens Gemini's distribution footprint and generates licensing revenue, even if the models are substantially modified through distillation.

The Device Gap

The push for on-device AI comes against the backdrop of a significant hardware limitation. Nearly 1 billion active iPhones in the global installed base lack the processing power to run Apple Intelligence features. Apple Intelligence requires an A17 Pro chip or later, which means only iPhone 15 Pro and newer models are capable of running the full suite of on-device AI capabilities.

This has created a two-tier experience within Apple's user base. Owners of older devices are excluded from features that Apple has positioned as central to its software roadmap, including intelligent notifications, generative writing tools, and visual search enhancements. The gap is a significant driver of upgrade cycles, but it also means that the majority of iPhone users worldwide will not benefit from Apple's AI investments for several years.

Strategic Implications

Apple's distillation strategy reflects a pragmatic approach to the AI race. Rather than building every model from scratch, the company is leveraging external capabilities where they are strongest and applying its own hardware-software integration expertise to deliver the final product. The approach allows Apple to move faster than it could with purely in-house model development, while maintaining the on-device experience that differentiates its platform.

For Google, the arrangement validates Gemini's position as a foundational AI technology with reach extending beyond its own products. The deal demonstrates that even Apple, a company with vast engineering resources and a deep commitment to controlling its technology stack, has determined that partnering on foundational models is more practical than going it alone.

© 2026 Lanceum. All rights reserved.
