← 回總覽

Google AI Edge Gallery

📅 2026-04-06 13:18 Simon Willison 人工智能 9 分鐘 11183 字 評分: 86
Google Gemma 本地 LLM iOS 移动端 AI 边缘计算
📌 一句话摘要 Google 官方推出的 AI Edge Gallery 应用将本地 Gemma 4 和 Gemma 3 模型引入 iPhone,具备高性能、多模态能力以及交互式工具调用功能。 📝 详细摘要 Simon Willison 评测了 Google 的 AI Edge Gallery,这是一款官方 iOS 应用,旨在移动硬件上本地运行 Gemma 4(E2B 和 E4B 版本)和 Gemma 3 模型。该应用展现了令人印象深刻的性能,2.54GB 的 E2B 模型提供了快速且实用的功能。主要特性包括基于图像的问答、短音频转录,以及一个独特的“技能”演示,展示了针对八个交互式 HT

Title: Google AI Edge Gallery | BestBlogs.dev

URL Source: https://www.bestblogs.dev/article/87b99a3f

Published Time: 2026-04-06 05:18:26

Markdown Content: Skip to main content ![Image 2: LogoBestBlogs](https://www.bestblogs.dev/ "BestBlogs.dev")Toggle navigation menu Toggle navigation menuArticlesPodcastsVideosTweetsSourcesNewsletters

⌘K

Change language Switch ThemeSign In

Narrow Mode

Google AI Edge Gallery

S Simon Willison's Weblog @Simon Willison

One Sentence Summary

Google's official AI Edge Gallery app brings local Gemma 4 and Gemma 3 models to the iPhone, featuring fast performance, multimodal capabilities, and interactive tool-calling skills.

Summary

Simon Willison reviews Google's AI Edge Gallery, an official iOS app designed to run Gemma 4 (E2B and E4B sizes) and Gemma 3 models locally on mobile hardware. The app demonstrates impressive performance, with the 2.54GB E2B model providing fast and practical utility. Key features include image-based Q&A, short audio transcription, and a unique 'skills' demo that showcases tool calling against eight interactive HTML widgets, such as maps and hash calculators. While the app marks a significant milestone as the first official local model app from a major vendor, it currently lacks persistent conversation logs and exhibits some stability issues in complex demos.

Main Points

* 1. Google has launched an official iOS app for local execution of Gemma models.This represents the first time a major model vendor has released a dedicated mobile app for running their LLMs locally on consumer hardware, specifically targeting the Gemma 4 and Gemma 3 families. * 2. The Gemma 4 E2B model offers a balance of size and performance on mobile.At 2.54GB, the model is small enough for mobile storage while remaining fast and useful for on-device tasks without requiring cloud connectivity. * 3. The app features a sophisticated 'skills' demo for tool calling.It demonstrates the model's ability to interact with HTML-based widgets (e.g., interactive maps, QR code generators), showcasing the potential for local agentic workflows. * 4. Current UX limitations include ephemeral conversations.The app does not yet support permanent logs or conversation history, and the tool-calling demo can occasionally freeze during follow-up prompts.

Metadata

AI Score

86

Website simonwillison.net

Published At Today

Length 171 words (about 1 min)

Sign in to use highlight and note-taking features for a better reading experience. Sign in now

6th April 2026 - Link Blog Google AI Edge Gallery (via) Terrible name, really great app: this is Google's official app for running their Gemma 4 models (the E2B and E4B sizes, plus some members of the Gemma 3 family) directly on your iPhone.

It works _really_ well. The E2B model is a 2.54GB download and is both fast and genuinely useful.

The app also provides "ask questions about images" and audio transcription (up to 30s) with the two small Gemma 4 models, and has an interesting "skills" demo which demonstrates tool calling against eight different interactive widgets, each implemented as an HTML page (though sadly the source code is not visible): interactive-map, kitchen-adventure, calculate-hash, text-spinner, mood-tracker, mnemonic-password, query-wikipedia, and qr-code.

!Image 3: Screenshot of an "Agent Skills" chat interface using the Gemma-4-E2B-it model. The user prompt reads "Show me the Castro Theatre on a map." The model response, labeled "Model on GPU," shows it "Called JS skill 'interactive-map/index.html'" and displays an embedded Google Map centered on a red pin at The Castro Theatre in San Francisco, with nearby landmarks visible including Starbelly, Cliff's Variety, Blind Butcher, GLBT Historical Society Museum, and Fable. An "Open in Maps" link and "View in full screen" button are shown. Below the map, the model states "The interactive map view for the Castro Theatre has been shown." with a response time of 2.4 s. A text input field with "Type prompt..." placeholder, a "+" button, and a "Skills" button appear at the bottom.

(That demo did freeze the app when I tried to add a follow-up prompt though.)

This is the first time I've seen a local model vendor release an official app for trying out their models on in iPhone. Sadly it's missing permanent logs - conversations with this app are ephemeral.

S Simon Willison's Weblog @Simon Willison

One Sentence Summary

Google's official AI Edge Gallery app brings local Gemma 4 and Gemma 3 models to the iPhone, featuring fast performance, multimodal capabilities, and interactive tool-calling skills.

Summary

Simon Willison reviews Google's AI Edge Gallery, an official iOS app designed to run Gemma 4 (E2B and E4B sizes) and Gemma 3 models locally on mobile hardware. The app demonstrates impressive performance, with the 2.54GB E2B model providing fast and practical utility. Key features include image-based Q&A, short audio transcription, and a unique 'skills' demo that showcases tool calling against eight interactive HTML widgets, such as maps and hash calculators. While the app marks a significant milestone as the first official local model app from a major vendor, it currently lacks persistent conversation logs and exhibits some stability issues in complex demos.

Main Points

* 1. Google has launched an official iOS app for local execution of Gemma models.

This represents the first time a major model vendor has released a dedicated mobile app for running their LLMs locally on consumer hardware, specifically targeting the Gemma 4 and Gemma 3 families.

* 2. The Gemma 4 E2B model offers a balance of size and performance on mobile.

At 2.54GB, the model is small enough for mobile storage while remaining fast and useful for on-device tasks without requiring cloud connectivity.

* 3. The app features a sophisticated 'skills' demo for tool calling.

It demonstrates the model's ability to interact with HTML-based widgets (e.g., interactive maps, QR code generators), showcasing the potential for local agentic workflows.

* 4. Current UX limitations include ephemeral conversations.

The app does not yet support permanent logs or conversation history, and the tool-calling demo can occasionally freeze during follow-up prompts.

Key Quotes

* This is Google's official app for running their Gemma 4 models... directly on your iPhone. * The E2B model is a 2.54GB download and is both fast and genuinely useful. * This is the first time I've seen a local model vendor release an official app for trying out their models on in iPhone. * Conversations with this app are ephemeral.

AI Score

86

Website simonwillison.net

Published At Today

Length 171 words (about 1 min)

Tags

Google Gemma

Local LLM

iOS

Mobile AI

Edge Computing

Related Articles

* Introducing Moltworker: a self-hosted personal AI agent, minus the minis * My fireside chat about agentic engineering at the Pragmatic Summit * Clawdbot/moltbot Clearly Explained (and how to use it), an AI agent framework that acts as an autonomous digital employee for solopreneurs to automate coding, research, and business operations.") * First impressions of Claude Cowork, Anthropic’s general agent * Introducing Showboat and Rodney, so agents can demo what they’ve built * GPT-5.4 mini and GPT-5.4 nano: 76,000 Photos for $52 * Bring state-of-the-art agentic skills to the edge with Gemma 4 * Slashing agent token costs by 98% with RFC 9457-compliant error responses * GPT-5.4 Makes A Splash, AI’s Growth on Mobile, Data Centers Go Off-Grid, and more... * Wilson Lin on FastRender: a browser built by thousands of parallel agents HomeArticlesPodcastsVideosTweets

Google AI Edge Gallery | BestBlogs.dev

查看原文 → 發佈: 2026-04-06 13:18:26 收錄: 2026-04-06 14:00:56

🤖 問 AI

針對這篇文章提問,AI 會根據文章內容回答。按 Ctrl+Enter 送出。