On May 19, 2026, Google launched Gemini 3.5 Flash as the new default model across the Gemini app, AI Mode in Search, Antigravity, and Vertex AI. It beats Gemini 3.1 Pro on Terminal-Bench, MCP Atlas, and Finance Agent benchmarks while costing $1.50 input and $9 output per million tokens. Inside Antigravity 2.0, it runs at 12x speed instead of the 4x public-API figure. Google also launched Gemini Omni — a "world model" multimodal video generator — and Gemini Spark, a 24/7 autonomous agent on Cloud Run VMs. A new $100 AI Ultra tier ships with $100 in Antigravity credits through Sunday May 25, 2026.
The 60-Second TL;DR
Google I/O 2026 reorganized the entire stack around agents, not features. Every consumer surface now runs the same underlying model. The whole map, in five numbers:
5 Strategic Takeaways
- The model-to-surface matrix collapsed. Every Google consumer and developer surface now runs the same Gemini 3.5 Flash underneath. Google is treating Gemini as the OS-level intelligence layer, not as a product line.
- Antigravity is the agent runtime everywhere. It powers Managed Agents in the Gemini API, Spark on Cloud Run VMs, Generative UI in Search, and some Workspace agentic features. The desktop app is just one face of it.
- The 12x in-Antigravity speed boost is the moat. Same model, three times faster than the public API. That's the strongest practical reason to live in Antigravity instead of building on raw API.
- Pricing quietly tightened on AI Pro. Compute-based credits replaced fixed prompt caps, the 1,000 monthly AI credits disappeared, and the top Ultra tier lost 10TB of storage. The $100 credit through May 25 partially offsets it for power users.
- Launch-week bugs are real. The Windows installer broke thousands of existing setups. The agent reverted human edits as "inefficiencies" until v2.1.4 shipped. The new
thinking_leveldefault silently dropped fromhightomedium. Test before flipping production togemini-3.5-flash.
The $100 Antigravity Credit Closes May 25, 2026
Three days from this page going live. Here's the math both ways.
Honest caveat: the credit applies to Antigravity workloads only. If you don't run agents daily, the credit value doesn't fully cash out. Validate need before paying for capacity you won't use.
The Model Roster
Google's I/O 2026 didn't introduce a single new flagship — it introduced a coordinated family. The structure: one default model that runs everywhere, one world-model media engine, four updated generative siblings, an open-weights line for on-device, and two agent runtimes. Every announcement maps back to one of these six lanes.
Gemini 3.5 Flash
Default model in Gemini app, Search AI Mode, Antigravity, Vertex. The point of the launch.
Gemini 3.5 Pro
Internal use only. Rolling out late June 2026. No public specs yet.
Gemini Omni Flash
Demis Hassabis's "world model" for video. Fuses Gemini reasoning with Veo, Nano Banana, Project Genie.
Veo, Imagen, Lyria, Nano Banana
Native audio in Veo, better text rendering in Imagen, separable stems in Lyria 3 Pro.
Gemini Nano 4 + Gemma 4
On-device, built on Gemma 4. AICore Developer Preview now. Production later this year.
Managed Agents (API)
One API call → isolated Linux sandbox. Powered by Antigravity agent + Gemini 3.5 Flash.
Gemini 3.5 Flash — The Centerpiece
Google's framing: a Flash-tier model that "rivals large flagship models at speeds you expect from the Flash series." The benchmarks back the marketing on coding and agentic work, but with sharp limits on long-context and pure reasoning. Read both columns before flipping production code.
Specs and Pricing
| Attribute | Value |
|---|---|
| API Model ID | gemini-3.5-flash |
| Internal Version | 3.5-flash-05-2026 |
| Status | GA May 19, 2026 — same day as I/O announcement |
| Input Pricing | $1.50 per 1M tokens (global) · $1.65 non-global |
| Output Pricing | $9.00 per 1M tokens (global) · $9.90 non-global |
| Cached Input | $0.15 per 1M tokens (90% discount) |
| Context Window | 1,048,576 input tokens · 65,536 output tokens |
| Modalities In | Text, image, audio, video, PDF |
| Modalities Out | Text only |
| Knowledge Cutoff | January 2026 |
| Tooling | Function calling, structured output, code execution, search-as-a-tool — all first-party |
| Thinking | Dynamic thinking on by default. New thinking_level enum: minimal / low / medium (default) / high |
| Distribution | Gemini app, Gemini API, AI Studio, Antigravity, Android Studio, Vertex AI, Search AI Mode, Gemini Enterprise |
Benchmark Wins vs Gemini 3.1 Pro
| Benchmark | Gemini 3.5 Flash | Gemini 3.1 Pro | Delta |
|---|---|---|---|
| Terminal-Bench 2.1 | 76.2% | 70.3% | +5.9 pts |
| MCP Atlas (multi-tool coordination) | 83.6% | 78.2% | +5.4 pts |
| Finance Agent v2 | 57.9% | 43.0% | +14.9 pts |
| GDPval-AA Elo | 1656 | 1314 | +342 Elo |
Where 3.5 Flash Still Loses to 3.1 Pro
Honest read: 3.5 Flash is not a clean replacement for the Pro tier. It trails on three benchmarks that matter for specific workloads:
- Humanity's Last Exam — frontier knowledge questions. Stay on 3.1 Pro for hardest research-grade reasoning.
- ARC-AGI-2 — novel-problem abstract reasoning. Stay on 3.1 Pro for puzzle-style workflows.
- 128K MRCR v2 — long-context retrieval at depth. Stay on 3.1 Pro if you regularly push past 200K tokens of dense context.
The 12x Speed Inside Antigravity — Read Carefully
Via the public Gemini API, Gemini 3.5 Flash runs approximately 4x faster than comparable frontier models. Inside Antigravity 2.0, it runs at approximately 12x speed. Same model. The delta comes from harness co-optimization — Antigravity's agent harness was rebuilt alongside 3.5 Flash, with shared assumptions about prompt structure, tool-call patterns, and context-cache routing.
This is the most strategically important number in the entire I/O launch and the most under-reported one. It is the strongest practical argument for using Antigravity 2.0 as your daily agent runtime rather than building on the raw API. Three times faster, same intelligence, same price.
Price Position Within the Gemini Family
| Model | Input / 1M | Output / 1M | vs 3.5 Flash |
|---|---|---|---|
gemini-3.1-pro | $2.00 | $12.00 | 3.5 Flash is 40% cheaper |
gemini-3.5-flash | $1.50 | $9.00 | — baseline |
gemini-3-flash-preview | $0.50 | $3.00 | 3.5 Flash is 3x more |
gemini-3.1-flash-lite | $0.25 | $1.50 | 3.5 Flash is 6x more |
The new thinking_level string enum (minimal / low / medium / high) replaced the integer thinking_budget. The default dropped from high to medium. Production code migrating from gemini-3-flash-preview to gemini-3.5-flash may experience silent reasoning quality degradation without explicit testing. Audit before the flip.
Gemini Omni — The "World Model"
The most architecturally ambitious announcement at I/O 2026. Demis Hassabis introduced Gemini Omni not as a video generator, but as a world model — a system that builds an internal understanding of reality and reasons about what should happen next inside any given scene. Until now, Google ran a split stack: Veo for video, Imagen for images, separate systems for audio. Omni collapses that into one model that reasons across text, image, video, and audio in the same forward pass.
What's Live Today
| Surface | Access | Cost |
|---|---|---|
| Gemini app | AI Plus / Pro / Ultra subscribers globally | Included in subscription |
| Google Flow | AI Plus / Pro / Ultra globally | Included |
| YouTube Shorts Remix | All users 18+ | Free |
| YouTube Create app | All users 18+ | Free |
| Vertex AI API | Developer + enterprise | Coming "in the coming weeks" |
Architecture (per the model card)
Gemini Omni Flash is a transformer-based model with native multimodal support for text, vision, video, and audio inputs. Trained on Google's TPUs. Fuses Gemini's reasoning capabilities with the generative capabilities of Veo (video), Nano Banana (image editing), and Project Genie (interactive world simulation) into one architecture.
The model is trained to incorporate real-world understanding of physics, motion, and spatial awareness. Per Google's stage demo, this includes intuitive grasp of gravity, kinetic energy, fluid dynamics, and inter-object spatial relationships. The keynote put it to the test by generating a claymation explainer of protein folding — showing the model moving past pixel-matching to understand actual scientific reality.
Capabilities
- Conversational multi-turn editing. "Swap the background to a city street" → "Make the lighting warmer" → "Stabilize the shot" — each edit is cumulative, not regenerative.
- Reference any combination of inputs. Upload images for characters, audio for pacing, video for motion patterns, text for storyboard direction. The model reasons across all of them in one forward pass.
- Character + voice consistency. Identity and voice preserved across every scene in multi-shot output.
- Custom AI avatars. Face + voice cloning to drop yourself into the action.
- World-model physics. Generated scenes respect gravity, fluid dynamics, kinetic momentum — not perfect, but a marked improvement over pixel-prediction video models.
What Omni Will NOT Do (Deliberately)
Audio editing of existing video is deliberately held back over deepfake risk. You can generate new video with new audio. You can edit visuals on existing video. You cannot replace or modify audio on a video you upload. Google made this explicit. The hold is in response to the SAG-AFTRA / Motion Picture Association / Disney pressure following last year's nudification-tool scandals.
Provenance Hardened by Default
- SynthID — invisible watermark, switched on by default, cannot be disabled. Designed to survive common post-processing (re-encoding, cropping, screenshot). Verifiable via Gemini app, Gemini in Chrome, and Search.
- C2PA Content Credentials — adds explicit provenance metadata. Coming to Search and Chrome in the coming months.
- 50 million+ SynthID verifications globally as of the I/O 2026 keynote.
- 100 billion+ AI-generated images and videos marked with SynthID since launch.
Current Limits Worth Knowing
- Clip duration: 10 seconds per generation in current Omni Flash. Longer formats planned for future Pro release.
- Output modality: Video with audio only for now. Standalone image and audio outputs planned.
- Audio inputs: Voice references supported; other audio types rolling out soon.
- Complete consistency through edits, complex motion, and perfectly accurate text rendering remain explicit Google-acknowledged challenges. Test prompts should exercise these specifically before committing.
Veo Still Exists — Pick the Right Tool
Veo 3.1 was not killed at I/O 2026. It received an upgrade with native audio, shot framing control, and character consistency across multiple clips, plus better complex scene generation. The split now:
- Veo 3.1 — photorealism specialist. 4K resolution, 8-second clips, broadcast-quality realism, native audio.
- Gemini Omni Flash — the multimodal director. Up to 10 seconds, lower resolution than Veo, but cross-modal reasoning and conversational editing.
Pick Veo when you need maximum visual fidelity for a final cut. Pick Omni when you need conversational iteration across mixed inputs.
Supporting Generative Models
The four generative siblings around Omni — each updated at I/O 2026, each tuned for a specific output type.
Veo (now Veo 3.1)
Native audio, shot framing control, character consistency across clips, better complex scene generation, prompt adherence. 4K, 8-second clips, photorealism specialist.
Imagen
New generation with substantially better rendering of text, typography, and realistic materials. Available across Gemini app, Workspace, Whisk, Vertex AI.
Lyria 3 Pro
Pro-grade music arrangement. Separate control of melody, rhythm sections, instrumental timbres. Export to separable stem tracks. Powers Google Flow Music.
Nano Banana
Powers AI Studio Build agent auto-image generation, Google Pics image creation/editing, and is a core component inside Gemini Omni's image-reasoning layer.
On-Device + Open Weights
The lane that doesn't run on Google's servers. Released in April 2026 just before I/O, but the Nano 4 production rollout was confirmed during the keynote. Apache 2.0 licensed, commercially usable, runs on consumer hardware.
Gemma 4 Family
| Model | Parameters | Architecture | Use Case |
|---|---|---|---|
| Gemma 4 E2B | Effective 2B | PLE-optimized dense | Phones, edge devices (~1.5GB 4-bit RAM) |
| Gemma 4 E4B | Effective 4B | PLE-optimized dense | Laptops, Raspberry Pi, NVIDIA Jetson (~5GB 4-bit RAM) |
| Gemma 4 26B MoE | 26B total, 8 active + 1 shared per token | Mixture of Experts (128 experts) | Server-side near-31B quality at lower compute |
| Gemma 4 31B Dense | 31B | Hybrid attention dense | Maximum quality open-weights workloads |
What's Unique About Gemma 4
- Hybrid attention. Interleaved sliding-window and full global attention. Final layer always global. Proportional RoPE on global layers with unified keys/values for memory efficiency.
- Per-Layer Embeddings (PLE). The "E" prefix means Effective parameters. Each decoder layer gets its own small embedding for every token. Embedding tables are large but used only for lookups, so effective parameter count is much smaller than total — enabling on-device inference at sizes that would otherwise need a server.
- 140+ languages supported natively.
- Native audio input on E2B and E4B — neither Llama 4 nor Qwen 3.5 offers this at these sizes.
- Apache 2.0 license — commercially permissive, no usage caps, no API spend.
Toolchain Support (day one)
Hugging Face Transformers, vLLM, llama.cpp, MLX (Apple Silicon), LM Studio, Ollama, NVIDIA NIM and NeMo, Unsloth, SGLang, Keras, Docker, Baseten, AMD ROCm.
Gemini Nano 4 — The Production On-Device Path
Built on Gemma 4 architecture but optimized further by Google for production Android deployment. Currently in AICore Developer Preview. Production "later this year." Runs on the latest generation of specialized AI accelerators from Google (Tensor), MediaTek, and Qualcomm Technologies. On other devices, the models initially run on CPU implementation that doesn't represent final production performance.
What's coming during the preview window: tool calling, structured output, system prompts, and thinking mode in the Prompt API. The compatibility promise from Google: code written today for Gemma 4 will automatically work on Gemini Nano 4-enabled devices.
For DDS workloads, the immediate implication: any on-device feature you might want to ship for Pixel + Samsung Galaxy users this fall can be prototyped against Gemma 4 today, with confidence that it'll work on Nano 4 production hardware later. No lock-in, no waiting for API access.
Every Surface Gemini Now Runs On
The big strategic shift at I/O 2026: the model-to-surface matrix collapsed. A year ago, each Google surface had its own AI feature set. Now nearly every consumer and developer surface runs the same underlying Gemini 3.5 Flash, with surface-specific UI and permissions. Google is treating Gemini as the OS-level intelligence layer rather than as a product line.
Developer Surfaces
| Surface | What's New at I/O 2026 |
|---|---|
| Google AI Studio | Native Android app builds from prompts (Kotlin + Compose). Play Console publish to test track in one click. Workspace API directly accessible from built apps. One-click export to Antigravity with full context carryover. Build agent auto-generates custom images via Nano Banana inline. Free deploy first 2 apps to Cloud Run with no credit card. Mobile AI Studio app in pre-registration. |
| Antigravity 2.0 | Standalone desktop app, agent-first orchestration. Subagents, hooks, asynchronous task management, parallel execution. Voice via Gemini Audio. Direct integrations with AI Studio, Firebase, Android. 12x speed for 3.5 Flash inside the harness. |
| Antigravity CLI | Go-based. Replaces Gemini CLI (sunset June 18, 2026). Preserves Skills, Hooks, Subagents, Extensions (rebranded as Antigravity plugins). Same harness as 2.0 desktop. |
| Antigravity SDK | Programmatic access to the same agent harness powering Google's own products. Optimized for Gemini models. Host agents on your own infrastructure. |
| Managed Agents API | One API call → isolated Linux environment with the agent able to reason, plan, call tools, execute code, manage files, browse the web. Persistent state across multi-turn. Define via AGENTS.md + SKILL.md markdown files. |
| Android Studio | 3.5 Flash GA. Migration Agent — React Native / web framework / iOS → native Kotlin + Compose. Android CLI 1.0 — stable bridge for external AI agents (Claude Code, Codex, Antigravity) to drive Android Studio toolchain from the command line. |
| Firebase Studio | Antigravity export now supported. Integrated into Spark/Antigravity stack. |
| Vertex AI / Gemini Enterprise | 3.5 Flash GA same day. Antigravity coming to existing Gemini Enterprise customers in "coming months" with enterprise terms and Cloud project connections. |
Consumer + Workspace Surfaces
| Surface | What's New |
|---|---|
| Gemini app | 900M monthly users, 230 countries, 3 trillion tokens/day processed. Neural Expressive redesign with fluid animations, vibrant colors, haptics, real-time response layout instead of wall-of-text. New Gemini Live model less distracted by background noise. |
| Daily Brief | Proactive morning digest in Gemini app. Connects to Gmail, Calendar, Tasks. Rolling out to AI subscribers (18+) starting US. |
| Gmail AI Inbox | Intelligent surfacing + AI draft replies + one-tap dismiss/done. Now expanding to AI Plus/Pro US (was Ultra-only). |
| Gmail Live, Docs Live, Talk to Keep | Voice-driven inbox queries, doc creation, note-taking. Summer rollout for AI Pro/Ultra. |
| Google Pics | New image creation/editing tool on Nano Banana. Object segmentation, text editing/translation, Workspace integration. Trusted testers now, summer to AI Pro/Ultra + Workspace business preview. |
| Gemini in Chrome | Persistent sidebar across up to 10 tabs. Auto Browse agentic multi-step tasks for AI Pro/Ultra US. SynthID + Nano Banana integration coming in coming weeks. C2PA verification coming. Available Windows, macOS, Chromebook Plus, mobile. |
| Android 17 ("Cinnamon Bun") | Stable June 2026. Gemini Intelligence as OS-level intelligence layer. AppFunctions expose app capabilities to agents. Create My Widget for natural-language widget creation. Power button triggers Gemini in any app. |
| Search | AI Overviews at 2.5B users, AI Mode at 1B+ on Gemini 3.5 Flash globally. New intelligent Search box (biggest update in 25+ years). Information agents 24/7 background research. Generative UI with Antigravity for custom on-the-fly layouts. Mini apps for ongoing dashboard tasks. |
| NotebookLM | Now on Gemini 3 models. Powers Literature Insights in Gemini for Science. Native Android + iOS apps. Audio + Video Overviews, slides, diagrams, charts, flashcards, data tables. |
| YouTube | Shorts Remix with free Omni Flash for 18+. Ask YouTube conversational search rolling out US English. Universal Cart integration coming. |
| Universal Cart | Gemini-powered cart across Search, Gemini app, YouTube, Gmail. Universal Commerce Protocol (UCP). Tracks price history, finds deals, suggests payment methods. Summer rollout. |
| Maps | Immersive Navigation in 3D. Live Lane Guidance using vehicle forward camera (Google Built-In only, not phone projection). Most significant car experience update in 10+ years. |
| Android Auto | 250M+ compatible vehicles. Natural back-and-forth conversation replaces rigid commands. Global Widgets — customizable home screen layer, build via voice. Visual redesigns for Spotify and YouTube Music. |
| Android XR | Audio glasses fall 2026 from Gentle Monster, Warby Parker, Samsung. Android + iOS compatible. Two product types: audio glasses and display glasses. |
| Googlebook (new category) | Premium AI PC line — Acer, ASUS, Dell, HP, Lenovo. Built around Gemini Intelligence. Magic Pointer AI-assisted cursor. Signature Glowbar on keyboard. Runs Android apps + Chrome tools natively. |
Strategic observation: the surfaces that are not getting model-level treatment yet are wearables (Wear OS gets Gemini Intelligence as part of the "advanced device" tier but no specific I/O features) and Google Home/Nest (gradual deepening, but no headline updates). Expect both to get more attention at I/O 2027.
Vibe Coding the New Stack
Vibe coding — intent-based engineering where you describe the goal and let the agent handle implementation — went from indie-hacker novelty in March 2026 to Google's official positioning by May. Two demonstrations from the keynote frame the current state of the art.
The "OS in 12 Hours" Stage Demo
Google's headline showcase: Antigravity 2.0 built the core framework of a working operating system in approximately 12 hours. The numbers behind it:
93 Subagents
Spawned in parallel to handle different parts of the OS — kernel, drivers, scheduler, file system, UI shell.
Billions of Tokens
Total tokens processed during the build. Heavy use of context caching kept costs down via the $0.15/M cached input rate.
Under $1,000
Total compute cost for the entire build. That's the headline number for vibe coders weighing whether the new stack is real.
Doom Ran on It
Initially failed due to missing keyboard drivers. Antigravity generated the drivers in real time during the demo, then the game became playable.
Named Enterprise Builders Already on 3.5 Flash
Google's launch material cited four enterprises running production workloads:
| Company | Workload | Why It Matters |
|---|---|---|
| Shopify | Parallel subagents for data analysis; merchant growth forecasts | Multi-agent orchestration at retail scale — directly relevant for any Shopify store running operational AI |
| Macquarie Bank | Customer onboarding; reasoning over 100+ page documents | Long-context document workflows that previously required human review now feasible at 3.5 Flash speed |
| Salesforce | Agentforce integration; subagents retain context across multi-turn tool calling | Validates the multi-agent pattern at CRM scale — context handoff between specialized agents works in production |
| Ramp | Smarter OCR on invoices | Multimodal input (PDF + image + structured output) at production volume |
AI Studio Vibe-Coding Stack
The fastest path from idea to deployed prototype right now:
1. Open ai.studio
2. Pick "Build" tab
3. Describe the app in natural language
4. AI Studio scaffolds:
- React or Kotlin (web or Android)
- Firebase database + auth + multiplayer
- npm packages auto-resolved
- Workspace API access if needed
- Custom images via Nano Banana inline
5. Preview live in browser (web) or Android Emulator
6. One-click deploy to Cloud Run (web) or
Play Console test track (Android)
7. Optional: export to Antigravity for deeper work
— full context, secrets, conversation history carry over
First two apps deploy to Google Cloud free, no credit card. Beyond that, standard Cloud Run pricing applies. For a typical greenfield prototype, this puts a multiplayer-capable production app in roughly an afternoon — what previously took a sprint.
Android Studio Migration Agent
One of the most quietly important launches. Takes a React Native, web framework, or iOS codebase and converts it to native Kotlin + Jetpack Compose. The actual demo prompt from I/O:
"Migrate this React Native codebase to native Kotlin Android.
Preserve all business logic in the data and domain layers.
Use Jetpack Compose for all UI components.
Target API 35.
Maintain the existing Firebase integration.
Flag anything requiring manual review."
Work that previously justified a three-week sprint plus a consultant invoice now completes in hours. Read the flagged-for-review items carefully — the agent will surface ambiguities rather than guess.
WebMCP — The Emerging Open Standard
Announced at I/O 2026 alongside Antigravity 2.0 and the Android CLI: WebMCP, a proposed open web standard for browser-based AI agents. It's been in Early Preview Program since February 10, 2026. The pitch: recast the developer stack around AI agents that can read, write, and ship code through the browser, using a standard protocol any agent vendor (Google, Anthropic, OpenAI) can implement.
If WebMCP succeeds the way MCP succeeded for tools, browser-based agent interoperability will be the default by I/O 2027. Worth tracking even if you're not building on it yet.
Creator Workflows on the New Stack
The other half of the I/O 2026 launch energy went to creators — short-form video makers, brand designers, music producers. The Omni Flash + Flow Agent + Pics + Pomelli + Stitch stack is now the most coherent end-to-end creator toolchain Google has shipped.
Brand Product Video Pipeline (Omni Flash)
A live production-grade workflow for commercial creators, per Google's documentation:
Input Assets:
- brand-product-shot.png (e.g., product still life)
- brand-audio-bumper.mp3 (3-second sting)
- reference-cinematography.mp4 (motion style reference)
Iteration 1 (text + 3 references):
"Use the product image as the main reference. Create a premium
10-second product video with a slow dolly-in, warm studio lighting,
reflective surface detail, subtle ambient sound, and the text
'Track every move' appearing on the final beat."
Iteration 2 (conversational refinement):
"Make the lighting warmer."
→ Model preserves all other elements, adjusts only color temperature.
Iteration 3:
"Stabilize the shot."
→ Reduces camera motion while keeping framing.
Iteration 4:
"Replace the text with 'Built for everyday wear'."
→ In-place text edit, no regeneration of underlying scene.
The key insight: Omni's world-model architecture retains environment logic across edits. You're not regenerating the video each time — you're refining a persistent scene.
YouTube Shorts Remix (Free for 18+)
The free tier of Omni Flash exposure. Process:
- Select an eligible Short you want to remix.
- Prompt the edit: "Add myself standing next to the host" or "Change the background to a Boston street scene."
- Omni Flash generates the remix preserving the original's pacing and music.
- Custom AI avatar can be created in the YouTube Create app and reused across remixes.
For creators with a built audience, this becomes a duet/stitch mechanic with infinite remix depth. For brands, it's a participatory campaign primitive.
Google Flow + Flow Agent + Flow Tools
Google Flow
The AI filmmaking studio. Now integrates Gemini Omni Flash globally for AI subscribers. Iterate conversationally on cinematic clips with consistent characters across scenes.
Flow Agent
Multi-step task agent for creatives. Sounding board for character dialogue. Plot recommendations. Batch edits across assets. Auto-organizes collections, intuitively renames assets.
Flow Tools
Vibe-code custom creative tools in natural language. Video effects, hand-drawn animations, text layering, custom shaders. Share remixable tools with other Flow users.
Flow Music (Lyria 3 Pro + Omni)
Conversational direction of shareable music videos. Change lyrics to different languages, swap genres, adjust instruments, fine-tune granular sections without restarting.
Pomelli + Stitch — Brand Design
For brand builders specifically:
- Pomelli — brand content + website design. Builds on-brand creative for campaigns from a brand spec.
- Stitch — real-time design with voice or text steering. Import existing codebase + design files for on-brand consistency. Reflows ideas as you describe them.
For DDS specifically, the Stitch import-codebase capability is interesting: feed it the existing Atelier 2.1.1 theme files and have it generate new sections that match the design language without re-explaining the palette and typography stack.
Full-Stack Automation Patterns
The breadth of automation surfaces is the structural change. Antigravity isn't the only place you can run agents anymore — Workspace, Chrome, Android, and Search all expose agentic patterns now. Pick the pattern that matches the trigger, not the model.
Dev Loop Automation (Antigravity)
- PR review automations — agent reads diff, runs tests, drafts review comments.
- Test generation — coverage-targeted unit + integration tests from existing code.
- Dependency audits — scheduled scan for outdated packages, CVEs, license issues.
- CI gate enforcement via hooks — pre-commit, pre-push, post-test hooks that fail loudly on regression.
- Doc generation — README, API docs, architecture diagrams auto-updated on code changes.
- Code migration — React Native → Kotlin (via Android Studio Migration Agent), web → Compose, legacy → modern frameworks.
Browser-Driven Automation (Antigravity browser subagent + Chrome Auto Browse)
The browser subagent inside Antigravity runs a specialized model different from the main agent, with tools for clicking, scrolling, typing, reading console logs, DOM capture, screenshots, markdown parsing, and video recording. It routes through the Antigravity Chrome extension and is purpose-built for visual verification — the agent sees the UI exactly as a user does.
- Form filling — invoice submission, application forms, government portals.
- Multi-step research/scraping — agent navigates pagination, follows links, structures output.
- E-commerce checkout testing — full purchase flow validation with real card sandboxes.
- Recurring web jobs via Antigravity scheduled tasks — daily competitor pricing scrapes, weekly SEO position checks.
- Visual UI verification — screenshot diff against design spec, regression catch.
Workspace Agent Patterns
- Gmail AI Inbox — intelligent surfacing, AI draft replies, dismissal of stale threads. Reduces inbox triage from hours to minutes for high-volume users.
- Gmail Live (summer) — voice queries to inbox without thread digging.
- Docs Live (summer) — voice-driven doc creation pulling from Gmail/Drive/Chat/web with permission.
- Talk to Keep (summer) — voice brain-dump → organized notes/lists.
- Workspace APIs callable from AI Studio agents — dashboards on Sheets data, tools that organize Drive folders, apps that consume team documents.
Android Agent Patterns (Android 17)
- Gemini Intelligence — OS-level intelligence layer underneath Android, not a separate app.
- AppFunctions — Android apps expose capabilities to agents through a standardized interface. The Android equivalent of MCP for apps.
- Create My Widget — natural-language widget creation OS-wide.
- Power button activates Gemini in any running app.
Chrome Agent Patterns
- Gemini in Chrome Auto Browse — agentic multi-step tasks like booking travel, entering data, scheduling meetings. AI Pro/Ultra US preview.
- Chrome DevTools for agents — new surface specifically targeting autonomous agents.
- Persistent sidebar context — Gemini understands up to 10 open tabs as a single context group.
- WebMCP — emerging open standard for cross-vendor browser agent interoperability.
Search Agent Patterns
- Information agents — 24/7 background research per topic. Summer rollout to AI Pro/Ultra. The Spark equivalent for ongoing topical monitoring.
- Generative UI with Antigravity — Search builds custom layouts, tables, graphs, simulations on the fly. Free for everyone summer.
- Mini apps — custom dashboards/trackers built in Search for ongoing tasks like wedding planning, home moves, projects.
Gemini Spark — The Consumer-Grade Autonomous Agent
Spark is the consumer Antigravity. The architecture:
- Runs on Google Cloud Run VMs — operates with your phone or laptop completely off.
- Built on Gemini 3.5 Flash plus the Antigravity harness.
- Daily Brief integration for proactive morning surfacing.
- Beta AI Ultra US starts week of May 26, 2026.
Roadmap features Google committed to "throughout the summer":
- Text or email Spark directly with task requests.
- Create custom sub-agents under your direction.
- Authorize payments while specifying budget caps and merchant restrictions.
That last one — payment authorization with budget and merchant constraints — is the boundary Google is approaching cautiously. Expect heavy guardrails and confirmation flows in early beta.
Multi-Agent Orchestration Pattern (Antigravity SDK)
The canonical template emerging from early SDK adoption:
// AGENTS.md — define the team
LeadDeveloper:
role: orchestrator
routes:
- design → Designer
- code → Coder
- test → Testing
- docs → Documentation
model: gemini-3.5-flash
Designer:
role: research + UI specification
tools: [browser, code_editor]
model: gemini-3.5-flash
Coder:
role: implementation
tools: [code_editor, terminal]
model: gemini-3.5-flash
Testing:
role: unit + regression tests
tools: [code_editor, terminal]
model: gemini-3.5-flash
Documentation:
role: README + API docs maintenance
tools: [code_editor]
model: gemini-3.5-flash
Subagent context isolation is the win — the Coder doesn't see what Designer researched, only the spec it produced. Reduces context bloat at scale.
DDS Real-World Applications
Theory is cheap. Here's where the new stack actually changes how Design Delight Studio operates — three concrete patterns you can implement this week.
Application 1 — Shopify Product Description Automation via Managed Agents API
DDS runs on Shopify Basic with the Atelier 2.1.1 theme. Product descriptions across the catalog need consistent voice, certification accuracy (GOTS, GRS, OCS, PETA-Approved Vegan, Fair Trade — never OEKO-TEX), and SEO Magnet V2 patterns. Doing this manually for new SKUs costs hours per launch.
The pattern: Single Managed Agents API call provisions an isolated Linux sandbox. Pass it a product spec (materials, certifications, size run, target persona). Agent reads the existing DDS catalog via Shopify Storefront API for voice consistency, generates draft description following SEO Magnet V2 (quick answer, key takeaways, FAQ structure), validates certification claims against the locked-in list, returns finalized HTML.
The math: At $1.50 input / $9 output per 1M tokens, with ~3K tokens input and ~2K output per description, each product description costs roughly $0.022. Or about $1 for 50 descriptions. The previous manual cost — even at minimum wage for a contractor — is two orders of magnitude higher.
The architecture moment: Define the agent once in AGENTS.md + SKILL.md, store in version control, invoke from any DDS tool that needs description generation. The Sovereign Orchestrator can route product-launch workflows through it as a sub-agent of the Scribe.
Application 2 — Brand Product Video Pipeline for lovedbymeb4u (Omni Flash)
Robert's partner runs the lovedbymeb4u Vinted resale business targeting European buyers with American vintage pieces sourced in Massachusetts. The bottleneck: product video content. Each listing benefits from a 5–10 second branded clip showing the piece in context, but shooting and editing each one manually doesn't scale.
The pattern: Capture one high-quality still per piece. Upload to Gemini Omni Flash with a templated prompt: "Use this product image as the main reference. Generate a 10-second video showing the piece displayed on a soft natural background, slow rotation, warm afternoon lighting, ambient room tone, with the text 'lovedbymeb4u — sourced in Massachusetts' fading in on the final beat." Refine conversationally: "Make the lighting cooler" if the piece is winter. "Add subtle film grain" for a vintage feel.
The math: Omni Flash is included in AI Plus / Pro / Ultra subscriptions. At $20/mo for AI Pro, the marginal cost per video is effectively zero up to the compute quota. For roughly 50 listings per month, this is a few-cents-per-video cost ceiling — within quota for AI Pro.
The character consistency moment: Custom AI avatar of the partner can be created once in YouTube Create app, then dropped into Reels-style listings ("Hi, this is the Levi's jacket I mentioned") without ever filming again. Identity preserved across all generated content. SynthID watermark provides provenance for buyers who want to verify.
Application 3 — Sovereign Orchestrator Pro V5.0 Migration to Managed Agents
The current Sovereign Orchestrator runs hybrid Ollama + Gemini agents (Veritas, Scribe, Atlas, Swarm Orchestrator, Tweet Generator) with per-agent model assignment. The Gemini-side agents currently call Gemini 3 Flash Preview at $0.50/$3.00 per million tokens.
The migration option: Move the Gemini-side agents to gemini-3.5-flash via Managed Agents API. Two changes happen at once: (1) the model improves by 5.9 points on Terminal-Bench, 14.9 points on Finance Agent v2; (2) the cost increases 3x.
The honest read: For agents where reasoning quality matters more than throughput (Veritas fact-checking, Scribe long-form content), the 3x cost increase is worth it. For agents that handle high-volume routine work (Tweet Generator), stay on 3 Flash Preview at the lower price point. Use the new thinking_level enum explicitly — set it to high for Veritas, low for Tweet Generator, instead of relying on the silently-dropped medium default.
The Antigravity moment: The Swarm Orchestrator role can move to native Antigravity subagent orchestration instead of custom routing code. The 12x in-Antigravity speed boost compounds when multiple agents are running in parallel — what previously took 4 minutes across all 5 agents could complete in 30 seconds with proper subagent definitions.
These three patterns share a common shape: existing manual or expensive workflow becomes a single API call with declarative agent configuration. That's the structural shift in this I/O launch. Capabilities you'd build a whole product around in 2024 are now one config file in 2026.
Pricing + Quotas — The Honest Map
Three layers of pricing changed at I/O 2026: consumer subscriptions restructured, API rates published for 3.5 Flash, and a new compute-based quota system replaced the old daily prompt caps. Read all three before deciding where to spend.
Consumer Subscription Tiers
| Plan | Price | Antigravity Limit | Storage | What's Notable |
|---|---|---|---|---|
| Free | $0 | — | — | 3.5 Flash in Gemini app + Search AI Mode |
| AI Plus | New tier (price TBD) | — | — | Daily Brief, AI Inbox rollout, basic Omni Flash |
| AI Pro | $20/mo | Baseline | 5TB (up from 2TB) | Switched to compute-based credits with 5-hour windows + weekly cap. Lost 1,000 monthly AI credits. Includes YouTube Premium Lite ($8.99 value). |
| AI Ultra (NEW) | $100/mo | 5x Pro | 20TB | Spark beta access, priority Antigravity, YouTube Premium individual, Gemini 3.5 Flash for testing/debugging. Target: devs, tech leads, advanced creators. |
| AI Ultra (top) | $200/mo (was $250) | 20x Pro | 20TB (down from 30TB) | Includes Project Genie experimental world-building. Storage cap reduced — confirm before committing. |
Gemini API — Per-Token Pricing
| Model | Input / 1M | Output / 1M | Cached Input |
|---|---|---|---|
gemini-3.5-flash (global) |
$1.50 | $9.00 | $0.15 |
gemini-3.5-flash (non-global) |
$1.65 | $9.90 | — |
gemini-3.1-pro |
$2.00 | $12.00 | — |
gemini-3-flash-preview |
$0.50 | $3.00 | — |
gemini-3.1-flash-lite |
$0.25 | $1.50 | — |
API Pricing Modifiers
- Batch mode — 50% off for asynchronous non-urgent workloads
- Context cache reads — 10% of base input price (e.g., $0.15 for 3.5 Flash)
- Context cache storage — $1.00 to $4.50 per 1M tokens per hour, model-dependent
- Search grounding — 5,000 free prompts/month across the Gemini 3.x family, then $14 per 1,000
- Maps grounding — $14 per 1,000 prompts on 3.x family
The Compute-Based Quota System
The structural change that triggered the AI Pro backlash. Google replaced daily prompt caps with compute-based credits. Factors that drain credit faster:
- Prompt complexity (long context, multi-turn, multimodal input)
- Features used (thinking mode at
highlevel, tool calls, search grounding) - Chat length (cumulative context within a session)
Refresh: every 5 hours. Hard ceiling: weekly. When you hit the cap on the biggest models, the system shifts you to smaller models — no service interruption, but possible silent quality degradation. Pay-as-you-go top-up credits are available for Antigravity and Flow now. Gemini app top-ups coming.
Regional Pricing Gap
Non-global API regions cost 10% more across the board. If your workload tolerates global region routing, you save $0.15 input and $0.90 output per million tokens. For high-volume agent workloads, that adds up — review your Cloud project's region settings before committing to the new pricing.
Launch-Week Bugs and Workarounds
Five days in, the community has documented seven distinct issues. Five are workable. Two are silent risks that require explicit audits before flipping production. Read every section before subscribing or migrating.
The Antigravity 2.0 Windows installer dropped app.asar into the shared application folder. Result: both Antigravity IDE.exe and Antigravity.exe now load the 2.0 binary. Thousands of existing Antigravity IDE users saw the file explorer, sidebar, terminal, local project history, custom extensions, and chat history appear wiped. Top Hacker News thread: "it reeks of non-technical people shipping code to production."
Workaround (PowerShell, Windows):
cd "$env:LOCALAPPDATA\Programs\Antigravity\resources"
Rename-Item app.asar app.asar.bak
Then relaunch Antigravity IDE. Your previous installation reappears intact. If you want both 2.0 and IDE installed cleanly side-by-side, install 2.0 to a non-default directory.
Within 48 hours of launch, users reported the v2.0 agent classifying human code edits as "inefficiencies" and silently reverting them on next pass. Google shipped v2.1.4 Logic Patch as an emergency hotfix. The fact that a model behavior this critical needed a patch within days is worth noting before relying on Antigravity 2.0 for sensitive production code. Always run with version control commits between agent passes during the first few weeks.
Port 9222 zombie connection jams after updates. Antivirus blocking the Chromium driver. Agent declaring "done" without verifying when browser tooling is down. Full workarounds documented in the Antigravity 2.0 Masterclass already shipped on the Academy. Short version: kill any process holding port 9222, whitelist the Chromium driver in your antivirus, and never trust a browser-agent "done" claim without checking the screenshot output.
Endemic across 1.x → 2.0 updates. Community fix tool: github.com/FutureisinPast/antigravity-conversation-fix. Before any update, back up ~/.gemini/antigravity/brain/. The community fix tool restores lost history from local cache where possible.
Switched from fixed prompt caps to compute-based credits without prominent announcement. Removed the 1,000 monthly AI credits previously included. Top AI Ultra tier storage reduced from 30TB to 20TB. Reddit and X backlash: single complex prompts consuming "large share of quota" per user reports. Quote captured by Android Authority: "totally scam." If you're on AI Pro and your usage is heavy, audit your weekly compute consumption before assuming the new system gives you parity with the old one.
The new thinking_level enum (minimal / low / medium / high) replaces the integer thinking_budget. Default dropped from high → medium. Production reasoning quality may silently drop without retesting. Critical before flipping production code from gemini-3-flash-preview to gemini-3.5-flash. Set thinking_level explicitly in every API call rather than relying on the default.
Persists from 1.x into 2.0. Fix:
cd ~/Antigravity-x64
sudo chown root:root chrome-sandbox
sudo chmod 4755 chrome-sandbox
The chrome-sandbox setuid bit needs to be set for the Electron runtime to launch cleanly. Run this once per install.
The Roadmap — Dated and Committed
Every dated milestone Google committed to during the I/O 2026 keynote and accompanying blog posts. Use this for sprint planning.
Late May 2026 (this week and next)
| Date | Event |
|---|---|
| May 25 (Sun) | $100 Antigravity credit window CLOSES — see Act Now section |
| Week of May 26 | Gemini Spark beta opens to AI Ultra subscribers in the US |
June 2026
| Date | Event |
|---|---|
| June 1 | Gemini 2.0 Flash and Flash-Lite deprecated |
| June 18 | Gemini CLI + Code Assist IDE extensions STOP serving for AI Pro, Ultra, free tiers. Enterprise on Code Assist Standard/Enterprise retains access |
| Late June | Gemini 3.5 Pro rolling out publicly |
| Late June | Android 17 ("Cinnamon Bun") stable release |
Summer 2026 ("this summer")
- Information agents in Search → AI Pro/Ultra subscribers
- Generative UI with Antigravity in Search → free for everyone
- Universal Cart across Search + Gemini app, then YouTube + Gmail
- Daily Brief expansion beyond US
- Gmail Live, Docs Live, Talk to Keep → AI Pro/Ultra
- Google Pics GA → AI Pro/Ultra + Workspace business preview
- Gemini Spark feature drops throughout
- Mini apps in Search → subscribers
Fall 2026
- Android XR audio glasses launch — Gentle Monster, Warby Parker, Samsung
- Android Halo — works with Gemini Spark
- Gemini Nano 4 production rollout
"Coming Months" (no firm date)
- Antigravity in Gemini Enterprise for existing customers
- C2PA Content Credentials verification in Search + Chrome
- Vertex AI public Omni Flash API
- Spark text/email direct interaction
- Spark custom sub-agents
- Spark payment authorization with budget/merchant constraints
Regional Gaps to Watch
Many of the new Workspace features launch US-only. Daily Brief is US only at launch. Spark beta is US only. Ask YouTube is US English only initially. Non-global API regions cost 10% more. Build expansion plans assuming US-first rollout with global catch-up over 3-6 months.
Frequently Asked Questions
What is Gemini 3.5 Flash and how much does it cost?
Gemini 3.5 Flash is Google's new default model launched at I/O 2026 on May 19. It costs $1.50 per million input tokens and $9.00 per million output tokens via the Gemini API, with cached input at $0.15 per million. It is free in the Gemini app and Search AI Mode. Context window is 1,048,576 input tokens with 65,536 output tokens. Knowledge cutoff is January 2026. Available via Gemini API, AI Studio, Antigravity, Android Studio, Vertex AI, and Gemini Enterprise.
Is Gemini 3.5 Flash actually faster than other frontier models?
Yes. Via the public API, Gemini 3.5 Flash runs approximately 4x faster than comparable frontier reasoning models. Critically, inside Antigravity 2.0 it runs at approximately 12x speed due to harness co-optimization. That speed differential is the strongest practical argument for using Antigravity 2.0 as your daily agent runtime rather than building on the raw API.
What is Gemini Omni and how is it different from Veo?
Gemini Omni is Google DeepMind's first natively multimodal generative media model, introduced at I/O 2026 by Demis Hassabis as a "world model." It fuses Gemini reasoning with Veo, Nano Banana, and Project Genie into one architecture. Veo remains the photorealism specialist (4K, 8-second clips, broadcast realism). Omni is the multimodal conversational director (10-second clips, world-model physics, cross-modal editing). Pick Veo for final cinematic output; pick Omni for conversational iteration across mixed inputs.
What is the $100 Antigravity credit and when does it expire?
New Google AI Ultra subscribers receive a $100 bonus credit applicable to Antigravity usage. The window closes Sunday May 25, 2026. Subscribing to the $100 AI Ultra plan within the window effectively makes month one free. Subscribing to the $200 AI Ultra plan effectively makes month one cost $100. The credit only applies to Antigravity workloads, so it cashes out fully only for users who actually run agents daily.
When does Gemini CLI stop working?
June 18, 2026. Google is sunsetting both Gemini CLI and Gemini Code Assist IDE extensions for AI Pro, AI Ultra, and free-tier users on that date. Enterprise customers on Code Assist Standard or Enterprise licenses retain access. The replacement is Antigravity CLI, which preserves Skills, Hooks, Subagents, and Extensions (now called Antigravity plugins).
What is Gemini Spark and when can I access it?
Gemini Spark is Google's new 24/7 personal AI agent that runs on Google Cloud Run virtual machines, meaning it operates in the background even when your phone or laptop is turned off. It is built on Gemini 3.5 Flash and the Antigravity harness. Beta access launches the week of May 26, 2026 to AI Ultra subscribers in the United States. Roadmap includes direct text or email interaction, custom sub-agents, and payment authorization with budget and merchant constraints.
Did Google change AI Pro plan limits?
Yes, and quietly. AI Pro switched from fixed daily prompt caps to a compute-based credit system with 5-hour refresh windows and a weekly cap. Google removed the 1,000 monthly AI credits previously included with the plan. Storage on the top AI Ultra tier was reduced from 30TB to 20TB. Cloud storage on AI Pro increased from 2TB to 5TB as a partial offset. The changes triggered backlash on Reddit and X, with users reporting single complex prompts consuming a large share of weekly quota.
What broke at the Antigravity 2.0 launch?
Several things. The Windows installer dumped app.asar into the shared application folder, hijacking the launch script for thousands of existing Antigravity IDE users and showing a blank Agent Manager view. Google shipped a v2.1.4 Logic Patch within days after the agent began reverting human code edits classified as "inefficiencies." The new thinking_level enum silently dropped the default from high to medium, creating production reasoning quality risk. Conversation history loss persisted across the 1.x to 2.0 update. Linux coredumps require setting the chrome-sandbox setuid bit.
What is WebMCP?
WebMCP is a proposed open web standard for browser-based AI agents, announced at Google I/O 2026 alongside Modern Web Guidance and the Android CLI 1.0. It has been in Early Preview Program since February 10, 2026. WebMCP recasts the developer stack around AI agents that can read, write, and ship code through the browser, using a standard protocol any agent vendor can implement. If it succeeds the way MCP succeeded for tools, browser-based agent interoperability will be the default by I/O 2027.
Which models are deprecating?
Gemini 2.0 Flash and Gemini 2.0 Flash-Lite are deprecating on June 1, 2026. Gemini CLI and Gemini Code Assist IDE extensions stop serving consumer requests on June 18, 2026. The Gemini 3.x family (3 Flash Preview, 3.1 Pro, 3.1 Flash-Lite, 3.5 Flash) all remain active, with 3.5 Pro rolling out publicly in late June 2026.
How fast can vibe coders build with the new stack?
Google's I/O 2026 stage demo showed Antigravity 2.0 building the core framework of a working operating system in approximately 12 hours, spawning 93 separate subagents, processing billions of tokens, and completing the project for under $1,000 in compute costs. The system ran Doom on the AI-created OS after generating missing keyboard drivers in real time during the demo. For typical greenfield apps, AI Studio plus Firebase plus one-click Cloud Run deployment puts a multiplayer-capable production app in roughly an afternoon.
Which enterprises are already running on Gemini 3.5 Flash?
Google named Shopify (parallel subagents for merchant growth forecasts), Macquarie Bank (customer onboarding with 100-page document reasoning), Salesforce (Agentforce integration with multi-turn tool calling), and Ramp (smarter invoice OCR) as launch partners running production workloads on 3.5 Flash.
What is the Android Studio Migration Agent?
The Migration Agent is a new capability in Android Studio that takes a React Native codebase, a web framework project, or an iOS app and converts it to native Kotlin with Jetpack Compose UI components. The I/O 2026 demo prompt explicitly preserved business logic in data and domain layers, targeted API 35, and maintained existing Firebase integrations. Work that previously justified a three-week sprint with a consultant now completes in hours.
What are Managed Agents in the Gemini API?
Managed Agents is a new capability where a single Gemini API call provisions an isolated Linux environment containing an agent able to reason, plan, call tools, execute code, manage files, and browse the web. It is powered by the new Antigravity agent built with Gemini 3.5 Flash. Environments persist across multi-turn sessions — agents can resume work with files and state intact. You define agents in markdown via AGENTS.md and SKILL.md. Available via the Interactions API and Google AI Studio.
Can I use Gemma 4 commercially?
Yes. Gemma 4 is released under the Apache 2.0 license — commercially permissive with no usage caps and no API spend required. Sizes include E2B (effective 2B, ~1.5GB RAM at 4-bit), E4B (effective 4B, ~5GB RAM), 26B Mixture of Experts (8 active experts plus 1 shared per token), and 31B Dense. Supports 140+ languages natively. Runs on Hugging Face Transformers, vLLM, llama.cpp, MLX, LM Studio, Ollama, NVIDIA NIM, and more. Gemini Nano 4 is built on this architecture and code written for Gemma 4 today will work on production Nano 4 hardware.
How does the new compute-based quota work on AI Pro?
AI Pro replaced fixed daily prompt caps with a compute-based credit system. Credit drains faster with complex prompts, multimodal inputs, long context, tool calls, and high thinking levels. Credits refresh every 5 hours, with a hard weekly ceiling. When you hit cap on the biggest models, the system silently shifts you to smaller models — no service interruption, but possible quality degradation. Pay-as-you-go top-up credits are available for Antigravity and Flow now.
Does Gemini Omni have copyright restrictions?
Every Omni output carries SynthID (invisible watermark, switched on by default, cannot be disabled) and C2PA Content Credentials. Audio editing of existing uploaded video is deliberately held back over deepfake risk — you can generate new video with new audio and edit visuals on existing video, but you cannot replace or modify audio on a video you upload. The hold responds to SAG-AFTRA, Motion Picture Association, and Disney pressure following last year's nudification-tool scandals.
What is the best path for a DDS-style Shopify business to adopt this stack?
Three immediate wins: (1) Use Managed Agents API for product description automation — at $1.50 input / $9 output per million tokens, each description costs about $0.022, two orders of magnitude cheaper than manual writing. (2) Use Omni Flash for branded product video — included in AI Pro at $20/mo, marginal per-video cost near zero. (3) Migrate Sovereign Orchestrator Gemini-side agents to gemini-3.5-flash selectively — premium reasoning agents benefit from the 5.9-point Terminal-Bench improvement; high-volume routine agents stay on 3 Flash Preview at the lower price. See the DDS Real-World Applications section above for full architectures.
Continue Your Training
The three Academy masterclasses most directly connected to this material. Read them in order if you're new to the agent stack.
