Cloud Next 2026
Google Unleashes Gemini 3 Flash, 8th-Gen TPUs, and a $750M Agent Platform Bet at Cloud Next
Sundar Pichai brands the moment an “agent-first enterprise” pivot — Vertex AI becomes the Gemini Enterprise Agent Platform, scaling to 200+ models including Claude.
At the Cloud Next 2026 conference in Las Vegas, Google rebranded Vertex AI as the Gemini Enterprise Agent Platform — a full-stack agentic control plane featuring an Agent Designer, persistent memory, A2A protocol support, and access to more than 200 models, including Anthropic’s Claude. The company framed the launch as a pivot from a model vendor into an “agent-first enterprise” provider, with Sundar Pichai telling attendees the shift defines how Google Cloud will sell to the Fortune 500 for the rest of the decade.
Alongside the platform, Google unveiled its 8th-generation TPU (dubbed TPU 8t), scaling to 9,600 chips and 2 petabytes of shared HBM per pod, along with a $750 million partner fund to accelerate agentic-AI deployments across system integrators, ISVs, and enterprise customers. The combined silicon-plus-capital push mirrors the hyperscaler-scale playbook AWS and Microsoft have run for years, but tilts decisively toward the agent-orchestration layer rather than raw compute rental.
Google also expanded the Gemini 3 family with Gemini 3 Flash, which posts 90.4% on GPQA Diamond and 78% on SWE-bench Verified — outperforming Gemini 3 Pro on coding — while Gemini 3 Pro tops LMArena at 1501 Elo. Flash is live across Gemini CLI, Android Studio, AI Studio, and Vertex AI, and is priced for high-volume agentic workloads where latency and token cost dominate the procurement conversation.
The combined announcement is the most aggressive product drop from Google in an AI event cycle so far in 2026, and establishes a clear three-way arms race with OpenAI and Anthropic over enterprise agent distribution. Bloomberg’s coverage framed the Cloud Next keynote as Google’s most explicit declaration yet that it intends to compete on the agent layer rather than cede that ground to startups building atop its models.