What is Gemma 4?
Gemma 4 is a family of four open-source models released by Google DeepMind on April 2, 2026. The smallest model has two billion parameters and is optimized for mobile devices, while the largest model has 31 billion parameters and is designed to compete with the best AI models currently available.
There are four Gemma 4 variants:
- 31B Dense: Maximum quality
- 26B MoE (Mixture of Experts): A more efficient enterprise option
- E4B: The most balanced version, optimized for offline use
- E2B: An ultra-low-latency edge model for mobile devices
New to the Gemma family, all four models have built-in support for image and video understanding, and the two edge models also accept audio input for real-time speech processing.
Gemma 4 Agentic Layer
All Gemma 4 variants come with native function-calling support and structured JSON output. This means they can interact with external tools, plan task execution, and act on those plans, similar to OpenClaw. For developers, the Gemma 4 family is a strong base for autonomous agents that call APIs, chain tools, and execute multi-step workflows.
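To make the function-calling loop concrete, here is a minimal sketch of how an agent layer can dispatch a structured JSON tool call. The tool name `get_weather`, its registry, and the exact JSON shape of the call are illustrative assumptions, not the actual Gemma 4 output format.

```python
import json

# Hypothetical tool; a stand-in for a real external API call.
def get_weather(city: str) -> dict:
    return {"city": city, "temp_c": 21, "condition": "clear"}

# Registry mapping tool names the model may emit to callables.
TOOLS = {"get_weather": get_weather}

def dispatch(model_output: str) -> dict:
    """Parse a structured JSON tool call emitted by the model and run it.

    Assumed call shape: {"tool": "<name>", "args": {...}}
    """
    call = json.loads(model_output)
    fn = TOOLS[call["tool"]]
    return fn(**call["args"])

# Simulated model output with a structured tool call:
result = dispatch('{"tool": "get_weather", "args": {"city": "Oslo"}}')
print(result)
```

In a real agent loop, `dispatch` would run inside a cycle: send the tool result back to the model, let it plan the next step, and repeat until the task is done.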
Gemma 4 Capabilities
The most notable feature of Gemma 4 is its ability to deliver frontier-level reasoning at a fraction of the usual model size. For example, the 31B Dense variant is ranked third on the Arena AI text leaderboard and outperforms models with 20 times more parameters on math and instruction-following benchmarks. In practice, Gemma 4 produces results similar to those of much more expensive AI models — or, in the case of local AI, models that potentially won't even fit into the memory of a typical consumer machine.
Another notable feature of Gemma 4 is its native multimodality, which is standard across the entire family. This improves accuracy, speed, and efficiency when processing audio-visual content, such as images, videos, and voice recordings.
Gemma 4 is designed to be a foundation for AI agents: its native function-calling and structured JSON output let it plug into other tools and systems, and the Apache 2.0 license means you own whatever you build on top of it.
Gemma 4 is available on Overchat AI alongside Claude Mythos and DeepSeek V4. You can also download and run Gemma 4 locally through Atomic Chat — the best local AI app to run open-source models offline.
Gemma 4 Benchmarks
As of the time of writing, Gemma 4 31B Dense ranks #3 on the Arena AI text leaderboard, while Gemma 4 26B MoE is #6. This is remarkable performance for a model that activates fewer than 4 billion parameters per inference. The fewer parameters a model activates per token, the faster and more cost-efficient it is, and the less demanding it is on hardware.
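A rough back-of-the-envelope calculation shows why the active-parameter count matters. The 31B and under-4B figures come from the text above; the two-FLOPs-per-active-parameter-per-token rule of thumb is a standard approximation for transformer inference, not a published Gemma 4 number.

```python
# Rough per-token compute comparison: dense vs. MoE inference.
# Rule of thumb: ~2 FLOPs per active parameter per generated token.
dense_params = 31e9          # 31B Dense: every parameter is active
moe_active_params = 4e9      # 26B MoE: <4B parameters active per token

dense_flops_per_token = 2 * dense_params
moe_flops_per_token = 2 * moe_active_params

speedup = dense_flops_per_token / moe_flops_per_token
print(f"MoE uses ~{speedup:.1f}x less compute per token")  # ~7.8x
```

This is why the MoE variant can trail the dense model by only a few leaderboard places while being far cheaper to serve.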
