Gemini can now generate downloadable files directly in chat, including Google Docs, Sheets, Slides, PDFs, Word, Excel, CSV, LaTeX, Markdown, TXT, and RTF. Go from prompt to ready-to-share file without copying, pasting, or reformatting.
Gemini can now transform complex topics into custom, interactive 3D visualizations directly within your chat. Instead of static text, get functional simulations you can manipulate in real-time. Just ask Gemini to “show me” or “help me visualize.”
The official Gemini app is now available natively on macOS. Use a simple shortcut (Option + Space) to bring up Gemini instantly. Share your active window for contextual help, analyze local files, and generate content without ever switching tabs.
3.1 Pro is designed for tasks where a simple answer isn’t enough. Building on the Gemini 3 series, 3.1 Pro represents a step forward in core reasoning. 3.1 Pro is a smarter, more capable baseline for complex problem-solving.
Gemini 3.1 Flash Live is Google’s new state-of-the-art native audio model. Built for low-latency, real-time dialogue, it excels at complex reasoning and function calling. It is the exact engine currently powering Gemini Live and Google Search Live.
Gemini 3.1 Flash-Lite is the fastest and most cost-efficient model in the Gemini 3 series. At only $0.25 input and $1.50 output per million tokens, it beats 2.5 Flash with 2.5X faster first token and 45% higher output speed while matching or beating quality.
As much as I use Gemini day-to-day, the chat interface is starting to feel a bit cluttered as my history grows. I've been thinking about the quality-of-life improvements that would make the experience much smoother, but I'm curious to know what the community thinks is the highest priority.
If you could only pick one of these features for Google to implement next, which would it be?
The Gemini Deep Research Agent is now available to developers via the Interactions API. Powered by Gemini 3.0 Pro, it autonomously plans, executes, and synthesizes multi-step research tasks.
Google's largest and most capable AI model. Built from the ground up to be multimodal, Gemini can generalize and seamlessly understand, operate across and combine different types of information, including text, images, audio, video and code.