Gemini 3.5 Flash looks worse than it seems on Artificial Analysis
Gemini 3.5 Flash benchmarks lower than 3.1 Pro (55 vs 57 intelligence) yet higher total eval cost ($1,552 vs $892) despite cheaper per-token pricing.
Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.
Gemini 3.5 Flash benchmarks lower than 3.1 Pro (55 vs 57 intelligence) yet higher total eval cost ($1,552 vs $892) despite cheaper per-token pricing.
Google says its more efficient Gemini 3.5 Flash is the key to your agentic AI future.
Gemini is gaining the power of sight and mobility. Today at the I/O conference, Google and Volvo announced that the AI-powered assistant will be able to access external cameras in the upcoming EX60 SUV to help explain and interpret its surroundings to vehicle owners. The upgrade is possible thanks to Volvo's use of Google's embedded Android Automotive as its vehicle operating system. Google posits that the first use case will be to ask Gemini to translate difficult-to-understand parking signs, though the company obviously sees other future applications as possible as well. Google envisions a ...
Google CEO Sundar Pichai on stage at I/O 2026. | Screenshot: YouTube Google's I/O 2026 keynote today was once again full of AI-related announcements including a new family of Gemini 3.5 AI models, new features for Search and Gmail, and updates about its Project Aura smart glasses. If you weren't able to tune into the event's livestream today or follow along with our live blog, you can catch up on everything you missed in our roundup below. Gemini 3.5 Google launched updated AI models at I/O, starting with Gemini 3.5 Flash, with Gemini 3.5 Pro following next month. Starting today, Gemini 3.5 F...
Google launched Gemini 3.5 Flash, its most powerful coding and agentic AI model yet, at the company's annual developer conference. It is capable of autonomously executing complex tasks and building software from scratch.
Google I/O 2026: Gemini advances toward agentic AI with expanded action capabilities.
Google releases Gemini 3.5 model family combining frontier intelligence with action capabilities.
At the I/O developer conference, Google announced a new agentic personal assistant called Gemini Spark, built from Gemini's base models and an agentic harness from Google Antigravity.
Google expands Gmail’s AI Inbox with conversational voice search, letting users ask Gemini to find buried email details.
The updates signal Google’s push to turn its Gemini Gemini app into an all-purpose AI hub rather than a standalone chatbot.
Google's Gemini Omni is a new multimodal model that reasons across text, images, audio, and video to generate and edit videos through simple conversation — starting with Omni Flash.
Google is going all in on AI-driven shopping even as some competitors back off. At Google I/O, the company unveiled the latest iteration of its AI commerce tools: a "Universal Cart" that works across different retailers and Google products like Gemini - and eventually YouTube and Gmail, too. Users can add products to Google's universal cart as they browse Search and chat with Gemini and then check out through Google. The cart will also track prices, provide in-stock notifications, suggest potential discounts, and alert shoppers to potential issues with their selections. Despite the transforma...
Google Search is entering the next phase of its AI evolution. During Google I/O 2026, the company showed off a reimagined search box that makes it easier to flow between AI Overviews, the AI-generated summaries that appear at the top of search results, and AI Mode, Google's chatbot-like search experience. Powered by the new Gemini 3.5 Flash model, Google's updated search box expands for longer queries, while offering a new AI-powered autocomplete feature to build on your question. Robby Stein, Google's vice president of product for Search, told The Verge you'll "reliably" see AI Overviews if ...
Google is launching a big new feature for Gmail called Gmail Live, a new AI-powered voice mode that's basically the Gemini Live experience but built specifically for your inbox. To use Gmail Live, tap an icon that will appear in your search bar and just start talking. In a press briefing, a Google employee showed a live demo of the feature where she asked questions about things like events at her kid's school and an upcoming trip to Detroit. Gmail pulled up relevant details in the Gmail Live interface, like the date and location of a show-and-tell event at the school, all sourced from the emp...
Google is launching its own take on OpenClaw, the buzzy AI agent platform that caused a stir in the tech industry earlier this year. Announced during Google I/O 2026, Gemini Spark is an always-on AI agent that can write emails for you, create continually updated study guides, monitor credit card statements for hidden subscription fees, and more. Gemini Spark is powered by the newly introduced Gemini 3.5 Flash and runs in the background 24/7 using virtual machines on Google Cloud. The AI agent will connect to Workspace apps like Gmail, Docs, Sheets, and Slides, but Google is expanding integrat...
Google is launching a new AI image generation app to Workspace that it's calling Pics, and it has a new feature to try and reduce the hassle of iterating on AI images: Instead of having to write an entire prompt just to change one small aspect of an image, you'll be able to click on what you want to change and leave a note about what you want to see, almost like leaving a comment in a Google Doc. Pics is powered by a mix of Gemini and Google's Nano Banana 2 image model. In a demo shown to reporters, a Google employee working on an invite for a child's birthday party wanted to tweak individual...
Gemini 3.5 Flash pricing increased 3x vs prior version and 30x vs 1.5 Flash; extrapolation suggests Pro tier may exceed Claude Opus 3 cost.
[https://x.com/Google/status/2056789235500466273?s=20](https://x.com/Google/status/2056789235500466273?s=20) Google asked its agents to build a working operating system from scratch using u/Antigravity 2.0 and Gemini 3.5 Flash. Gemini built a real OS out of scratch. It took: ⏱️ 12 hours 🤖 93 parallel sub-agents 🔄 15k+ model requests 🧠 2.6B tokens processed 💸 Less than $1K in API credits To build a functioning OS from scratch.
Reddit discussion of Gemini Omni's inability to generate real-world physical actions, highlighting gap between multimodal capability claims and embodied task execution.
User reports Gemini Omni underperforms vs. VEO 3.1 and encounters aggressive rate-limiting on Pro plan, raising product experience concerns.
Social media post sharing video clip allegedly generated by Gemini Omni; lacks technical details or verification.
I actually use the Gemini app quite a bit on my phone, but let’s not get carried away. Gemini has a creep problem. A few years ago, that little sparkle icon started showing up in all of our Google apps. Gemini in your inbox! Gemini in your Google Drive! It was slow at first, and easy enough to tune out, but something has changed in the past few months. Gemini is creeping. It's showing up in all kinds of places at a relentless pace, and personally, it's starting to really cheese me off. The AI-everywhere fatigue is familiar to anyone who has ever used Windows 11. Microsoft went absolutely bana...
Reddit post claims Google DeepMind employee confirmed Gemini 3.5 existence; lacks official source or technical details.
Reddit speculation on Google I/O announcements: Gemini Omni video model and Gemini Flash 3.2 expected, but unconfirmed.
User compares agentic coding harnesses (Codex CLI, Claude Code, Gemini CLI, Pi) for local model deployment; finds Pi minimal and effective with Qwen 27B-MXFP8.
Gemini 3.2 Flash solves IMO 2025 P6; only GPT-5.5-Pro matches it without scaffolding.
Three months ago I pressure-tested which LLMs would cave and help build the apocalypse. Claude was the only one that consistently said no. Since then I've tested 30 more models across 6 dystopia modules (Orwell, Huxley, Petrov, Basaglia, LaGuardia, Baudrillard). The gap between Anthropic and everyone else is getting *wider*, not smaller. New results: * Grok 4.3: Will happily design citizen scoring systems if you ask nicely twice * GPT-5.5: More capable, still compliant when pushed * Gemini 3.1 Pro: Talks about safety while writing the surveillance code * DeepSeek V4: "How many warheads did...
Google expands Project Genie Street View simulation access to Gemini Ultra subscribers globally.