AlphaEvolve: How our Gemini-powered coding agent is scaling impact across fields
Google DeepMind releases AlphaEvolve, a Gemini-powered coding agent demonstrating applications across business, infrastructure, and scientific domains.
Every story matching this topic across titles and summaries, newest first.
Google DeepMind releases AlphaEvolve, a Gemini-powered coding agent demonstrating applications across business, infrastructure, and scientific domains.
Google Chrome may be taking up more of your storage than expected thanks to a large on-device AI model file that, in some cases, is being automatically downloaded to the browser's system folders. Users who have noticed unexplained drops in their available desktop device storage are now discovering that Chrome is installing a 4GB weights.bin file inside their browser directory when certain AI features are enabled. The weights.bin file in question is connected to Google's Gemini Nano AI model, which powers Chrome AI tools like scam detection, writing assistance, autofill, and suggestion feature...
Google Home users can now ask Gemini to complete more complex, multi-step tasks and combine multiple tasks in a single command. Google has updated Gemini for Home to Gemini 3.1, which it says will improve the smart home assistant's ability to interpret and act on requests. The upgrade will also make Gemini for Home better at handling recurring and all-day events and allow users to "move around" upcoming events. Last month, Google also updated Gemini for Home with improvements for understanding natural language and identifying devices correctly. The upgrades follow reports of bugs in Google's ...
Fairness audit of five LLMs (Gemini, GPT-4, DeepSeek, Mistral, Nemotron) on emergency triage reveals gender bias persistence in clinical decision support.
Google's smart home ecosystem is getting its biggest update since the AI-fueled 2025 revamp.
Google Gemini API adds webhook support for asynchronous job notifications, reducing polling overhead for long-running requests.
Reddit user discusses token limits on Claude Opus 4.7 and compares subscription costs across Claude, ChatGPT, and Gemini.
Leaked Google I/O details suggest Gemini 'Omni' and versions 3.2/3.5 in development, indicating multimodal expansion.
**New arxiv paper just landed that's worth reading if you're interested in stylometry, AI revision, or the prose-writing strand of the 4.7 discussion.** Berkeley researcher Tom van Nuenen ran 300 personal narratives through three frontier models (Claude-class, ChatGPT-class, Gemini-class) under three prompt conditions: generic "improve this," generic "rewrite this," and explicitly "revise this while preserving the original voice." He measured 13 stylometric markers in input and output: function words, contractions, first-person pronouns, vocabulary diversity, sentence length variance, punctu...
Developer demo of generative game engine using Gemini 3 for spell generation with 6-player multiplayer physics simulation.
Reddit speculation on Google's competitive response to Anthropic following reported $40B investment, lacks substantive claims.
Here’s an early look at the new Gemini assistant on a vehicle infotainment system. | Image: Google Google is preparing to update vehicles that have Google built-in with its Gemini AI assistant. This will be an upgrade from the current Google Assistant according to Google's announcement, and promises to provide an improved experience for natural conversations, fetching vehicle-specific information, settings adjustments, and more. "When cars with Google built-in first hit the road in 2020, we made a commitment that your car will get better over time," Google senior product manager Alankar Agnih...
Google Search queries hit an "all time high" in the first quarter of 2026, according to a statement from CEO Sundar Pichai published as part of Alphabet's earnings on Wednesday. "Our AI investments and full stack approach are lighting up every part of the business," Pichai says. "Search had a strong quarter with AI experiences driving usage, queries at an all time high, and 19% revenue growth." He also notes that Q1 was "our strongest quarter ever for our consumer AI plans, driven by the Gemini App" and that the company now has more than 350 million paid subscriptions, with "YouTube and Googl...
Over the past several weeks, I've been working on HyperResearch, a Claude Code skill harness that converts CC into the most intelligent deep research framework out there. HyperResearch surpasses OpenAI, Google, and NVIDIA's offerings in the agentic search space based on DeepResearch Bench. It's open-source, installable with a single command, and uses your CC subscription, so you don't have to pay for OpenAI or Gemini Pro. It uses a 16-step pipeline that creates a searchable, persistent knowledge store during each session that can be built upon in later searches. I designed it to align with ...
Reddit discussion speculating GPT-5.x and Opus 4.7 show quality degradation; estimates Gemini 2.5 Pro at ~500B parameters with search-driven performance.
HealthNLP_Retrievers system using Gemini 2.5 Pro in cascaded pipeline for grounded clinical QA over electronic health records.
Google TV just got more Gemini features, including the ability to transform photos and videos with tools Nano Banana and Veo.
Gemini is coming to Cadillac, Chevrolet, Buick, and GMC vehicles. | Image: General Motors General Motors is planning to bring Google's Gemini AI assistant to around four million vehicles across the US. Model year 2022 and newer Cadillac, Chevrolet, Buick, and GMC vehicles with Google built-in will be eligible for the AI upgrade, which will be rolled out via over-the-air software updates for GM's infotainment system "over several months," according to GM's announcement. GM says this update represents "one of the largest deployments of Gemini in the industry," and that "customers will notice an...
A recent paper published in *JMIR Mental Health* (Csigó & Cserey, 2026) caught my attention. The researchers administered the 10 standard Rorschach inkblot cards to three multimodal LLMs (GPT-4o, Grok 3, Gemini 2.0) and coded their responses using the Exner Comprehensive System. They analyzed the models' "perceptual styles," determinants (like human movement vs. color), and human-related content themes. However, I am seriously struggling to understand the methodological validity of this setup, and I’m curious what the scientific community thinks. My main concerns are: Massive Data Cont...
Google signed classified AI agreement with Pentagon enabling military use of Gemini models despite internal employee opposition.
Gemini gets preferential treatment on Android, but maybe not for long (in Europe).
GPT-5.5 ranks 2nd on Extended NYT Connections Benchmark; Kimi K2.6 leads open-weights; DeepSeek V4 Pro and Qwen 3.6 Max show significant gains.
AstroVLBench evaluates six frontier VLMs on 4,100+ astronomy tasks; Gemini 3 Pro best performer but modality-dependent gaps remain.
Claude 4.7 identified journalist Kelsey Piper from 125 words of unpublished writing across multiple genres; ChatGPT and Gemini failed same test, raising privacy/fingerprinting concerns.
Reddit user's subjective ranking of GPT-5.5, Claude Opus 4.6, Gemini 3.1 Pro; commentary on frontier model performance and business incentives.
Reddit speculation about Google Gemini/Nano model release timing ahead of I/O conference.
DeepSeek V4 Flash matches Gemini 3 Flash performance at 20% of the cost, pressuring pricing economics.
DeepSeek V4 Pro costs 15x more to run than V3.2 on Artificial Analysis benchmarks, exceeding Gemini 3.1 Pro pricing.
Consumer tips for using Gemini AI to organize homes and digital spaces via cleaning schedules and inbox management.
I just played a game with my kids telling it to generate random images from gibbrish, and one of them was this, never mentioned Gemini or Nano Banana
Small business owner reports using Claude for custom app development after Gemini Pro struggled with security requirements.
Qwen 3.6 27B matches Claude Sonnet 4.6 on agentic benchmarks, surpassing GPT 5.2/5.3 and Gemini 3.1 Pro.
Google brings Gemini-powered 'auto browse' capabilities to Chrome for enterprise users, letting workers automate tasks like research, data entry, and more.
Gemini Enterprise Agent Platform takes an interesting approach: it is geared for IT and technical users.
Google's AI meeting notetaker is no longer limited to Google Meets - Gemini can also generate summaries and transcripts of in-person meetings now, as well as meetings on Zoom and Microsoft Teams, as first reported by 9to5Google. Support for in-person meetings was previously limited to alpha users and only available on Android. Google's support page for the feature notes that, "If a user who is not in person wants to join the meeting, you can transition the meeting to a normal video call." The feature also works for impromptu meetings - Google says you "don't need to be in a meeting room" or i...
LLM evaluation on feature model analysis using semi-formal blueprints shows reasoning-optimized models (Grok 4, Gemini 2.5 Pro) achieve 88-89% accuracy vs solver oracles.
I’ve been building plugins for Claude Code, and the first version of the idea was very Claude-focused. That made sense at the start. Claude Code has a real plugin model, hooks are useful, and it is one of the few agent tools where plugins can actually become part of a daily workflow. But after building a few integrations, I kept running into the same uncomfortable question: If I write the useful part of a plugin once, why should I rewrite or repackage the same thing again for Codex, Gemini, Cursor, OpenCode, and whatever comes next? The actual plugin logic is often not Claude-specific. Th...
SafetyALFRED benchmark evaluates multimodal LLMs (Qwen, Gemma, Gemini) on hazard recognition and mitigation in embodied kitchen environments.
Comparative study of output consistency across GPT-4.1, Claude Sonnet 4.6, Gemini 2.5 Flash for exercise prescriptions with safety analysis.
Google is rolling out Gemini in Chrome in Australia, Indonesia, Japan, the Philippines, Singapore, South Korea, and Vietnam.
Cross-linguistic study of politeness effects on 5 LLMs (Gemini-Pro, GPT-4o Mini, Claude 3 Sonnet, DeepSeek-Chat, Llama 3) via 22,500 English/Hindi/Spanish prompts.
Dual-aspect evaluation framework for 4 LLMs (GPT-4o, Claude 3 Opus, Gemini 1.5 Pro, Grok-1) on Vietnamese legal text simplification: accuracy, readability, consistency.
Google travel planning tools using Gemini; consumer product announcement without technical detail.
Google is making it easier to feed your photos into Nano Banana for more personal image generation.
Gemini app adds personalized image generation via Google Photos context; consumer feature launch.
There’s a fault line running through enterprise AI, and it’s not the one getting the most attention. The public conversation still tracks foundation models and benchmarks—GPT versus Gemini, reasoning scores, and marginal capability gains. But in practice, the more durable advantage is structural: who owns the operating layer where intelligence is applied, governed, and improved.…
Google released Gemini 3.1 Flash TTS, a prompt-directed text-to-speech model via Gemini API with novel audio profile prompting.
Tool reference for Google's Gemini 3.1 Flash TTS text-to-speech model accessible via standard Gemini API.
Google DeepMind releases Gemini 3.1 Flash TTS with granular audio tags for precise control over expressive speech generation.
Gemini 3.1 Flash TTS launched with expressive speech synthesis across Google products.
You can save custom prompts you find useful or grab a premade Skill from Google's library.
Google DeepMind launches Gemini Robotics-ER 1.6 with enhanced spatial reasoning and multi-view understanding for autonomous tasks.
Gemini API introduces Flex and Priority inference tiers for cost/latency tradeoffs.
Google releases Veo 3.1 Lite, a cost-optimized video generation model, via Gemini API and AI Studio.
Google DeepMind ships Gemini 3.1 Flash Live with improved precision and lower latency for more natural voice interactions.
Gemini 3.1 Flash Live becomes available across Google products, improving audio AI naturalness and reliability.
Lyria 3 music generation model launches in paid preview via Gemini API and AI Studio.
Google expands Personal Intelligence feature across AI Mode in Search, Gemini app, and Chrome.
Google DeepMind releases Gemini 3.1 Flash-Lite, the fastest and most cost-efficient model in the Gemini 3 series.
Google DeepMind releases Gemini 3.1 Pro, an upgraded model for complex reasoning and multi-step problem solving.
Gemini app integrates Lyria 3 music generation model, enabling 30-second track creation from text or image prompts.
Gemini 3 Deep Think updated for specialized reasoning in science, research, and engineering problem-solving.
Research papers demonstrate Gemini Deep Think's impact on mathematical and scientific discovery across multiple disciplines.
Google DeepMind releases Gemini 3 Flash, a frontier-grade model optimized for speed and cost efficiency.
Incomplete announcement; no substantive details provided.
SIMA 2: Gemini-powered agent reasoning and acting in interactive 3D virtual environments.
Pilot with Northern Ireland Education Authority shows Gemini saved teachers ~10 hours/week.
Gemini 2.5 Flash-Lite reaches general availability with 1M context, multimodality, production-grade performance.
Gemini with Deep Think achieves gold-medal performance at International Mathematical Olympiad.
Gemini 2.5 Deep Think achieves gold-medal performance at ICPC World Finals, demonstrating major advance in abstract reasoning and competitive programming.
Gemini Robotics 1.5 enables physical AI agents to perceive, plan, and execute multi-step tasks with tool use in real environments.
Google DeepMind rolls out Deep Think reasoning capability in Gemini app for Ultra subscribers; mathematicians get IMO competition model access.
Gemini app gains enhanced native image editing capabilities with improved transformation and control features.
Google DeepMind unveils Gemini 2.5 Computer Use model in API preview; specialized for UI automation and agent-based workflows.
Google DeepMind releases on-device robotics model with general-purpose dexterity and rapid task adaptation for local deployment.
Google DeepMind releases Gemini 2.5 Pro stable, Flash GA, and Flash-Lite preview with enhanced performance.
Gemini 2.5 Flash and Pro now GA; Flash-Lite introduced as fastest, most cost-efficient variant.
Gemini 2.5 adds audio dialog and generation capabilities.
Google DeepMind improves Gemini 2.5 security safeguards, positioning it as most secure model family.
Gemini 2.5 Pro and Flash updates; Deep Think experimental reasoning mode added to Pro.
Google DeepMind releases AlphaEvolve, a Gemini-powered agent that automatically designs algorithms by combining LLM creativity with automated evaluation.
Google releases updated Gemini 2.5 Pro Preview with improved coding performance, accelerating developer access.
Gemini 2.5 Pro Preview update adds enhanced capabilities for building interactive web applications.
Google launches Gemini 2.5 Flash, first fully hybrid reasoning model enabling developers to toggle thinking on or off.
Google releases Veo 2 video generation in Gemini and Whisk, creating eight-second high-resolution videos from text or image prompts.
Google applies Gemini LLM to dolphin communication analysis through DolphinGemma, a domain-specialized model variant.
Google announces Gemini 2.5, flagship model with integrated reasoning/thinking capabilities, claiming state-of-the-art intelligence.
Google DeepMind releases Gemini Robotics and Gemini Robotics-ER models for robot perception and control in physical environments.
Gemini 2.0 Flash now supports native image generation, available in Google AI Studio and Gemini API for developers.
Gemini 2.0 Flash-Lite reaches general availability for production use on Gemini API, Google AI Studio, and Vertex AI.
Google announces Gemini 2.0 family: Flash, Flash-Lite (GA), and Pro Experimental, positioning for agentic AI workflows.
Google announces Gemini 2.0, a multimodal model designed for agentic AI with native capabilities for reasoning and tool use.