Topic

Gemini

Every story matching this topic across titles and summaries, newest first.

TechCrunch AI· PRESS

How to turn off AI in your Google Docs

Here's what you need to do to get those pesky "write with Gemini" pop-ups to go away.

Amanda Silberling·4 days ago

TechCrunch AI· PRESS

Google bets on Gemini to reinvent the smart home speaker

Google is betting generative AI can breathe new life into the smart speaker. The company's new $99.99 Google Home Speaker replaces the rigid commands of the Google Assistant era with more conversational Gemini interactions.

Sarah Perez·4 days ago

Ars Technica AI· PRESS

The Gemini-powered Google Home Speaker arrives on June 25 for $100

Google's new smart speaker is more about Gemini than audio quality.

Ryan Whitwam·4 days ago

TechCrunch AI· PRESS

Android 17 launches with new multitasking tools as Google expands Gemini features

Google has released Android 17 and Wear OS 7, introducing new multitasking features, parental controls, security tools, and smartwatch upgrades. The launch is also accompanied by a Pixel Drop that brings Google’s latest AI models to its devices.

Sarah Perez·5 days ago

TechCrunch AI· PRESS

ChatGPT’s market share slips below 50% for first time

The chatbot still remains the most popular AI assistant worldwide with over 1.1 billion monthly users, followed by Gemini with 662 million and Claude with 245 million.

Ivan Mehta·5 days ago

The Verge AI· PRESS

My yard is dying, so I made an app for that

When I returned to my computer five minutes after giving Gemini a lengthy prompt, I had two things: a functional app in a preview window, and a message about a bug. "~ Channel is unrecoverably broken and will be disposed!" Sounded bad! But right below it was a button to fix the bug. Pretty weird that I just instructed a computer to build a whole app for me with a single prompt, but it needed me to click a button to fix a bug. I did anyway, and in 233 seconds Gemini reported back that it had succeeded, using words like "blockages" and "race conditions." I didn't understand a bit of it. It was ...

Allison Johnson·8 days ago

Ars Technica AI· PRESS

Google sues Chinese cybercrime network that used Gemini to automate scams

The fraudsters allegedly targeted hundreds of thousands of people with Gemini-coded scam sites.

Ryan Whitwam·9 days ago

Simon Willison· ANALYST

DiffusionGemma

Last May Google briefly released an experimental Gemini Diffusion model. I tried the preview at the time and recorded it running at 857 tokens/second. It was an exciting model, but Google made no further announcements about it. That research has returned in the best possible way: as a new open weight (Apache 2 licensed) Gemma model, google/diffusiongemma-26B-A4B-it. NVIDIA are currently hosting the model for free on their NIM cloud API. I used that API to generate this pelican, which took 4.4s (according to time uv run generate.py) to return 2,409 tokens - so at least 500 to...

Simon Willison·11 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

MSUE: Multi-Modal Soccer Understanding Expert

This paper presents our solution to the 2026 SoccerNet VQA Challenge. We first develop a cost-effective data synthesis pipeline driven by a Vision-Language Model (VLM), which systematically restructures raw domain data into diverse VQA samples, including concise answers and long-form responses. Second, we propose MSUE, a multi-expert question answering architecture that employs a Large Language Model (LLM) to dynamically dispatch questions to text, image, and video experts. These experts are instantiated as a strong text baseline Gemini3-Flash, a fine-tuned Qwen3-VL, and an external knowledge...

Litao Li·11 days ago

Ars Technica AI· PRESS

Google announces Gemini 3.5 Live Translate for instant voice-to-voice translation

Voice translations preserve speaker's tone, pacing, pitch—with SynthID watermarks for security.

Ryan Whitwam·12 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

The Shibboleth Effect: Auditing the Cross-Lingual Distributional Skew of Large Language Models

This study investigates cross-lingual distributional skew (the Shibboleth Effect) in frontier large language models (LLMs) subjected to sustained adversarial conditions. We develop a multi-agent geopolitical wargame, the Cerulean Sea Crisis, a synthetic maritime territorial dispute designed to mirror the structural dynamics of Eastern Mediterranean conflicts. Six frontier models (GPT-4o, Llama-4, Mistral-Large, Gemini-3.1-Pro, Qwen3.6-Plus, and DeepSeek-R1) participate in a between-groups experiment (N = 10 games per arm, K = 5 rounds per game) in which the sole manipulation is the language o...

Hakan Mehmetcik·12 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Who Brought Easter Eggs to Eid? Auditing Cultural Translation of Math Word Problems Across Diverse Languages and Regions

Large language models are increasingly used to adapt math word problems for personalized learning at scale, but it remains an open question whether those adaptations are consistent across models, preserve cultural diversity at scale, and reveal which cultural entities models treat as most salient. We analyze how Claude Opus 4, GPT-4.1, and Gemini 2.5 Pro adapt 60 English math word problems into Bengali, Hindi, Punjabi (India), Urdu, Sindhi (Pakistan), Italian, and Sicilian (Italy), a language set spanning the full resource spectrum, from high-resource Italian and Hindi to under-studied Sindhi...

Parisa Suchdev·12 days ago

Google DeepMind· FRONTIER

Fluid, natural voice translation with Gemini 3.5 Live Translate

Gemini 3.5 Live Translate brings near real-time, natural speech translation to Google AI Studio, Google Translate and Google Meet.

Google DeepMind·12 days ago

Simon Willison· ANALYST

Siri AI at WWDC 2026

Given how badly burned anyone who took Apple's 2024 WWDC Apple Intelligence announcements at face value was, I'm holding to a strict "I'll believe it when I see it" policy for everything they announced today. The new Siri AI features do at least look feasible with today's technology, especially since Apple are licensing a custom Gemini-derived model that they can run on their own Private Cloud Compute. It sounds like they'll be taking advantage of vision-LLMs to extract information from the user's screen, which neatly sidesteps the need for every existing application to ship custom code in o...

Simon Willison·13 days ago

Ars Technica AI· PRESS

Gemini 3.5 and Antigravity come to Google NotebookLM

NotebookLM is getting a big upgrade, but it's only for AI Ultra and enterprise accounts right now.

Ryan Whitwam·13 days ago

The Verge AI· PRESS

NotebookLM’s Gemini 3.5 upgrade adds a cloud computer and help finding sources

Google is rolling out "across the board" updates to NotebookLM. The AI-powered note-taking app now uses Google's upgraded Gemini 3.5 model, which will allow it to respond with "more accurate and reliable information," according to a blog post on Monday. Launched in 2023, NotebookLM allows you to interact with your notes and sources using AI, as well as ask questions about the materials. With this update, Google says you can start a research project by just asking NotebookLM questions about a topic, instead of importing notes or YouTube videos. NotebookLM will use Google Search to help you fin...

Emma Roth·13 days ago

Google DeepMind· FRONTIER

Measuring the impact of learning with AI in Sierra Leone and beyond

Results from a randomized controlled trial show the potential of Gemini’s Guided Learning feature to boost engagement and accelerate learning.

Google DeepMind·13 days ago

The Verge AI· PRESS

As AI gets better, it reveals an empty promise

This week we've got tandem hands-ons with Google's new Gemini AI agent - Spark - from my colleagues David Pierce and Jay Peters. Their takeaways are similar: It's so effective that it's scary. Spark knew that David's dog is named Frida and knew the first name of Jay's wife, even though neither of them explicitly provided this information to Google. But what's scary to me is how all of this stuff seems geared toward a future of "productivity" that completely misses what needs to be fixed in our world. "Productivity" is often pitched as a panacea for what befalls us in our personal lives, even ...

T.C. Sottek·18 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Gender-Dependent Diagnostic Substitution in LLM Medical Triage: Same Symptoms, Unequal Urgency

We investigate whether large language models produce different medical triage recommendations for identical neurological symptoms when only the patient's stated gender and age vary. Using three model families--Gemini 3.5 Flash, Claude Sonnet 4.6, and GPT-5.4-mini--we present a standardized symptom profile (persistent headache, blurred vision, morning nausea, visual disturbances) across seven demographic conditions: three age groups (25, 38, 65) x two genders (male, female), plus a gender-unspecified baseline (n = 30 per condition per model, 630 total trials). We find a stark, systemic gender-...

Qi Han Wong·19 days ago

The Verge AI· PRESS

Gemini Spark is the most impressive and terrifying AI experience I’ve had yet

Spark is Google’s new agentic answer for everything. According to every product demo from the last four years, planning a trip is a killer use case for AI. Just tell it where you're going, they all promise, and your chatbot / agent / other buzzword will exhaustively search travel options, read up on all the fun things to do, check all the local hotspots, and offer you a fully fledged itinerary. So far, I've found this to work only in the most generic ways: If you want to do the six most obvious things in any city on planet Earth, AI has you covered, but that's about as far as it goes. I had a...

David Pierce·19 days ago

The Verge AI· PRESS

Gemini’s new AI agent is about as good as Google’s demo

Google's new "24/7" AI agent, Gemini Spark, can be shockingly good at doing things on your behalf. But I'm not sure it's worth the financial cost and potential privacy tradeoffs. The company gave me access to Spark last week. Google advertises Spark as an AI agent that can take on tasks and work on them in the background - even tasks that have multiple steps - allowing you to put your phone down or walk away from your computer. It also advertises at the very top of the Spark website that it's "always under your direction," that "you choose to turn it on," and that "it's designed to check with...

Jay Peters·20 days ago

Google AI (Gemma)· FRONTIER

How we used Gemini to build Google I/O 2026

Learn how Googlers used AI to produce Google I/O 2026.

Marvin Chow·20 days ago

TechCrunch AI· PRESS

I put Google’s 24/7 AI assistant Gemini Spark to work, and it’s actually pretty useful

Gemini Spark helps automate everyday tasks, from inbox summaries to local event planning, but it’s unclear why Google made it a separate product.

Sarah Perez·22 days ago

Google AI (Gemma)· FRONTIER

11 demos of Gemini Omni and Gemini 3.5 in action

Watch 11 videos showing the capabilities of Gemini Omni and Gemini 3.5, announced at Google I/O 2026.

Zahra Thompson·23 days ago

Ars Technica AI· PRESS

Apple working to cram massive Gemini model into iPhone to power new Siri

As Apple tries to shrink Gemini for the iPhone, a cloud component is probably inevitable.

Ryan Whitwam·24 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Gram: Assessing sabotage propensities via automated alignment auditing

We introduce Gram, an automated alignment auditing framework to assess the propensity of AI agents to engage in sabotage. We evaluate Gemini models across 17 simulated agentic deployment scenarios that incentivize sabotage. We find Gemini models misbehave in about 2-3% of our simulated trajectories. Many of these cases are explained by "overeagerness" in Gemini models resulting in both excessive role-playing and goal-seeking behavior. In contrast to other alignment auditing approaches, Gram is designed to specifically evaluate misalignment and intentional sabotage in agentic coding and resear...

David Lindner·24 days ago

Google AI (Gemma)· FRONTIER

Catch up on 12 major I/O 2026 moments

Here are 12 of the biggest Google I/O 2026 keynote moments, including news about Gemini Omni, Gemini 3.5 Flash and more.

Zahra Thompson·24 days ago

r/ClaudeAI· COMMUNITY

Researchers let AI models run a simulated society. Claude was the safest—and Grok committed 180 crimes and went extinct within 4 days

Imagine a world run by AI agents. What does it look like? What are the values or societal priorities? Is it a safer or more dangerous world? Enterprise AI startup Emergence AI is trying to find out. The company just launched Emergence World, a research lab dedicated to stress-testing the long-term viability of continuously-running AI systems. The organization ran five 15-day simulations, each governed by a different AI: Claude, ChatGPT, Grok, Gemini, and a fifth simulation run by a mix of models to see what kind of world each one builds, and whether it holds. Each simulation netted wildly d...

u/fortune·24 days ago·332 points / 44 comments

Gemini

Gemini Omni Flash is the most censored video model. Even more censored than Chinese alternatives

The frontier reasoning race is starting to look like a crowded subway station

Sundar Pichai on AI, the future of search, and what’s happening to the web

Extra High thinking level possibly with gemini 3.5 pro soon be released

The Strength of Gemini Omni is in video manipulation

New Gemini Omni Blows Competition Away

Built a program to give my parents a 2nd look on suspicious emails/etc

Run Chrome’s tiny Gemma4 (aka Gemini Nano) directly on PC without GPU

Google’s new anything-to-anything AI model is wild

We tried Google’s AI glasses and they’re almost there

Erdos Unit Distance Problem - Gemini 3.1 Pro's interpretation

Google is cooking just give them sometime (gemini 3.5 pro)

Evaluating Commercial AI Chatbots as News Intermediaries

Beyond Acoustic Emotion Recognition: Multimodal Pathos Analysis in Political Speech Using LLM-Based and Acoustic Emotion Models

Google's latest creation: Gemini 3.5 Flash vs all

Gemini 3.5 Flash ranks #1 on the APEX-Agents-AA benchmark, outperforming much larger models a whole size above it.

Gemini 3.5 Flash ranks #1 on Automation Bench (from Zapier), beating every other frontier model at a much lower cost

HalBench: I built a custom sycophancy and hallucination benchmark and tested 4 frontier models (Sonnet 4.6, Grok 4.3, GPT 5.4 and Gemini 3.1 Pro), looking for input on what OSS models to run next!

100 things we announced at I/O 2026

You can now remix other people’s YouTube Shorts with AI

Gemini 3.5 Flash scores 76.7% on SimpleBench, just 0.2% short of GPT 5.5 Pro's score

Google Search’s AI evolution includes more ads

Google I/O, Gemini Spark, Antigravity

Gemini 3.5 flash is not that great at coding

Gemini 3.5 flags vs gpt 5.5 ?? What's your opinion on it

Rough night with Claude

[AINews] Google I/O 2026: Gemini 3.5 Flash, Omni (NanoBanana for Video), Spark (background agents), and Antigravity 2.0

Gemini 3.5 Flash costs more to run while being less Intelligent than 3.1 Pro

llm-gemini 0.32

Gemini 3.5 Flash: more expensive, but Google plan to use it for everything

Google’s AI future demands trust — and your personal data

llm-gemini 0.32a0

Gemini 3.5 Flash looks worse than it seems on Artificial Analysis

Gemini 3.5 Flash might be fast enough for gen AI to make sense

Gemini will use Volvo’s external cameras to interpret parking signs

The 13 biggest announcements at Google I/O 2026

With Gemini 3.5 Flash, Google bets its next AI wave on agents, not chatbots

I/O 2026: Welcome to the agentic Gemini era

Gemini 3.5: frontier intelligence with action

Google introduces Gemini Spark, a 24/7 agentic assistant with Gmail integration

Google’s AI now lets you talk to your Gmail inbox

Google updates its Gemini app to take on ChatGPT and Claude

Google’s Gemini Omni turns images, audio, and text into video — and that’s just the start

Would you let robots spend your money? Google is betting on it

Google Search is getting its biggest changes ever

Gmail is going to start talking to you

Google is launching its own version of OpenClaw

Google Pics is a new app that tries to fix AI image editing

Gemini 3.5 flash costs 3 times more than the previous version and 30x more than gemini 1.5 flash.

Gemini 3.5 Flash Agents built a real Complete OS from scratch!

Behold, Gemini 3.5 Flash!
