The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Geometric and Stochastic Analysis of Discontinuities in Sparse Mixture-of-Experts

Sparse Mixture-of-Experts (SMoE) architectures are now widely deployed in state-of-the-art language and vision models, where conditional routing allows scaling to very large networks. However, this very Top-$k$ expert selection that enables conditional routing also renders the SMoE map inherently discontinuous. In the vicinity of these discontinuity surfaces, even inputs that are arbitrarily close may activate substantially different sets of experts resulting in significantly different outputs. In this work we give a rigorous geometric and stochastic analysis of these discontinuities. We firs...

Tho Tran Huu·5 days ago

The Verge AI· PRESS

Google’s first smart speaker in six years arrives next week

The Google Home Speaker comes in four colors, including porcelain. (Stroopwafel not included.) | Photo by Jennifer Pattison Tuohy / The Verge Google's first new smart speaker in six years starts shipping on June 29th, narrowly missing its promised spring launch window. Preorders for the Google Home Speaker open today, June 17th. Nothing has changed hardware-wise in the nine months since the $99 speaker was announced. It has the same slightly squished round design, with touch-capacitive buttons on top and a light ring at the bottom to indicate status. And it still comes in four colors: porcela...

Jennifer Pattison Tuohy·5 days ago

The Archive

Geometric and Stochastic Analysis of Discontinuities in Sparse Mixture-of-Experts

Google’s first smart speaker in six years arrives next week

A Hybrid LSTM--Vision Transformer Architecture for Predicting HRRR Forecast Errors

FoMoE: Breaking the Full-Replica Barrier with a Federation of MoEs

Lifecycle-Aware Dynamic Analysis for Secure ML Model Execution

Canadian pension giant joins race to fund India’s AI-fueled data center boom

Sumi: Open Uniform Diffusion Language Model from Scratch

Spotlight: Synergizing Seed Exploration and Spot GPUs for DiT RL Post-Training

Enhancing Multilingual Reasoning via Steerable Model Merging

DIPHINE: Diffusion-based $Φ$-ID Neural Estimator

TRAP: Benchmark for Task-completion and Resistance to Active Privacy-extraction

Sequential Kernel-based Conditional Independence Testing via Adaptive Betting

DeepL acquires Mixhalo for live-event audio streaming and translation

G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment

ThinkDeception: A Progressive Reinforcement Learning Framework for Interpretable Multimodal Deception Detection

Beyond Tokenization: Direct Timestep Embedding and Contrastive Alignment for Time-Series Question Answering

Mitigating Scoring Errors and Compensating for Nonverbal Subtests in Speech-Based Dementia Assessment

CAPRA: Scaling Feedback on Software Architecture Deliverables with a Multi-Agent LLM System

FOSC-X: An Extended Framework for Optimal Local Cuts and Non-Horizontal Cluster Selection from Clustering Hierarchies

A Controlled Benchmark of Quantum-Latent GAN Augmentation for Brain MRI

EfficientRollout: System-Aware Self-Speculative Decoding for RL Rollouts

Online Reward-Punishment Learning from Fixed-Channel Perceptual Event Streams without Environment Rewards

Be Your Own Teacher: Steering Protein Language Models via Unsupervised Reward Optimization

GraphPO: Graph-based Policy Optimization for Reasoning Models

RTSGameBench: An RTS Benchmark for Strategic Reasoning by Vision-Language Models

Decoupling Search from Reasoning: A Vendor-Agnostic Grounding Architecture for LLM Agents

SenFlow: Inter-Sentence Flow Modeling for AI-Generated Text Detection in Hybrid Documents

Graph-ESBMC-PLC: Formal Verification of Graphical PLCopen XML Ladder Diagram Programs Using SMT-Based Model Checking

SciRisk-Bench: A Risk-Dimension-Aware Benchmark for AI4Science Safety

Zero-Shot Active Feature Acquisition via LLM-Elicitation