The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Scalable Training of Spatially Grounded 2D Vision-Language Models for Radiology

We study how to train visually grounded vision-language models (VLMs) for radiology without manual spatial annotations. We introduce RefRad2D, a large-scale bilingual (German/English) dataset of 1.2M CT and MR image-text pairs derived from clinical practice, with task-specific VQA and spatial grounding subsets generated automatically via LLM-based curation and automated segmentation. Trained on this data, our model RadGrounder jointly performs report generation, visual question answering, and spatial grounding via bounding-box detection or segmentation. On external VQA benchmarks (Slake, VQA-...

Yusuf Salcan·3 days ago

TechCrunch AI· PRESS

‘Queer Eye’s’ life coach Karamo Brown launches Kē, a wellness app featuring his AI digital clone

Karamo Brown, famous for his pep talks on Netflix’s “Queer Eye,” has jumped into the wellness and AI space with his new app, Kē. After spending a year and a half focusing on his own journey—from fitness and nutrition to meditation, sobriety, relationships, and personal growth—Brown wants to help others do the same. Kē offers […]

Lauren Forristal·3 days ago

The Archive

Scalable Training of Spatially Grounded 2D Vision-Language Models for Radiology

‘Queer Eye’s’ life coach Karamo Brown launches Kē, a wellness app featuring his AI digital clone

Marginal Advantage Accumulation for Memory-Driven Agent Self-Evolution

UltraQuant: 4-bit KV Caching for Context-Heavy Agents

Analyzing Defensive Misdirection Against Model-Guided Automated Attacks on Agentic AI Systems

Fisher-Geometric Sharpness and the Implicit Bias of SGD toward Flat Minima

Agentic Symbolic Search: Characterizing PDEs Beyond Hand-crafted Expressions, Meshes, and Neural Networks

Data Bias Mitigation under Coverage Constraints & The Price of Fairness

Context-Aware Hierarchical Bayesian Modeling of IVF Laboratory Environmental Conditions

Repurposing a Speech Classifier for Guided Diffusion-Based Speech Generation

SSH-Net: A Deep Neural Network for Predicting Failure Time Distribution Functions under Competing Risks with Application to GPU Data

Topological Data Analysis for High-Dimensional Dynamic Process Monitoring

Evolutionary Two-Stage Hyperparameter Optimization Strategies for Physics-Informed Neural Networks

Interpretable Sperm Morphology Classification via Attention-Guided Deep Learning

HEPTv2: End-to-End Efficient Point Transformer for Charged Particle Reconstruction

Multi-View Decompilation for LLM-Based Malware Classification

Sparsity, Superposition, and Forgetting: A Mechanistic Study of Representation Retention in Continual Learning

Neural network surrogates with uncertainty quantification for inverse problems in partial differential equations

On the Redundancy of Timestep Embeddings in Diffusion Models

Pseudo-Feature Padding: A Lightweight Defense Against False Data Injection in Power Grids

Amazon employees say they’re facing termination for backing data center limits

Direct Advantage Estimation for Scalable and Sample-efficient Deep Reinforcement Learning

LLM agent safety, multi-turn red-teaming, jailbreak benchmarks, adversarial robustness, safety-critical systems

The Significance of Style Diversity in Annotation-Free Synthetic Data Generation

DataMagic: Transforming Tabular Data into Data Insight Video

Towards Modality-imbalanced Federated Graph Learning: A Data Synthesis-based Approach

Rethinking Shrinkage Bias in LLM FP4 Pretraining: Geometric Origin, Systemic Impact, and UFP4 Recipe

CRAX: Fast Safe Reinforcement Learning Benchmarking

AutoPass: Evidence-Guided LLM Agents for Compiler Performance Tuning

CATCH-ME if you RAG: a dataset of Contextually Annotated multi-Turn Counterspeech against Hate and Misinformation Exchanges