The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.


Trials and tribulations fine-tuning & deploying Gemma-4 [P]

Hey all, our ML team spent some time this week getting training and deployments working for Gemma-4, and wanted to document all the things we ran into along the way.

* **PEFT doesn't recognize Gemma 4's custom layers.** Google wrapped vision/audio projections in a new `ClippableLinear` class that doesn't inherit from `nn.Linear`, so PEFT refuses to attach LoRA, even for text-only fine-tuning. Fix: unwrap the wrappers after loading weights but before calling PEFT.
* **SFTTrainer killed training silently.** TRL hardcodes `use_cache=False`, which breaks Gemma 4's KV-sharing attention. Loss nev...

u/FallMindless3563·2 months ago·47 pts / 6 comm

Trials and tribulations fine-tuning & deploying Gemma-4 [P]

Systematic Capability Benchmarking of Frontier Large Language Models for Offensive Cyber Tasks

Lightweight Cybersickness Detection based on User-Specific Eye and Head Tracking Data in Virtual Reality

Uncertainty Quantification in PINNs for Turbulent Flows: Bayesian Inference and Repulsive Ensembles

Tesla brings its robotaxi service to Dallas and Houston

From Legal Text to Executable Decision Models: Evaluating Structured Representations for Legal Decision Model Generation

FlowRefiner: Flow Matching-Based Iterative Refinement for 3D Turbulent Flow Simulation

Graph-of-Agents: A Graph-based Framework for Multi-Agent LLM Collaboration

The RAM shortage could last years

Negative Momentum for Convex-Concave Optimization

SeekerGym: A Benchmark for Reliable Information Seeking

SciImpact: A Multi-Dimensional, Multi-Field Benchmark for Scientific Impact Prediction

Local Inconsistency Resolution: The Interplay between Attention and Control in Probabilistic Models

The Consensus Trap: Rescuing Multi-Agent LLMs from Adversarial Majorities via Token-Level Collaboration

BOIL: Learning Environment Personalized Information

RoIt-XMASA: Multi-Domain Multilingual Sentiment Analysis Dataset for Romanian and Italian

If Only My CGM Could Speak: A Privacy-Preserving Agent for Question Answering over Continuous Glucose Data

Please refuse to answer me! Mitigating Over-Refusal in Large Language Models via Adaptive Contrastive Decoding

Automated Classification of Plasma Regions at Mars Using Machine Learning

A proposal for PU classification under Non-SCAR using clustering and logistic model

CASCADE: A Cascaded Hybrid Defense Architecture for Prompt Injection Detection in MCP-Based Systems

The Topological Trouble With Transformers

A Two-Stage Deep Learning Framework for Segmentation of Ten Gastrointestinal Organs from Coronal MR Enterography

AI chip startup Cerebras files for IPO

The Provenance Gap in Clinical AI: Evidence-Traceable Temporal Knowledge Graphs for Rare Disease Reasoning

Complementing Self-Consistency with Cross-Model Disagreement for Uncertainty Quantification

HiveMind: OS-Inspired Scheduling for Concurrent LLM Agent Workloads

Anthropic’s relationship with the Trump administration seems to be thawing

The App Store is booming again, and AI may be why

Claude system prompts as a git timeline
