The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

A companion study established a de-biased, cross-model VLM-as-3D-judge that reliably ranks single-image-to-3D mesh quality where cheap geometry and CLIP proxies fall short. This paper asks: can that judge's preferences specialize a strong open generator, TRELLIS, on one asset class (furniture), cheaply and without human labels? Taking the judge from ranking to optimization is where the work lives. Pushing a VLM judge into the training and evaluation loop exposes failure modes ranking never triggered, so our contribution is an optimization-grade hardening of the judge: a training judge (Qwen2....

Ali Asaria·3 days ago

The Archive

Judging to Improve: A De-biased VLM-as-3D-Judge Protocol for Single-Image 3D Generation

Automating SKILL.md Generation for Computer-Using Agents via Interaction Trajectory Mining

Train, Retrieve, or Both? A Four-Arm Head-to-Head for Correct Statutory Citation on the Ontario Residential Tenancies Act

General Intuition in talks to raise $300M at around $2B valuation

On the Variance of Temporal Difference Learning and its Reduction Using Control Variates

Robust $Q$-learning for mean-field control under Wasserstein uncertainty in common noise

Critical Percolation as a Synthetic Data Model for Interpretability

Quantum ring all-reduce: communication and privacy advantages for distributed learning

A tech worker-backed PAC is bringing a $5M knife to Big Tech’s $100M gunfight

SoftSkill: Behavioral Compression for Contextual Adaptation

Constrained hybrid modelling to predict microbial dynamics and organic matter turnover in soil systems

Quantum-classical physics-informed Kolmogorov-Arnold networks for PDEs

Recurrent neural networks approximate continuous functions

A Model-Driven Approach for Developing Families of Reinforcement Learning Environments

Leveraging systems' non-linearity to tackle the scarcity of data in the design of Intelligent Fault Diagnosis Systems

Statistical Properties of Training & Generalization

Token-Operations-Oriented Inference Optimization Techniques for Large Models

Shifting-based Optimizable Linear Relaxations for General Activation Functions

Integrating national forest inventory, airborne lidar, and satellite imagery for wall-to-wall mapping of forest structure with computer vision

PsyScore: A Psychometrically-Aware Framework for Trait-Adaptive Essay Scoring and ZPD-Scaffolded Feedback

Boundary Embedding Shaping with Adaptive Contrastive Learning for Graph Structural Disentanglement

ELVA: Exploring Ranking-Driven Universal Multimodal Retrieval

Lagrange: An Open-Vocabulary, Energy-Based Sparse Framework for Generalized End-to-End Driving

Confidence-Aware Automated Assessment of Student-Drawn Scientific Models

Editorial Alignment: A Participatory Approach to Engaging Editorial Expertise in LLM-mediated Knowledge Dissemination

The Register Gap: A Meaning Intelligence Framework for Nigerian Public Discourse

Who decides when AI is too dangerous?

Finetuning Vision-Language-Action Models Requires Fewer Layers Than You Think

Navigating Unreliable Parametric and Contextual Knowledge: Explicit Knowledge Conflict Resolution for LLM Inference

SPOT-E: Test-Time Entropy Shaping with Visual Spotlights for Frozen VLMs