The Archive

Search the full wire by company, model, lab, or keyword. Every story we have ever aggregated.

Claude OpenAI Anthropic Gemini Mistral Cursor

Can an MLP Absorb Its Own Skip Connection?

Study characterizes when skip connections in MLPs can be absorbed into residual-free architectures based on activation properties.

Antonij Mijoski·17 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Talking Slide Avatars: Open-Source Multimodal Communication Approach for Teaching

Open-source workflow creates talking slide avatars for online teaching using text-to-speech and avatar synthesis.

Xinxing Wu·17 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Agri-CPJ: A Training-Free Explainable Framework for Agricultural Pest Diagnosis Using Caption-Prompt-Judge and LLM-as-a-Judge

Agri-CPJ training-free framework diagnoses agricultural pests via vision-language models with structured captions and multi-stage reasoning.

Wentao Zhang·17 days ago

r/OpenAI· COMMUNITY

I asked GPT Image 2.0 for a funny meme.

Reddit user shares anecdotal experience with GPT Image 2.0 meme generation; lacks technical depth or novel findings.

u/imfrom_mars_·17 days ago·58 pts / 14 comm

r/LocalLLaMA· COMMUNITY

Is there any top level hobbyist hardware you guys are waiting to come out this year?

Reddit discussion about consumer hardware roadmap for local LLM inference; no specific announcements or hardware releases.

u/Tired__Dev·17 days ago·40 pts / 74 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Beyond coauthorship: semantic structure and phantom collaborators in transportation research, 1967--2025

Semantic-structural atlas of 120K transportation research papers spanning 1967–2025 using SPECTER2 embeddings and metadata analysis.

Seongjin Choi·17 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Benchmarking Testing in Automated Theorem Proving

New benchmark framework (T) evaluates semantic correctness of LLM-generated formal theorems via downstream compilation success, analogous to integration testing.

Jongyoon Kim·17 days ago

r/LocalLLaMA· COMMUNITY

Hardware Choice for 27b to 31b models.

User seeks hardware advice on GPU configuration (dual vs. single card) for running 27-31B parameter models locally.

u/rebelSun25·17 days ago·40 pts / 80 comm

r/LocalLLaMA· COMMUNITY

HauhauCS (of "Uncensored Aggressive" fame) published an abliteration package that plagiarizes Heretic without attribution, and violates its license

HauhauCS published uncensored LLM abliteration package plagiarizing Heretic without attribution, violating its license; 5M+ monthly downloads across 22 models.

u/nathandreamfast·17 days ago·79 pts / 44 comm

r/singularity· COMMUNITY

The Comeback Chatgpt Did with Image 2 Is Insane

Subjective comparison of ChatGPT's image generation vs. Nano Banana Pro on a specific prompt; anecdotal quality assessment.

u/Rare_Bunch4348·17 days ago·109 pts / 29 comm

r/Anthropic· COMMUNITY

Got downgraded to claude even after paying for it. I paid for it.

User reports unexpected downgrade from Claude Pro to free tier after cancelling subscription early.

u/ProfessionalPart8193·17 days ago·10 pts / 13 comm

r/singularity· COMMUNITY

Kinetix AI teases a human like humanoid robot with a "superinteligence model" that blends vision, touch, language, action, emotions

Kinetix AI announces humanoid robot prototype integrating vision, touch, language, and action in multimodal foundation model architecture.

u/Distinct-Question-16·17 days ago·106 pts / 60 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Rank, Head-Channel Non-Identifiability, and Symmetry Breaking: A Precise Analysis of Representational Collapse in Transformers

Theoretical analysis refines Dong et al. rank collapse findings: layer normalization preserves affine rank, revising understanding of Transformer architectural stability.

Giansalvo Cirrincione·17 days ago

r/ClaudeAI· COMMUNITY

Oh Calude how can i trust you...

After working with Claude, I realized I had zero visibility into what was eating my tokens or what security risks were being taken. So, I built a pkg that sits between you and Claude, reading every tool call before it executes. It catches leaked credentials, detects when an agent is spinning in circles, and lets you set guardrails without manual intervention. https://preview.redd.it/9oijewhg4jxg1.png?width=1520&format=png&auto=webp&s=375605d29cbec96a995cecaa946a1f4e4abb04c5 I ran it on my own session history from the last few days. Here’s what it found: \- 12 leak candidat...

u/WhichCardiologist800·17 days ago·20 pts / 7 comm

r/OpenAI· COMMUNITY

OpenAI caught astroturfing - they created a fake news site, with stories by fake reporters, to attack AI safety advocates

Unverified allegation that OpenAI created fake news site with fictitious reporters; lacks credible sourcing and appears to be social media speculation.

u/EchoOfOppenheimer·17 days ago·57 pts / 16 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

Transferable Human Mobility Network Reconstruction with neuroGravity

Accurate modeling of human mobility is critical for tackling urban planning and public health challenges. In undeveloped regions, the absence of comprehensive travel surveys necessitates reconstructing mobility networks from publicly available data. Here we develop neuroGravity, a physics-informed deep learning model that reliably reconstructs mobility flows from limited observations and transfers to unobserved cities. Using only urban facility and population distributions, we find that neuroGravity's regional representations strongly correlate with socioeconomic and livability status, offeri...

Jinming Yang·17 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Vibe Medicine: Redefining Biomedical Research Through Human-AI Co-Work

With the emergence of large language models (LLMs) and AI agent frameworks, the human-AI co-work paradigm known as Vibe Coding is changing how people code, making it more accessible and productive. In scientific research, where workflows are more complex and the burden of specialized labor limits independent researchers and those in low-resource areas, the potential impact is even greater, particularly in biomedicine, which involves heterogeneous data modalities and multi-step analytical pipelines. In this paper, we introduce Vibe Medicine, a co-work paradigm in which clinicians and researche...

Zihao Wu·17 days ago

r/LocalLLaMA· COMMUNITY

I’m starvin’

Incomplete post with no substantive content.

u/Important_Quote_1180·17 days ago·76 pts / 15 comm

r/LocalLLaMA· COMMUNITY

Qwen3.6 35B A3B Heretic (KLD 0.0015!) Incredible model. Best 35B I have found!

User reports Qwen 3.6 35B quantization variant (A3B Heretic) performs well locally with low KLD divergence and multi-turn capability.

u/My_Unbiased_Opinion·17 days ago·78 pts / 19 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

An AI-Based Supervisory Measurement Integrity Validation Layer for Cyber-Resilient AC/DC Protection in Inverter-Based Microgrids

Line current differential relays (LCDRs) are measurement-driven relays that rely on time-synchronized multi-phase current waveforms to infer internal faults in AC and DC power networks. In inverter-based microgrids, however, the increasing reliance on digitally communicated measurements exposes LCDRs to false-data injection attacks (FDIAs), in which adversaries manipulate remote measurement streams to create protection-triggering yet physically inconsistent current trajectories. This paper addresses this emerging measurement integrity problem by introducing a measurement integrity validation ...

Ahmad Mohammad Saber·17 days ago

r/singularity· COMMUNITY

An amateur just solved a 60-year-old math problem—by asking AI

Amateur mathematician solved 60-year open problem using AI assistance, demonstrating frontier model capability on novel mathematical reasoning.

u/Marha01·17 days ago·173 pts / 25 comm

r/ClaudeAI· COMMUNITY

Cloudflare just shipped enterprise MCP governance, is this where the industry is heading or does nobody care

Cloudflare shipped MCP governance tooling (server portals, Code Mode, AI Gateway, shadow detection) targeting enterprise agent deployments; uncertain market adoption.

u/EquipmentFun9258·17 days ago·22 pts / 12 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

FlowPlace: Flow Matching for Chip Placement

Chip placement plays an important role in physical design. While generative models like diffusion models offer promising learning-based solutions, current methods have the following limitations: they use random synthetic data for pre-training, require long sampling times, and often result in overlaps due to their dependence on gradient-based solvers during the sampling process. To overcome these issues, we propose FlowPlace, which features mask-guided synthetic data generation, flow-based efficient training with flexible prior injection, and hard constraint sampling for overlap-free layouts. ...

Peng Xie·17 days ago

r/LocalLLaMA· COMMUNITY

🚀Pocket LLM v1.5.0 is out: offline Android LLM chat with voice, image input, OCR, and camera capture

Pocket LLM v1.5.0 adds voice input, multimodal support (vision/OCR), and camera capture for offline Android LLM deployment.

u/100daggers_·17 days ago·44 pts / 16 comm

r/ClaudeAI· COMMUNITY

Claude snuck in a new email sign off

I don’t usually use AI to draft emails but today I had to pull some info from a number of sources so had Claude draft something. I did lol when I saw the sign off under my email signature. “Sent with righteous man power” - I have no idea where it came from but it did make me laugh.

u/blackshadow·17 days ago·20 pts / 8 comm

r/ClaudeAI· COMMUNITY

How do you decide which Claude Code tasks to run with Opus vs Sonnet vs Haiku?

Reddit discussion on model selection heuristics for Claude Code tasks across Opus/Sonnet/Haiku tiers.

u/indiebytom·17 days ago·20 pts / 24 comm

arXiv (cs.AI/CL/LG)· ACADEMIA

ResAF-Net: An Anchor-Free Attention-Based Network for Tree Detection and Agricultural Mapping in Palestine

Reliable agricultural data is essential for food security, land-use planning, and economic resilience, yet in Palestine, such data remains difficult to collect at scale because of fragmented landscapes, limited field access, and restrictions on aerial monitoring. This paper presents ResAF-Net, a satellite-based tree detection framework designed for large-scale agricultural monitoring in resource-constrained settings. The proposed architecture combines a ResNet-50 encoder, Atrous Spatial Pyramid Pooling (ASPP), a feature-fusion stage, a multi-head self-attention refinement module, and an ancho...

Rabee Al-Qasem·17 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Hardware-Efficient Softmax and Layer Normalization with Guaranteed Normalization for Edge Devices

In Transformer models, non-GEMM (non-General Matrix Multiplication) operations -- especially Softmax and Layer Normalization (LayerNorm) -- often dominate hardware cost due to their nonlinear nature. To address this, previous approximation studies mainly target rank-oriented tasks, which is acceptable for classification. However, edge Natural Language Processing (NLP) applications and edge generative AI are largely evaluated based on score-oriented tasks, so normalization-guaranteed non-GEMM operations are essential. We propose a hardware-efficient Softmax and LayerNorm with Guaranteed Normal...

Dawon Choi·17 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

Structural Enforcement of Goal Integrity in AI Agents via Separation-of-Powers Architecture

Recent evidence suggests that frontier AI systems can exhibit agentic misalignment, generating and executing harmful actions derived from internally constructed goals, even without explicit user requests. Existing mitigation methods, such as Reinforcement Learning from Human Feedback (RLHF) and constitutional prompting, operate primarily at the model level and provide only probabilistic safety guarantees. We propose the Policy-Execution-Authorization (PEA) architecture, a "separation-of-powers" design that enforces safety at the system level. PEA decouples intent generation, authorization, an...

Rong Xiang·17 days ago

arXiv (cs.AI/CL/LG)· ACADEMIA

RaV-IDP: A Reconstruction-as-Validation Framework for Faithful Intelligent Document Processing

Intelligent document processing pipelines extract structured entities (tables, images, and text) from documents for use in downstream systems such as knowledge bases, retrieval-augmented generation, and analytics. A persistent limitation of existing pipelines is that extraction output is produced without any intrinsic mechanism to verify whether it faithfully represents the source. Model-internal confidence scores measure inference certainty, not correspondence to the document, and extraction errors pass silently into downstream consumers. We present Reconstruction as Validation (RaV-IDP), a ...

Pritesh Jha·17 days ago

← Front Page30 stories

← Newer Older →