Developer Tools B

Showing 301–312 of 312
  • arXiv cs.LG (Machine Learning) · EN Developer Tools extract
    Free Heavy-Tailed Lunch for Muon: A Theoretical Justification of Empirical Success
    A theoretical justification for Muon's empirical success
    Reinforcement Learning Transformer
    Matrix-update non-Euclidean optimizers like Muon and Scion train transformers well but lack theory. The paper offers a heavy-tailed analysis justifying Muon's empirical advantages over Euclidean methods.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    Rethinking Global Average Pooling: Your Classifier Is Secretly a Multi-Instance Learner
    Rethinking GAP: your classifier is secretly a multi-instance learner
    Retrieval-Augmented Generation (RAG)
    Modern image classifiers widely use global average pooling followed by a linear head. The paper shows this linearity makes GAP-based classifiers behave as multi-instance learners, prompting a rethink of global average pooling.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN New Model Releases extract
    TRACE: Trajectory-Routed Causal Memory for Delayed-Evidence Visuomotor Imitation
    TRACE: trajectory-routed causal memory for delayed-evidence imitation
    Reinforcement Learning
    Autonomous robots may need decisions based on evidence no longer visible. For delayed-evidence tasks, where an early cue disappears before a later decision, TRACE introduces trajectory-routed causal memory for visuomotor imitation.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Provably Safe, Yet Scalable Reinforcement Learning
    Provably safe yet scalable reinforcement learning
    Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning
    Safe RL usually relies on soft-constrained policy optimization without hard guarantees. This work proposes an approach that is provably safe while remaining scalable.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.CL (Computation and Language) · EN Training & Fine-tuning extract
    BayLing-Duplex: Native Full-Duplex Speech Dialogue with a Single Autoregressive LLM
    BayLing-Duplex: native full-duplex speech dialogue from one LLM
    Deep Learning Fine-tuning Llama Reinforcement Learning from Human Feedback (RLHF) Speech Processing
    BayLing-Duplex enables native full-duplex speech interaction with a single autoregressive LLM, letting it listen and speak simultaneously. It handles natural phenomena such as overlap and hesitation.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.LG (Machine Learning) · EN Developer Tools extract
    Behavioral Audit of Machine Unlearning Has a Privacy Cost
    Behavioral audits of machine unlearning carry a privacy cost
    Machine Learning Neural Network
    Machine unlearning removes learned data from models, but auditing its behavior is itself costly. The paper shows that behavioral audits of unlearning incur a privacy cost.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN New Model Releases extract
    PepALD: Macrocyclic Peptide Generation via Autoregressive Latent Diffusion
    PepALD generates macrocyclic peptides via autoregressive latent diffusion
    Embeddings Neural Network
    Macrocyclic peptides are promising for intracellular targets but require joint control of non-natural chemistry, ring topology, and permeability. PepALD generates them using autoregressive latent diffusion.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Beyond the Training Distribution: Evaluating Predictions Under Distribution Shift and Selection Bias
    Evaluating predictions under distribution shift and selection bias
    Algorithms & Theory Machine Learning
    Knowing how a model will perform in a new environment before deployment helps prevent harm. The paper evaluates predictions under two common sources of degradation: distribution shift and selection bias.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Nonlinear Two-Time-Scale Stochastic Approximation: A Sharp Phase Transition and How to Beat It
    A sharp phase transition in two-time-scale stochastic approximation
    Speech Processing
    The paper analyzes nonlinear two-time-scale stochastic approximation, revealing a sharp phase transition under contractive assumptions and showing how to beat it.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Infrastructure & Hardware extract
    When the Tool Decides: LLM Agents Defer Blindly to Graph Neural Network Tools, and Stronger Backbones Defer More
    LLM agents defer blindly to GNN tools — stronger backbones defer more
    AI Agents Deep Learning Neural Network Software Engineering
    A growing line of work equips LLM agents with graph neural networks as callable tools. The paper finds that agents defer blindly to these GNN tools, and that stronger backbones tend to defer even more.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN New Model Releases extract
    The Perceived Fragility of Explanations in Audio Models: Manipulation of Attribution with Unchanged Predictions
    Manipulating audio-model explanations while predictions stay unchanged
    Retrieval-Augmented Generation (RAG)
    The paper investigates the fragility of post-hoc explanations in audio deepfake detection. Introducing a psychoacoustic framework beyond image-style Lp metrics, it shows attributions can be manipulated while predictions remain unchanged.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    A Computational Audit of Demographic Association Encoding in ClinicalBERT Language Predictions
    A computational audit of demographic encoding in ClinicalBERT
    Machine Learning Transformer
    As clinical language models enter high-stakes decision support, the paper audits how demographic associations are encoded in ClinicalBERT and shape its language predictions.
    Read original (arXiv cs.CL (Computation and Language)) ↗