Developer Tools B
Showing 301–312 of 312
-
Free Heavy-Tailed Lunch for Muon: A Theoretical Justification of Empirical SuccessA theoretical justification for Muon's empirical successMatrix-update non-Euclidean optimizers like Muon and Scion train transformers well but lack theory. The paper offers a heavy-tailed analysis justifying Muon's empirical advantages over Euclidean methods.
-
Rethinking Global Average Pooling: Your Classifier Is Secretly a Multi-Instance LearnerRethinking GAP: your classifier is secretly a multi-instance learnerModern image classifiers widely use global average pooling followed by a linear head. The paper shows this linearity makes GAP-based classifiers behave as multi-instance learners, prompting a rethink of global average pooling.
-
TRACE: Trajectory-Routed Causal Memory for Delayed-Evidence Visuomotor ImitationTRACE: trajectory-routed causal memory for delayed-evidence imitationAutonomous robots may need decisions based on evidence no longer visible. For delayed-evidence tasks, where an early cue disappears before a later decision, TRACE introduces trajectory-routed causal memory for visuomotor imitation.
-
Provably Safe, Yet Scalable Reinforcement LearningProvably safe yet scalable reinforcement learningSafe RL usually relies on soft-constrained policy optimization without hard guarantees. This work proposes an approach that is provably safe while remaining scalable.
-
BayLing-Duplex: Native Full-Duplex Speech Dialogue with a Single Autoregressive LLMBayLing-Duplex: native full-duplex speech dialogue from one LLMBayLing-Duplex enables native full-duplex speech interaction with a single autoregressive LLM, letting it listen and speak simultaneously. It handles natural phenomena such as overlap and hesitation.
-
Behavioral Audit of Machine Unlearning Has a Privacy CostBehavioral audits of machine unlearning carry a privacy costMachine unlearning removes learned data from models, but auditing its behavior is itself costly. The paper shows that behavioral audits of unlearning incur a privacy cost.
-
PepALD: Macrocyclic Peptide Generation via Autoregressive Latent DiffusionPepALD generates macrocyclic peptides via autoregressive latent diffusionMacrocyclic peptides are promising for intracellular targets but require joint control of non-natural chemistry, ring topology, and permeability. PepALD generates them using autoregressive latent diffusion.
-
Beyond the Training Distribution: Evaluating Predictions Under Distribution Shift and Selection BiasEvaluating predictions under distribution shift and selection biasKnowing how a model will perform in a new environment before deployment helps prevent harm. The paper evaluates predictions under two common sources of degradation: distribution shift and selection bias.
-
Nonlinear Two-Time-Scale Stochastic Approximation: A Sharp Phase Transition and How to Beat ItA sharp phase transition in two-time-scale stochastic approximationThe paper analyzes nonlinear two-time-scale stochastic approximation, revealing a sharp phase transition under contractive assumptions and showing how to beat it.
-
When the Tool Decides: LLM Agents Defer Blindly to Graph Neural Network Tools, and Stronger Backbones Defer MoreLLM agents defer blindly to GNN tools — stronger backbones defer moreA growing line of work equips LLM agents with graph neural networks as callable tools. The paper finds that agents defer blindly to these GNN tools, and that stronger backbones tend to defer even more.
-
The Perceived Fragility of Explanations in Audio Models: Manipulation of Attribution with Unchanged PredictionsManipulating audio-model explanations while predictions stay unchangedThe paper investigates the fragility of post-hoc explanations in audio deepfake detection. Introducing a psychoacoustic framework beyond image-style Lp metrics, it shows attributions can be manipulated while predictions remain unchanged.
-
A Computational Audit of Demographic Association Encoding in ClinicalBERT Language PredictionsA computational audit of demographic encoding in ClinicalBERTAs clinical language models enter high-stakes decision support, the paper audits how demographic associations are encoded in ClinicalBERT and shape its language predictions.