Developer Tools B

Showing 31–60 of 312
  • arXiv cs.AI (Artificial Intelligence) · EN Inference & Efficiency extract
    Robust $Q$-learning for mean-field control under Wasserstein uncertainty in common noise
    Robust Q-learning for mean-field control under Wasserstein uncertainty
    Quantization
    This paper presents a robust Q-learning algorithm for discrete-time mean-field control problems with common noise, accounting for Wasserstein uncertainty in the model.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Quantum ring all-reduce: communication and privacy advantages for distributed learning
    Quantum ring all-reduce for efficient, private distributed learning
    Deep Learning Machine Learning
    The paper proposes a quantum ring all-reduce scheme for distributed learning, arguing that quantum communication can make distributed training both more communication-efficient and information-theoretically private. The approach is discussed for both classical and quantum settings.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Inference & Efficiency extract
    SoftSkill: Behavioral Compression for Contextual Adaptation
    SoftSkill: behavioral compression for contextual adaptation
    Computer Vision Deep Learning Inference Software Engineering
    Agent skills are commonly deployed as natural-language Markdown files that encode answer policies. SoftSkill compresses such behaviors to enable more efficient contextual adaptation.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.LG (Machine Learning) · EN Developer Tools extract
    Quantum-classical physics-informed Kolmogorov-Arnold networks for PDEs
    QCPIKAN: a quantum-classical physics-informed KAN for PDEs
    Neural Network
    The paper develops QCPIKAN, a quantum-classical physics-informed Kolmogorov-Arnold network for solving partial differential equations. The hybrid framework combines Chebyshev-polynomial KAN layers with parameterized quantum circuits and embeds physical constraints into the training loss.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Developer Tools extract
    Leveraging systems' non-linearity to tackle the scarcity of data in the design of Intelligent Fault Diagnosis Systems
    Using system non-linearity to fight data scarcity in fault diagnosis
    Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning
    Deep transfer learning enables efficient construction of intelligent fault diagnosis systems. This work leverages systems' non-linearity to tackle the scarcity of data in their design.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Inference & Efficiency extract
    Token-Operations-Oriented Inference Optimization Techniques for Large Models
    Token-operation-oriented inference optimization for large models
    Inference Reinforcement Learning
    The paper proposes a token-operations-oriented framework for large-model inference optimization, presenting a four-layer technical architecture aimed at scalable, low-cost, and stable large-model services. The layers include components such as multi-model fusion.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Shifting-based Optimizable Linear Relaxations for General Activation Functions
    Shifting-based linear relaxations for general activation functions
    Deep Learning Neural Network
    The paper proposes a shifting-based method for constructing optimizable linear relaxations of general activation functions, used in formal verification of neural networks. It removes the need for hand-crafted relaxations tailored to each activation, supporting formal guarantees in safety- and security-critical settings.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN New Model Releases extract
    Integrating national forest inventory, airborne lidar, and satellite imagery for wall-to-wall mapping of forest structure with computer vision
    Wall-to-wall forest structure mapping from inventory, lidar, imagery
    Computer Vision Neural Network
    The paper integrates national forest inventory data, airborne lidar, and satellite imagery with computer vision to produce wall-to-wall maps of forest structure. It targets the persistent need for annually updated, large-landscape maps to support forest and wildfire risk management.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.CL (Computation and Language) · EN Multimodal extract
    PsyScore: A Psychometrically-Aware Framework for Trait-Adaptive Essay Scoring and ZPD-Scaffolded Feedback
    PsyScore: psychometric essay scoring with scaffolded feedback
    Retrieval-Augmented Generation (RAG)
    The paper presents PsyScore, a psychometrically-aware framework for automated essay scoring that adapts to writing traits and provides ZPD-scaffolded feedback. It aims to unify scoring and feedback, which existing methods treat separately, balancing reliable assessment with interpretable, actionable instruction.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Developer Tools extract
    Boundary Embedding Shaping with Adaptive Contrastive Learning for Graph Structural Disentanglement
    Boundary embedding shaping for graph structural disentanglement
    Embeddings Machine Learning Neural Network Retrieval-Augmented Generation (RAG)
    Graph neural networks aggregate neighbor information for classification but entangle structure. This work proposes boundary embedding shaping with adaptive contrastive learning for graph structural disentanglement.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Multimodal extract
    Confidence-Aware Automated Assessment of Student-Drawn Scientific Models
    Confidence-aware automated assessment of student-drawn science models
    Deep Learning Retrieval-Augmented Generation (RAG) Transformer
    Student-generated drawings are widely used in science education to assess conceptual understanding. This work introduces confidence-aware automated assessment of student-drawn scientific models.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN New Model Releases extract
    Editorial Alignment: A Participatory Approach to Engaging Editorial Expertise in LLM-mediated Knowledge Dissemination
    Editorial alignment: engaging editorial expertise in LLM knowledge dissemination
    LLM-driven information services are reshaping how public knowledge is produced. This work proposes a participatory approach to engage editorial expertise in LLM-mediated knowledge dissemination.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN New Model Releases extract
    The Register Gap: A Meaning Intelligence Framework for Nigerian Public Discourse
    The Register Gap: a meaning intelligence framework for Nigerian discourse
    Deep Learning Gemini Neural Network Retrieval-Augmented Generation (RAG)
    This work introduces the Meaning Intelligence Framework, a nine-dimension annotation and evaluation scheme, to study the register gap in Nigerian public discourse.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Multimodal extract
    SPOT-E: Test-Time Entropy Shaping with Visual Spotlights for Frozen VLMs
    SPOT-E: test-time entropy shaping with visual spotlights for frozen VLMs
    Computer Vision Inference Reinforcement Learning Software Engineering
    Vision-language models often underperform on evidence-intensive tasks by missing decisive visual cues. SPOT-E applies test-time entropy shaping with visual spotlights to improve frozen VLMs.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Developer Tools extract
    A Multi-Agent system for Multi-Objective constrained optimization
    A multi-agent system for multi-objective constrained optimization
    Embeddings Reinforcement Learning
    Many decision-making problems in computing and networking can be cast as constrained optimization. This work presents a multi-agent system for multi-objective constrained optimization.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    ScholarQuest: A Taxonomy-Guided Benchmark for Agentic Academic Paper Search in Open Literature Environments
    ScholarQuest: a taxonomy-guided benchmark for agentic paper search
    AI Agents Software Engineering
    Academic paper search is a core step in research, and LLM-based search agents are emerging. ScholarQuest provides a taxonomy-guided benchmark for agentic academic paper search in open literature environments.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Developer Tools extract
    Thermodynamic Measure of Intelligence
    A thermodynamic measure of intelligence
    Neural Network Reinforcement Learning
    Can intelligence be measured? This work proposes defining intelligence as the lawful amplification of order and develops a thermodynamic measure to quantify it.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Training & Fine-tuning extract
    Actionable Activation Directions for Detecting and Mitigating Emergent Misalignment Across Language Model Families
    Activation directions for detecting emergent misalignment in LLMs
    Fine-tuning Llama Reinforcement Learning
    The paper investigates whether emergent misalignment—induced by fine-tuning language models on insecure code—corresponds to a causally actionable, shared direction in activation space. Across four instruction-tuned model families, it studies using such directions to detect and mitigate the misalignment.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    CzechDocs: A Multiway Parallel Dataset of Formatted Documents for Minority Languages in Czechia
    CzechDocs: a parallel formatted-document MT dataset for Czechia
    Machine Learning
    The paper presents CzechDocs, a multiway parallel dataset of formatted documents in HTML, DOCX, and PDF covering Czech and minority languages used in Czechia—primarily Ukrainian and English, with smaller amounts of Vietnamese, Russian, and others. It is designed to support evaluation of machine translation.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Industry Adoption extract
    Augmenting Game AI with Deep Reinforcement Learning
    Augmenting game AI with deep reinforcement learning
    AI Agents Machine Learning Reinforcement Learning
    Immersion in video games depends not only on graphics, audio, and mechanics but also on the quality of game AI. This work augments game AI using deep reinforcement learning.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Off-Policy Evaluation for Missingness-Aware Policies in MDPs with Rewards Missing Not at Random
    Off-policy evaluation when rewards are missing not at random
    Reinforcement Learning
    The paper studies off-policy evaluation in finite-horizon MDPs when rewards are missing not at random, as in offline reinforcement learning with sparse, irregular, or censored reward records. It develops missingness-aware policies for settings such as health care and marketing.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    Apparent Psychological Profiles of Large Language Models are Largely a Measurement Artifact
    LLM psychological profiles are largely a measurement artifact
    Deep Learning Neural Network
    Psychological instruments designed for humans are increasingly applied to large language models. This work argues that the apparent psychological profiles of LLMs are largely a measurement artifact.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    Pitch Spelling Jazz Lead Sheets, Solo Transcriptions, Classical Piano and Monophonic Scores
    An algorithm for pitch spelling and key estimation from MIDI
    The paper presents an algorithm for pitch spelling and key estimation across jazz lead sheets, solo transcriptions, classical piano, and monophonic scores. Given MIDI-like input with note pitches and bar boundaries, it estimates note names, a global key signature, and local scales.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • IEEE Spectrum (AI section) · EN Developer Tools extract
    Sound Waves Give Neuromorphic Chips a Brain-Simulating Edge
    Sound waves give neuromorphic chips a brain-simulating edge
    Neural Network
    Neuromorphic computing mimics the brain and can use far less energy than conventional AI chips, yet today devices remain simple. This piece reports research using sound (acoustic) waves to push neuromorphic chips toward more brain-like information processing.
    Read original (IEEE Spectrum (AI section)) ↗
  • arXiv cs.CL (Computation and Language) · EN New Model Releases extract
    MedRLM: Recursive Multimodal Health Intelligence for Long-Context Clinical Reasoning, Sensor-Guided Screening, Evidence-Grounded Decision Support, and Community-to-Tertiary Referral Optimization
    MedRLM: recursive multimodal AI for long-context clinical reasoning
    AI Agents Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning Software Engineering
    The paper introduces MedRLM, a recursive multimodal health-intelligence system for long-context clinical reasoning, sensor-guided screening, evidence-grounded decision support, and community-to-tertiary referral optimization. It targets reasoning over heterogeneous, longitudinal patient data, beyond the single-step prompting or retrieval of current medical LLMs.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Developer Tools extract
    From Texts to Scores: Tracing the Emergence of Essay Quality Representations in Large Language Models
    Tracing how LLMs represent essay quality internally
    Reinforcement Learning
    The paper traces how representations of essay quality emerge inside large language models used for automated essay scoring. It systematically analyzes the hidden representations of eight LLMs across two English essay datasets to better understand the internal mechanisms behind LLM-based scoring.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Multimodal extract
    Self-Preference Is Weak or Absent in Verifiable Instruction-Following Revision: A Four-Model Test Under Genuine Authorship
    Self-preference is weak in verifiable instruction-following revision
    Neural Network
    The paper tests whether large language models resist valid corrections to their own writing during verifiable instruction-following revision. Across four models under genuine authorship, it finds that the documented self-preference bias is weak or absent in this revision setting.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • Hacker News (Front Page) · EN Developer Tools extract
    CS 6120: Advanced Compilers: The Self-Guided Online Course (2020)
    CS 6120: a self-guided online course on advanced compilers
    A look at CS 6120, an advanced compilers course released as a self-guided online resource. With lectures and assignments, the free course lets learners study advanced topics such as optimization and program analysis on their own.
    Read original (Hacker News (Front Page)) ↗
  • arXiv cs.CL (Computation and Language) · EN New Model Releases extract
    What Makes Effective Supervision in Latent Chain-of-Thought: An Information-Theoretic Analysis
    An information-theoretic look at supervising latent chain-of-thought
    The paper gives an information-theoretic analysis of what makes supervision effective in latent chain-of-thought reasoning, which internalizes reasoning in continuous hidden states. It examines why outcome supervision provides weak learning signals, making robust latent reasoning difficult.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Developer Tools extract
    Generative Engine Optimization at Scale: Measuring Brand Visibility Across AI Search Engines
    Measuring brand visibility across AI search engines at scale
    Claude Gemini GPT Retrieval-Augmented Generation (RAG) Software Engineering
    The paper studies generative engine optimization at scale, measuring how brands are represented, cited, and recommended across AI search engines such as ChatGPT, Claude, Perplexity, and Gemini. It frames the shift from traditional SEO as users increasingly get answers directly from these engines.
    Read original (arXiv cs.CL (Computation and Language)) ↗