Safety & Evaluation A

Showing 31–60 of 317
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Quantum ring all-reduce: communication and privacy advantages for distributed learning
    Quantum ring all-reduce for efficient, private distributed learning
    Deep Learning Machine Learning
    The paper proposes a quantum ring all-reduce scheme for distributed learning, arguing that quantum communication can make distributed training both more communication-efficient and information-theoretically private. The approach is discussed for both classical and quantum settings.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    A Model-Driven Approach for Developing Families of Reinforcement Learning Environments
    A model-driven approach to building families of RL environments
    AI Agents Reinforcement Learning
    The paper presents a model-driven approach for developing families of reinforcement learning environments. It treats virtual training environments as software-intensive systems and aims to make building these safe, cost-efficient alternatives to real-world training more systematic.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Statistical Properties of Training & Generalization
    Physics-informed view of training and generalization in deep learning
    Deep Learning Machine Learning Reinforcement Learning
    The article investigates the key features and surprises of deep learning's training and generalization from a physics-informed perspective. It examines how deep learning departs from classical statistical intuitions to achieve strong real-world performance, justifying these observations where possible.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Shifting-based Optimizable Linear Relaxations for General Activation Functions
    Shifting-based linear relaxations for general activation functions
    Deep Learning Neural Network
    The paper proposes a shifting-based method for constructing optimizable linear relaxations of general activation functions, used in formal verification of neural networks. It removes the need for hand-crafted relaxations tailored to each activation, supporting formal guarantees in safety- and security-critical settings.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.CL (Computation and Language) · EN Multimodal extract
    PsyScore: A Psychometrically-Aware Framework for Trait-Adaptive Essay Scoring and ZPD-Scaffolded Feedback
    PsyScore: psychometric essay scoring with scaffolded feedback
    Retrieval-Augmented Generation (RAG)
    The paper presents PsyScore, a psychometrically-aware framework for automated essay scoring that adapts to writing traits and provides ZPD-scaffolded feedback. It aims to unify scoring and feedback, which existing methods treat separately, balancing reliable assessment with interpretable, actionable instruction.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN New Model Releases extract
    Editorial Alignment: A Participatory Approach to Engaging Editorial Expertise in LLM-mediated Knowledge Dissemination
    Editorial alignment: engaging editorial expertise in LLM knowledge dissemination
    LLM-driven information services are reshaping how public knowledge is produced. This work proposes a participatory approach to engage editorial expertise in LLM-mediated knowledge dissemination.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN New Model Releases extract
    The Register Gap: A Meaning Intelligence Framework for Nigerian Public Discourse
    The Register Gap: a meaning intelligence framework for Nigerian discourse
    Deep Learning Gemini Neural Network Retrieval-Augmented Generation (RAG)
    This work introduces the Meaning Intelligence Framework, a nine-dimension annotation and evaluation scheme, to study the register gap in Nigerian public discourse.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Training & Fine-tuning extract
    Finetuning Vision-Language-Action Models Requires Fewer Layers Than You Think
    Finetuning vision-language-action models needs fewer layers than expected
    Computer Vision Fine-tuning Inference Machine Learning Reinforcement Learning
    Vision-Language-Action models pre-trained on massive video-robot datasets have transformed robot control. This work shows that finetuning them requires fewer layers than previously assumed.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    ScholarQuest: A Taxonomy-Guided Benchmark for Agentic Academic Paper Search in Open Literature Environments
    ScholarQuest: a taxonomy-guided benchmark for agentic paper search
    AI Agents Software Engineering
    Academic paper search is a core step in research, and LLM-based search agents are emerging. ScholarQuest provides a taxonomy-guided benchmark for agentic academic paper search in open literature environments.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    QMFOL: Benchmarking Large Language Model Reasoning via Quantifiable Monadic First-Order Logic Test Case Generation
    QMFOL: benchmarking LLM reasoning via first-order logic test generation
    Reinforcement Learning
    Large language models have advanced in reasoning, especially deduction. QMFOL benchmarks LLM reasoning through quantifiable monadic first-order logic test-case generation.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Training & Fine-tuning extract
    Actionable Activation Directions for Detecting and Mitigating Emergent Misalignment Across Language Model Families
    Activation directions for detecting emergent misalignment in LLMs
    Fine-tuning Llama Reinforcement Learning
    The paper investigates whether emergent misalignment—induced by fine-tuning language models on insecure code—corresponds to a causally actionable, shared direction in activation space. Across four instruction-tuned model families, it studies using such directions to detect and mitigate the misalignment.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    Learner-based Concept Drift Detection: Analysis and Evaluation
    Learner-based concept drift detection: analysis and evaluation
    Algorithms & Theory Deep Learning Machine Learning Reinforcement Learning
    Machine learning deployed in evolving streaming environments must handle non-stationarity. This work analyzes and evaluates learner-based approaches to concept drift detection.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    CzechDocs: A Multiway Parallel Dataset of Formatted Documents for Minority Languages in Czechia
    CzechDocs: a parallel formatted-document MT dataset for Czechia
    Machine Learning
    The paper presents CzechDocs, a multiway parallel dataset of formatted documents in HTML, DOCX, and PDF covering Czech and minority languages used in Czechia—primarily Ukrainian and English, with smaller amounts of Vietnamese, Russian, and others. It is designed to support evaluation of machine translation.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    Beyond Accuracy: Measuring Logical Compliance of Predictive Models
    Beyond accuracy: measuring logical compliance of predictive models
    Embeddings Machine Learning Reinforcement Learning
    Machine learning models are mostly evaluated through predictive metrics such as accuracy. This work goes beyond accuracy to measure the logical compliance of predictive models.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Off-Policy Evaluation for Missingness-Aware Policies in MDPs with Rewards Missing Not at Random
    Off-policy evaluation when rewards are missing not at random
    Reinforcement Learning
    The paper studies off-policy evaluation in finite-horizon MDPs when rewards are missing not at random, as in offline reinforcement learning with sparse, irregular, or censored reward records. It develops missingness-aware policies for settings such as health care and marketing.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    Apparent Psychological Profiles of Large Language Models are Largely a Measurement Artifact
    LLM psychological profiles are largely a measurement artifact
    Deep Learning Neural Network
    Psychological instruments designed for humans are increasingly applied to large language models. This work argues that the apparent psychological profiles of LLMs are largely a measurement artifact.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    Pitch Spelling Jazz Lead Sheets, Solo Transcriptions, Classical Piano and Monophonic Scores
    An algorithm for pitch spelling and key estimation from MIDI
    The paper presents an algorithm for pitch spelling and key estimation across jazz lead sheets, solo transcriptions, classical piano, and monophonic scores. Given MIDI-like input with note pitches and bar boundaries, it estimates note names, a global key signature, and local scales.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    HilDA: Hierarchical Distillation with Diffusion for Advancing Self-Supervised LiDAR Pre-trainin
    HilDA: hierarchical distillation with diffusion for self-supervised LiDAR
    Computer Vision Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning
    Using vision foundation models for camera-to-LiDAR knowledge distillation is promising. HilDA advances self-supervised LiDAR pre-training through hierarchical distillation with diffusion.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    ReNikud: Audio-Supervised Hebrew Grapheme-to-Phoneme Conversion
    ReNikud: audio-supervised Hebrew grapheme-to-phoneme conversion
    Neural Network Speech Processing
    The paper presents ReNikud, an audio-supervised approach to grapheme-to-phoneme conversion for Modern Hebrew. It addresses the ambiguity of Hebrew's abjad script, which leaves vowels largely unwritten, going beyond standard pipelines that first predict vowel diacritics (nikud).
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN New Model Releases extract
    NAMESAKES: Probing Identity Memorization in Text-to-Image Models
    NAMESAKES: probing identity memorization in text-to-image models
    Neural Network
    The paper introduces NAMESAKES, a study probing identity memorization in text-to-image models, which can generate realistic likenesses of individuals from their names. It addresses the difficulty of telling whether a generated face is memorized or fabricated without ground-truth photos, training data, or white-box model access.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    Learning to Prompt: Improving Student Engagement with Adaptive LLM-based High-School Tutoring
    Adaptive subject-aware prompting for LLM high-school tutoring
    The paper develops an adaptive LLM-based high-school tutoring system with subject-aware prompting, built on 14 pedagogical features—such as tutor scaffolding and student understanding—extracted from transcripts. It aims to improve student engagement where static-prompt tutoring struggles to adapt across disciplines.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Inference & Efficiency extract
    When Does Streaming Tool Use Help? Characterizing Tool-Intent Stabilization in Streaming Retrieval-Augmented Generation
    When does streaming tool use help in streaming RAG?
    Retrieval-Augmented Generation (RAG) Reinforcement Learning Software Engineering
    The paper characterizes when streaming tool use helps in streaming retrieval-augmented generation, which issues tool queries in parallel with ongoing user input to cut perceived latency. It argues the benefit is query-intrinsic and studies how tool intent stabilizes before an utterance is complete.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Multimodal extract
    Self-Preference Is Weak or Absent in Verifiable Instruction-Following Revision: A Four-Model Test Under Genuine Authorship
    Self-preference is weak in verifiable instruction-following revision
    Neural Network
    The paper tests whether large language models resist valid corrections to their own writing during verifiable instruction-following revision. Across four models under genuine authorship, it finds that the documented self-preference bias is weak or absent in this revision setting.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    IHUBERT: Vector-Based Semantic Deduplication and Domain-Balanced Pretraining for Persian Resources
    IHUBERT: a Persian language model with semantic dedup pretraining
    Reinforcement Learning Software Engineering
    The paper presents IHUBERT, a monolingual Persian pretrained language model trained from scratch on a RoBERTa-base encoder. It uses vector-based semantic deduplication and domain-balanced pretraining to address the scarcity of large, high-quality Persian corpora and limited evaluation.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • OpenAI Blog · EN New Model Releases extract
    Improving health intelligence in ChatGPT
    OpenAI improves ChatGPT health responses with GPT-5.5 Instant
    GPT
    OpenAI says GPT-5.5 Instant strengthens ChatGPT's health and wellness responses through better reasoning, richer context, and clearer communication. The work is backed by physician-informed evaluations aimed at delivering more reliable, trustworthy health guidance.
    Read original (OpenAI Blog) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    Source-Grounded Data Generation for Text-to-JSON Learning
    Source-grounded data generation for text-to-JSON extraction
    Reinforcement Learning
    The paper proposes source-grounded data generation for text-to-JSON learning, where models extract information from long unstructured documents into structured, machine-readable JSON. It targets domains such as financial filings and clinical records that store high-value information in unstructured text.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    When Lower Privileges Suffice: Investigating Over-Privileged Tool Selection in LLM Agents
    Investigating over-privileged tool selection in LLM agents
    AI Agents Meta Neural Network
    The paper investigates over-privileged tool selection in LLM agents, which autonomously choose among tools with different privilege levels. It addresses a gap in prior tool-selection research, which focuses on safety-agnostic metadata preferences, by studying when lower-privilege tools would suffice.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Agents & Tool Use extract
    Connect the Dots: Training LLMs for Long-Lifecycle Agents with Cross-Domain Generalization Via Reinforcement Learning
    Connect the Dots: RL training for long-lifecycle LLM agents
    AI Agents Meta Neural Network Reinforcement Learning
    The paper presents Connect the Dots (CoD), a reinforcement-learning framework for training large language models as long-lifecycle agents. It targets the meta-capability of solving a long sequence of tasks while continuously exploring an environment, aiming for cross-domain generalization.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Multimodal extract
    Investigating Human-Model Discrepancies in Speech Quality Assessment via Acoustic and Prosodic Perturbations
    Human-model gaps in speech quality assessment under perturbations
    Speech Processing
    The paper investigates discrepancies between human judgments and MOS prediction models in speech quality assessment, using controlled acoustic and prosodic perturbations. It probes whether these models, widely used as proxy metrics in text-to-speech research, capture quality differences beyond acoustic fidelity.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    Light-weight Pronunciation Assessment via Discrete Speech Token Surprisal
    Lightweight pronunciation assessment via speech token surprisal
    Inference Speech Processing
    The paper proposes a lightweight framework for automated pronunciation assessment based on discrete speech token surprisal, trained only on native speech resources. It operates unsupervised or with light calibration from a small set of scored utterances, avoiding costly labeled learner-error or non-native corpora.
    Read original (arXiv cs.CL (Computation and Language)) ↗