Infrastructure & Hardware B

Showing 31–60 of 110
  • arXiv cs.AI (Artificial Intelligence) · EN New Model Releases extract
    Essential Subspace Merging for Multi-Task Learning
    Essential subspace merging for multi-task model merging
    Inference Neural Network
    Model merging integrates the capabilities of several models fine-tuned from the same pretrained checkpoint into one, enabling multi-task learning. This work proposes Essential Subspace Merging, which extracts and merges each task's essential subspace to reduce interference and preserve multi-task performance.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • IEEE Spectrum (AI section) · EN Infrastructure & Hardware extract
    How Musicians Can Get Paid for Training AI
    IEEE Spectrum explores how musicians can be paid for AI training use
    Generative AI Reinforcement Learning
    IEEE Spectrum examines how musicians can be compensated when their music is used to train AI, covering attribution and payment for training-data use. This summary is title-based as the excerpt was blocked by a cookie/query-string wall and not retrieved; the specific mechanisms are per the article and unverified independently.
    Read original (IEEE Spectrum (AI section)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Training & Fine-tuning extract
    AdsMind: A Physics-Grounded Multi-Agent System for Self-Correcting Discovery of Adsorption Configurations on Heterogeneous Catalyst Surfaces
    AdsMind: physics-grounded multi-agent search for adsorption configs
    AI Agents Machine Intelligence Machine Learning
    Identifying the lowest-energy surface-adsorbate configuration is critical for modeling heterogeneous catalysis, but exhaustive ab initio exploration is prohibitive. AdsMind is a physics-grounded multi-agent system that self-corrects to efficiently discover adsorption configurations.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Infrastructure & Hardware extract
    Written by AI, Managed by AI: Semantic Space Control and Index Sickness Elimination Across 391 Consecutive Sessions
    Semantic-space control sustains 391 consecutive AI-run sessions
    Neural Network Reinforcement Learning
    The prevailing intuition for conceptual drift in long-horizon LLM collaboration is to trade more formal constraints for more reliable outputs. Across 391 consecutive sessions, this work studies semantic space control and 'index sickness' elimination in workflows written and managed by AI.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Quantifying and Auditing LLM Evaluation via Positive--Unlabeled Learning
    Auditing LLM-as-judge bias via positive-unlabeled learning
    Embeddings
    LLMs are increasingly used as judges for scalable evaluation, yet LLM-as-a-Judge systems show systematic biases decoupled from semantic quality, notably verbosity bias. This work uses positive-unlabeled learning to quantify and audit LLM evaluation, helping detect and correct such biases.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Adaptive Speech-to-Spike Encoding for Spiking Neural Networks
    Adaptive speech-to-spike encoding for spiking neural networks
    Deep Learning Google Neural Network Speech Processing
    The mismatch between continuous acoustic signals and discrete event-driven processing is a fundamental bottleneck for neuromorphic speech processing. Rather than fixed spike encoders, this work proposes adaptive speech-to-spike encoding for spiking neural networks, improving downstream performance.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Infrastructure & Hardware extract
    FoMoE: Breaking the Full-Replica Barrier with a Federation of MoEs
    FoMoE breaks the full-replica barrier with a federation of MoEs
    Mixture of Experts (MoE) Neural Network
    Pretraining LLMs typically demands large-scale infrastructure with tightly coupled accelerators. As model and data scale grow, FoMoE proposes a federation of Mixture-of-Experts that avoids replicating the full model across devices, breaking the full-replica barrier and easing infrastructure constraints.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN New Model Releases extract
    Spotlight: Synergizing Seed Exploration and Spot GPUs for DiT RL Post-Training
    Spotlight cuts DiT RL post-training cost with spot GPUs
    Deep Learning Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning Transformer
    Reinforcement learning post-training of Diffusion Transformers is prohibitively expensive, needing thousands of high-end GPUs. Spotlight synergizes seed exploration with cheap, preemptible spot GPUs to substantially reduce the cost of DiT RL post-training.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN New Model Releases extract
    Enhancing Multilingual Reasoning via Steerable Model Merging
    Enhancing multilingual reasoning via steerable model merging
    Neural Network
    Model merging effectively composes the capabilities of a multilingual model and a reasoning model, achieving promising generalization on multilingual reasoning by aligning their feature spaces. This work introduces steerable model merging to control the composition and further boost multilingual reasoning.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN New Model Releases extract
    TRAP: Benchmark for Task-completion and Resistance to Active Privacy-extraction
    TRAP benchmarks agents on task completion and privacy resistance
    AI Agents Neural Network
    Agents are increasingly deployed in document-intensive workflows where sensitive private information is routine input—e.g., booking a flight needs passport numbers. TRAP is a benchmark evaluating agents on both task completion and resistance to active privacy-extraction attempts.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    G-IdiomAlign: A Gloss-Pivoted Benchmark for Cross-Lingual Idiom Alignment
    G-IdiomAlign: a gloss-pivoted cross-lingual idiom benchmark
    Embeddings
    Idioms resist literal cross-lingual mapping because they are non-compositional. G-IdiomAlign anchors each idiom to an English Wiktionary gloss and adds a high-confidence reference alignment set. Two protocols (multiple-choice idiom equivalence and gloss-contrastive generation) isolate the effect of explicit glosses.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Inference & Efficiency extract
    Decoupling Search from Reasoning: A Vendor-Agnostic Grounding Architecture for LLM Agents
    Decoupling search from reasoning: a vendor-agnostic grounding architecture
    AI Agents Deep Learning Model Context Protocol (MCP) Reinforcement Learning Software Engineering
    Production LLM agents increasingly depend on real-time search but get locked into vendor-specific grounding. This work decouples search from reasoning with a vendor-agnostic grounding architecture, letting search backends be swapped while preserving reasoning quality.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    Graph-ESBMC-PLC: Formal Verification of Graphical PLCopen XML Ladder Diagram Programs Using SMT-Based Model Checking
    Graph-ESBMC-PLC: SMT-based verification of PLCopen ladder diagrams
    Inference Machine Learning Neural Network
    PLCopen XML defines encodings for IEC 61131-3 Ladder Diagrams. Graph-ESBMC-PLC applies SMT-based model checking to formally verify graphical PLCopen XML Ladder Diagram programs, supporting correctness checking of industrial control software.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Inference & Efficiency extract
    Approximate Structured Diffusion for Sequence Labelling
    Approximate structured diffusion for sequence labelling
    Inference Machine Learning Neural Network Natural Language Processing (NLP) Retrieval-Augmented Generation (RAG)
    Sequence labelling is a core NLP task. This work proposes an approximate structured diffusion approach that models label dependencies while keeping sequence labelling efficient.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Infrastructure & Hardware extract
    Morpheus: A Morphology-Aware Neural Tokenizer and Word Embedder for Turkish
    Morpheus: a morphology-aware neural tokenizer and embedder for Turkish
    Embeddings Inference Retrieval-Augmented Generation (RAG)
    Turkish is agglutinative, with meaning carried by morphemes that subword tokenizers fail to capture. Morpheus is a morphology-aware neural tokenizer and word embedder designed to improve Turkish language processing.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • Cohere Blog · EN Inference & Efficiency extract
    LLM Serving Fairness: No more noisy neighbours
    Cohere ensures fair compute sharing across LLM serving tenants
    Deep Learning Inference Meta Neural Network Reinforcement Learning
    Cohere details how it ensures every tenant gets a fair share of compute in LLM serving, tackling the 'noisy neighbour' problem where one user monopolizes resources. The design allocates capacity fairly across tenants to deliver stable, predictable multi-tenant performance.
    Read original (Cohere Blog) ↗
  • NVIDIA Developer Blog · EN Agents & Tool Use extract
    Building AI Agents for AR Glasses and XR Devices with NVIDIA XR AI
    NVIDIA unveils XR AI to build AI agents for AR glasses and XR devices
    AI Agents Computer Vision Generative AI NVIDIA
    NVIDIA introduced NVIDIA XR AI, a framework for developers to build AI agents for AR glasses and wearable XR devices. It targets the gap between ready hardware and the work of integrating live, real-time AI experiences. Capabilities are per NVIDIA's own announcement; third-party verification pending.
    Read original (NVIDIA Developer Blog) ↗
  • NVIDIA Developer Blog · EN Infrastructure & Hardware extract
    Build Your Own Transaction Foundation Model for Financial Intelligence
    NVIDIA details building a transaction foundation model for finance
    NVIDIA
    NVIDIA's developer blog walks through how to build your own transaction foundation model aimed at financial-intelligence use cases such as fraud detection and risk analysis. Specifics and claimed benefits come from NVIDIA's own post; independent verification is pending, as the raw excerpt was unavailable and this is summarized from the title and source.
    Read original (NVIDIA Developer Blog) ↗
  • ITmedia AI+ · JA Infrastructure & Hardware extract
    生成AI×自動運転で注目のTesla・Waymo・NVIDIA 各社が目指す「フィジカルAI」は何が違うのか
    How Tesla, Waymo and NVIDIA differ on 'physical AI' for driving
    NVIDIA
    ITmedia surveys 'physical AI'—a strategic focus area for Japan's government—through the lens of autonomous driving. The article reviews how advances in generative AI are reshaping the competition and compares the latest moves and differing approaches of Tesla, Waymo and NVIDIA.
    Read original (ITmedia AI+) ↗
  • arXiv cs.LG (Machine Learning) · EN Infrastructure & Hardware extract
    Adaptive Volumetric Mechanical Property Fields Invariant to Resolution
    AdaVoMP predicts resolution-invariant mechanical property fields for 3D
    Reinforcement Learning Transformer
    Reliable physics simulation needs Young's modulus, Poisson's ratio and density, which most 3D assets lack. AdaVoMP predicts dense, spatially varying values of these properties for input 3D objects in a way invariant to resolution across representations.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Infrastructure & Hardware extract
    Finite-Time Queue Peak Laws in Stochastic Networks: Logarithmic Scaling After Geometric Thresholds
    Finite-time queue-peak laws show log scaling after geometric thresholds
    Neural Network
    Studying finite-horizon queue peaks in generalized switches, where many queues share constrained service resources, the paper derives laws under a uniform interior-slack load condition showing logarithmic scaling of peaks after geometric thresholds.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • NVIDIA Developer Blog · EN Infrastructure & Hardware extract
    Build On-Device AI Companions with the NVIDIA ACE Game Agent SDK and Unreal Engine 5 Plugins
    NVIDIA unveils ACE Game Agent SDK and UE5 plugins for on-device AI
    Deep Learning NVIDIA
    NVIDIA announced the ACE Game Agent SDK and Unreal Engine 5 plugins for developers to build on-device AI companions—AI agents that run locally on the device rather than in the cloud—for in-game characters. The export raw_excerpt was blocked (cookie/query string data), so this is summarized neutrally from the title and the NVIDIA developer blog framing; specific figures and performance claims are unverified.
    Read original (NVIDIA Developer Blog) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    Towards Understanding and Measuring COGNITIVE ATROPHY in LLM Behaviour
    Formalizing 'cognitive atrophy' as a process-level measure of LLM behaviour
    Neural Network
    The paper formalizes 'cognitive atrophy,' a process-level behavioural measure of AI-mediated mental-health support, capturing whether interactions help users keep reflecting, coping, and deciding, a dimension distinct from safety and static response quality.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.CL (Computation and Language) · EN New Model Releases extract
    Unintended Effects of Geographic Conditioning in Large Language Models
    Unintended regional biases from geographic conditioning in LLMs
    Claude Llama Meta Neural Network Reinforcement Learning
    Conversational AI localizes responses using user metadata, yet the regional biases this hidden context introduces remain poorly understood. The paper analyzes the unintended effects of geographic conditioning on large language model outputs.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.LG (Machine Learning) · EN Inference & Efficiency extract
    Ternary Mamba: Grouped Quantization-Aware Training of W1.58A16 State Space Models
    Ternary Mamba: grouped QAT for W1.58A16 state space models
    Inference Quantization Retrieval-Augmented Generation (RAG) Transformer
    Ternary Mamba applies grouped quantization-aware training to Mamba state space models with ternary (W1.58) weights and 16-bit activations, targeting efficient low-bit training and inference of sequence models while preserving accuracy.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.CL (Computation and Language) · EN New Model Releases extract
    HistoRAG: Embedding Historical Methodology in Retrieval-Augmented Generation Through Critical Technical Practice
    HistoRAG embeds historical methodology into RAG via critical practice
    Embeddings Retrieval-Augmented Generation (RAG) Reinforcement Learning Software Engineering
    RAG grounds model outputs in external evidence, but its dominant evaluations and defaults are oriented toward factual question answering. HistoRAG embeds historical methodology into retrieval-augmented generation through critical technical practice for interpretive historical studies.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • NVIDIA Developer Blog · EN Infrastructure & Hardware extract
    How to Optimize Transformer-Based Models for Low-Precision Training
    NVIDIA guide on optimizing transformer models for low-precision training
    Generative AI NVIDIA Transformer
    An NVIDIA technical post explains techniques for optimizing transformer-based models during low-precision training. The export raw_excerpt was blocked (cookie/query data), so this summary is based only on the title and source; specific methods and figures are unverified.
    Read original (NVIDIA Developer Blog) ↗
  • arXiv cs.LG (Machine Learning) · EN Infrastructure & Hardware extract
    Tensor-based second-order causal discovery
    Tensor-based second-order causal discovery (TSCD)
    Deep Learning
    To uncover causal dependencies among variables, the paper proposes TSCD, a tensor-based second-order causal discovery algorithm whose input is a tensor formed from covariance matrices of observational and interventional data, assuming linear structural equations.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    Agentic AI-based Framework for Mitigating Premature Diagnostic Handoff and Silent Hallucination in Healthcare Applications
    A multi-agent framework against premature handoff and silent hallucination
    AI Agents Llama
    The paper proposes a multi-agent framework for healthcare that mitigates premature diagnostic handoff and silent clinical hallucinations, replacing LLM-as-a-judge routing with deterministic orchestration constraints and adding two safety mechanisms.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.LG (Machine Learning) · EN Infrastructure & Hardware extract
    ConTex: Reformulating Counterfactual Generation For Time Series Forecasting
    ConTex reformulates counterfactual generation for time-series forecasting
    Deep Learning
    Decision-making with deep time-series forecasting needs not just accurate predictions but actionable insight, which current architectures lack. ConTex reformulates counterfactual generation to indicate how present conditions must change to shift a predicted outcome toward a desired future.
    Read original (arXiv cs.LG (Machine Learning)) ↗