Funding & M&A C

Showing 31–60 of 73
  • Stratechery (free posts) · EN Safety & Evaluation extract
    The State of Fable, The Jailbreak Problem, SpaceX Acquires Cursor
    Stratechery on Fable's state, jailbreaks, and SpaceX buying Cursor
    Anthropic
    A Stratechery column by Ben Thompson on three topics: the state of Anthropic's Fable model, the AI jailbreak problem, and SpaceX's acquisition of Cursor. Thompson argues the administration is likely wrong about Fable but that responsibility ultimately lies with Anthropic. Views are the author's; deal specifics are unverified.
    Read original (Stratechery (free posts)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    Aligning Implied Statements for Implicit Hate Speech Generalizability with Context-Bounded Semi-hard Negative Mining
    Aligning implied statements for generalizable implicit hate detection
    Speech Processing
    Classifying implicit hate speech is hard because intent is rarely explicit. This work aligns implied statements and applies context-bounded semi-hard negative mining to improve the generalizability of implicit hate speech detection.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • ITmedia AI+ · JA New Model Releases extract
    Cursor、Gitホスティング「Origin」発表 SpaceXによる買収発表直後に
    Cursor unveils 'Origin' Git hosting, seen as a GitHub rival
    Cursor, the AI coding tool, announced 'Origin', a Git hosting service that the article frames as aimed at rivaling GitHub. The reveal reportedly came right after news of SpaceX acquiring Cursor. Acquisition terms and Origin's features are article-based, and third-party verification is unconfirmed.
    Read original (ITmedia AI+) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    LegalWorld: A Life-Cycle Interactive Environment for Legal Agents
    LegalWorld: a life-cycle interactive environment for legal agents
    AI Agents Neural Network Reinforcement Learning
    Civil litigation is inherently a life-cycle process where documents connect across stages. LegalWorld provides an interactive environment covering the full litigation life cycle, enabling legal agents to be evaluated and trained within that flow.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    LLMs Struggle to Measure What Distinguishes Students of Different Proficiency Levels: A Study of Item Discrimination in Reading Comprehension Assessment
    LLMs struggle to measure item discrimination in reading assessment
    Software Engineering
    Item discrimination is a fundamental psychometric property that distinguishes students of different proficiency. This study shows that large language models struggle to measure item discrimination in reading comprehension assessment, exposing limits of automated evaluation.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • ITmedia AI+ · JA Developer Tools extract
    SpaceX、AIコーディング「Cursor」を9.6兆円で買収 「近く大幅な改善」へ
    SpaceX reported to acquire AI coding tool Cursor for 9.6 trillion yen
    SpaceX is reported to be acquiring the AI coding tool "Cursor" for 9.6 trillion yen. Cursor said on its official X account that "major improvements are coming soon," according to the article. Deal details and the headline figure are based on the report and remain unverified by third parties.
    Read original (ITmedia AI+) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    The Stanford EDGAR Filings Dataset: Reconstructing U.S. Corporate and Financial Disclosures into Layout-Faithful and Token-Efficient Pretraining Data
    SEFD: an open, layout-faithful reconstruction of SEC filings for LLMs
    Reinforcement Learning
    The paper introduces the Stanford EDGAR Filings Dataset (SEFD), an open reconstruction of SEC filings into layout-faithful MultiMarkdown, providing audited financial disclosures as token-efficient pretraining and evaluation data for financial language modeling.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.LG (Machine Learning) · EN New Model Releases extract
    A Diffusion Approximation for Temporal-Difference Learning with Linear Features under Markovian Noise
    A diffusion approximation for TD learning under Markovian noise
    The classical continuous-time description of temporal-difference learning with linear features is an ODE capturing asymptotic mean dynamics but neglecting stochasticity. This work provides a diffusion approximation for TD learning under Markovian noise to capture those fluctuations.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Evaluating Open-Source LLMs for Multi-Label ATT&CK Technique Classification on CTI Reports
    Evaluating open-source LLMs for ATT&CK multi-label CTI classification
    Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning
    The paper evaluates open-source LLMs on multi-label classification of cyber threat intelligence (CTI) reports using MITRE ATT&CK techniques. Summary is title-based and neutral; details and figures are as presented by the source and not independently verified.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    The Measurement Gap in the Automation of EU Law: Benchmarking Doctrinal Legal Reasoning under the EU AI Act
    Benchmarking doctrinal legal reasoning under the EU AI Act
    Neural Network
    LLMs produce legal text of at least median quality, yet no benchmark evaluates doctrinal legal reasoning, the interpretive core of legal work. The paper benchmarks doctrinal reasoning under the EU AI Act and discusses the measurement gap in legal automation.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.LG (Machine Learning) · EN Inference & Efficiency extract
    Embedded Machine Learning for Microcontroller-Class Edge Devices: Data, Feature, Evaluation, and Deployment Pipelines
    A pipeline survey of embedded ML for microcontroller-class devices
    Inference Machine Learning Quantization
    Embedded machine learning moves inference from the cloud to resource-constrained devices. This practice-oriented synthesis lays out data, feature, evaluation and deployment pipelines for an embedded ML workflow on microcontroller-class platforms.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Edge Flow: A Tractable and Predictive Continuous-Time Model for Gradient Descent at the Edge of Stability
    Edge Flow: a tractable continuous-time model for GD at the edge of stability
    Deep Learning
    Gradient descent in deep learning can operate at the edge of stability, where the loss Hessian's top eigenvalue hovers near the stability threshold. Classical tools fail there, so Edge Flow offers a tractable, predictive continuous-time model of this regime.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Policy & Regulation extract
    When LLMs Analyze Scars: From Images to Clinically-Meaningful Features
    When LLMs analyze scars: images to clinically-meaningful features
    Deep Learning Neural Network Reinforcement Learning
    Medical image classification excels at scale but real clinics face data scarcity from annotation cost, privacy and disease rarity. Focusing on pathological scar classification, the paper uses LLMs to derive clinically-meaningful features from images.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Funding & M&A extract
    C2FL: Clustered Continual Federated Learning under Spatial and Temporal Drift
    C2FL: clustered continual federated learning under drift
    Machine Learning Retrieval-Augmented Generation (RAG)
    Collective adaptive systems let nodes learn from locally sensed data, but privacy-sensitive data and node mobility hinder scaling. C2FL proposes clustered continual federated learning that handles spatial and temporal drift.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Infrastructure & Hardware extract
    Multiple cyclicity and Wavelet Decomposition with Channel Correlation for Long-term Time Series Forecasting
    Multiple cyclicity and wavelet decomposition for long-term forecasting
    Neural Network Reinforcement Learning
    Cyclicity and trend are key components of time series, but prior work often neglects real-world inter-channel correlations. The paper combines multiple cyclicity with wavelet decomposition and channel correlation to improve long-term time series forecasting.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Dimensionality Controls When Modularity Helps in Continual Learning
    Dimensionality controls when modularity helps in continual learning
    Reinforcement Learning
    Compositional learning systems must balance plasticity and stability. The paper analyzes when modularity helps in continual learning and shows that the dimensionality of representations controls whether modular structure is beneficial.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.CL (Computation and Language) · EN Developer Tools extract
    A Framework for Evaluating Agentic Skills at Scale
    A framework for evaluating agentic skills at scale
    AI Agents Deep Learning Reinforcement Learning
    Agent skills, structured reusable knowledge artifacts that augment LLM agents, have been rapidly adopted, yet their cross-domain impact and a reusable methodology for evaluating individual skills are lacking. The paper presents a framework for evaluating agentic skills at scale.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • Stratechery (free posts) · EN Funding & M&A extract
    Fox Buys Roku, The Problem With Fox’s Smart Strategy, Streaming That Works
    Stratechery analyzes Fox's acquisition of Roku and its strategy
    Retrieval-Augmented Generation (RAG)
    Tech-analysis newsletter Stratechery examines Fox's acquisition of Roku. While the market reacted negatively, the piece argues Fox is trading direct extraction from rights holders for the leverage of operating as a distribution 'renter,' and reflects on streaming business models and media-company positioning.
    Read original (Stratechery (free posts)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Your Privacy My Cloak: Backdoor Attacks on Differentially Private Federated Learning
    DP can hide backdoors in federated learning, enabling RING attack
    Deep Learning Retrieval-Augmented Generation (RAG) Reinforcement Learning
    Challenging the belief that differential privacy (DP) makes federated learning robust to backdoors, the authors show empirically that complying with DP masks the statistical signatures defenses rely on, rendering them ineffective. They exploit this with RING, an attack that uses DP to conceal malicious contributions while maximizing impact, acting as a perturbation layer agnostic to the underlying backdoor technique.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Developer Tools extract
    Learning the Geometry of Data: A Mathematical Review of Shape Space Analysis
    A mathematical review of shape space analysis for geometric data
    Computer Vision Deep Learning Machine Learning Neural Network Reinforcement Learning
    This survey synthesizes the fast-growing literature on shape space analysis, a framework for data whose observations carry rich geometric form across biology, medicine, anthropology and vision. Drawing on differential geometry, statistics and ML, it organizes the work around a shared pipeline of shape representation, parameterization and metric construction.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    A Multi-Center Benchmark for Abdominal Disease Diagnosis and Report Generation from Non-Contrast CT
    Multi-center benchmark diagnoses abdominal disease from non-contrast CT
    Deep Learning Retrieval-Augmented Generation (RAG)
    The paper introduces a multi-center benchmark for multi-organ abdominal disease diagnosis and automated radiology report generation that synthesizes contrast-enhanced findings from single-phase non-contrast CT, aiming to cut contrast risks and radiologist workload. Using paired NCCT-CECT studies from two centers, it benchmarks five deep-learning architectures under a unified protocol.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Latent space mapping of interpretable structural coordinates from stochastic single-molecule signals
    Contrastive latent mapping of nanopore signals into molecular coordinates
    Nanopores are versatile single-molecule sensors, but stochastic translocation dynamics warp encoded information, limiting their utility. The paper shifts from time-domain analysis to a learned latent-space mapping via a contrastive encoder trained only on simulated signals from a physics-informed model. It maps nanopore signals of engineered DNA barcodes into an interpretable molecular coordinate system that responds to structural parameters but stays invariant to acquisition conditions.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    A Unified Causal-Origin Taxonomy of Distributional Shifts in Reinforcement Learning
    A unified causal-origin taxonomy of distributional shifts in RL
    Reinforcement Learning
    Reinforcement learning systems degrade when operating conditions diverge from training, reflecting distributional shifts in the data-generating process. These shifts arise between training and evaluation (ID vs. OOD generalization) or in non-stationary settings where dynamics evolve, yet their formal relationship is unclear and prior work emphasizes mitigation over causes. The paper proposes a unified taxonomy of the causal origins of shift within the agent-environment interaction.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.CL (Computation and Language) · EN Safety & Evaluation extract
    IMPACTeen: Intentions, Manipulation, Persuasion, Annotations, and Consequences in Teen Communication Dataset
    IMPACTeen: a teen-context dataset of social-influence scenarios and labels
    Neural Network
    The paper introduces IMPACTeen, a dataset of textual social-influence scenarios in adolescent interpersonal, media, and digital settings. It contains 1,021 texts and 5,100 annotation records labeled from five perspectives (teens, parents, psychologists, communication experts, teachers), built via constrained LLM generation plus two-step human editing, with Polish and English versions. Summarized neutrally from the abstract.
    Read original (arXiv cs.CL (Computation and Language)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    Upper Bounds on the Generalization Error of Deep Learning Models via Local Robustness and Stability
    Tighter deep-learning generalization bounds via local robustness
    Deep Learning Neural Network Reinforcement Learning
    Robustness-based generalization bounds are often vacuous in practice. The authors trace much of the looseness to the robustness term itself, especially for 0-1 loss, which is usually treated as a global measure. They propose a bound that scales the robustness term by the number of stable and unstable samples across input sub-regions, yielding tighter estimates.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.LG (Machine Learning) · EN Safety & Evaluation extract
    HawkesNest: A Multi-Axis Synthetic Benchmark for Spatiotemporal Pattern Complexity
    HawkesNest: a synthetic benchmark for spatiotemporal point process models
    Reinforcement Learning Software Engineering
    Evaluating spatiotemporal point process (STPP) models relies on opaque real datasets where failures are hard to attribute. HawkesNest is a generator-aligned synthetic benchmark built on a multivariate Hawkes backbone, defining four complexity axes with deterministic indices so models can be stress-tested under known structural difficulty.
    Read original (arXiv cs.LG (Machine Learning)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Industry Adoption extract
    Beyond Models: Reflections on Engineering AI-enabled Systems in a Project-Based Course
    Reflections on teaching the engineering of AI-enabled systems in a course
    Algorithms & Theory Machine Learning Neural Network Reinforcement Learning Software Engineering
    This paper reflects on a project-based master's course at the University of Bremen on engineering AI-enabled systems. It argues that machine learning courses emphasize model development while students lack experience in architectural design, deployment, and monitoring, and reports on the course's design and implementation.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    Robust Spoofed Speech Detection via Temporal Pyramid Modeling
    Temporal Pyramid modeling for robust, generalizable spoofed-speech detection
    Neural Network Speech Processing
    The paper proposes a Temporal Pyramid Adapter for spoofed speech detection, using parallel temporal convolutions with varying receptive fields to capture multi-scale cues from local artifacts to global prosodic irregularities. It combines self-supervised XLS-R representations with front-end adapters to improve cross-dataset generalization.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Funding & M&A extract
    ATOM-Bench: A Real-World Benchmark for Atomic Skills and Compositional Generalization in Manipulation Policies
    ATOM-Bench evaluates atomic skills and compositional generalization in robots
    Fine-tuning Reinforcement Learning
    The paper presents ATOM-Bench, a real-world benchmark for evaluating both atomic skills and compositional generalization in robotic manipulation policies. It factorizes tabletop manipulation into motor and instruction atoms, noting that a policy may succeed on demonstrated tasks yet fail to execute fine-grained skills or recombine them in new structures.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗
  • arXiv cs.AI (Artificial Intelligence) · EN Safety & Evaluation extract
    LabOSBench: Benchmarking Computer Use Agents for Scientific Instrument Control
    LabOSBench: a simulated testbed for computer-use agents controlling instruments
    AI Agents Computer Vision
    The paper proposes LabOSBench, a simulated yet realistic testbed for evaluating computer-use agents on scientific instrument control. It notes that existing benchmarks focus on software tasks in virtual systems, while real instruments require coordinated interface control and feedback-driven parameter tuning that are costly and risky to evaluate directly.
    Read original (arXiv cs.AI (Artificial Intelligence)) ↗