Safety & Evaluation (Page 2 of 4)｜AI/Tech News Trends

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN New Model Releases

EgoGenesis: Egocentric World-Action Modeling with Online Anchored Projective Memory and Action-3D RoPE

Embeddings Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN Safety & Evaluation

AI and Authenticity in Islamic Research: A Critical Evaluation of Generative AI Reliability, Hallucination, and Source Fidelity in Quranic, Hadith, and Fiqh Knowledge

Deep Learning Generative AI Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN Safety & Evaluation

Security of World-Model-Based Embodied AI: A Lifecycle of Threats, Defenses, and Evaluation

Neural Network Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-30 EN Infrastructure & Hardware

Fidelity Is Not Safety: Gently-Compressed LLMs Pass Every Data-Free Quality Guard Yet Invent Procedure Steps in Agentic Execution

Machine Learning Neural Network

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN Developer Tools

Integrating AI into Requirements Quality Learning in Software Engineering Education: A TPACK-Guided Empirical Study

Software Engineering

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN Inference & Efficiency

OPLD: On-Policy Latent Distillation for Multimodal Reasoning

Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN New Model Releases

Can Agents Deceive? Evaluating Reasoning and Deception in ParliamentBench using a Social Deduction Game

AI Agents DeepSeek GPT Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN Safety & Evaluation

Asymmetric Communication: Large Language Models and Language Games

Neural Network

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN Policy & Regulation

An Instrument to Evaluate Governance Proposals: AI Policy Analysis at Scale

Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN Safety & Evaluation

Diversifying Personalized Research Ideation against AI-Induced Homogenization

Deep Learning Retrieval-Augmented Generation (RAG)

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-30 EN Agents & Tool Use

ClawTrack: Towards Trace-Level Evaluation and Improvement of Real-World Autonomous Agents

AI Agents Reinforcement Learning

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-30 EN Safety & Evaluation

Measuring Alignment With Reader Highlights Net of Position and Length

Deep Learning Reinforcement Learning

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-30 EN Multimodal

DualAnchor: Preserving Language Priors and Improving Lexical Fidelity in Gloss-Free Sign Language Translation

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN Safety & Evaluation

Cost-Sensitive Conformal Prediction and Human-in-the-Loop Abstention for Imbalanced High-Stakes Decision Support: A Multi-Domain Benchmark

Retrieval-Augmented Generation (RAG) Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

NVIDIA Developer Blog · 2026-07-29 EN Agents & Tool Use

How to Self-Host a Validated AI Coding Assistant with NVIDIA NeMo Guardrails

NVIDIA: self-host a validated AI coding assistant via NeMo Guardrails

AI Agents Generative AI NVIDIA

An NVIDIA Developer Blog post on self-hosting a validated AI coding assistant with NeMo Guardrails, framed around agent operation, infrastructure, and safety. Note: the raw excerpt was blocked by a content guard, so the specific components, supported models, and guardrail rules are inferred from the title and URL only and have not been verified against the post body.

Read original (NVIDIA Developer Blog) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN New Model Releases

SciFigQual-Bench: A Benchmark for Scientific Figure Quality Assessment with Full-Manuscript Context

GPT Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN Safety & Evaluation

On-Policy Distillation for LLM Safety: A Routing Approach to Template-Robust Realignment

Fine-tuning Neural Network

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN Multimodal

Visual Credit Audit for Multimodal Spatial Reasoning

Machine Learning Neural Network Software Engineering

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN New Model Releases

SciFigAlign: Scoring Scientific Figures by Fine-tuned Alignment of Visuals with Manuscript Evidence

Machine Learning Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN New Model Releases

OptimismBench: Forecasting Bias and the Alignment Effect in Language Model Judgment

Anthropic

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN New Model Releases

TREK: A Travel Reasoning and Evaluation Kit for LLM Agents in Complex Trip Planning

AI Agents Neural Network

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN New Model Releases

Progressive Multimodal Alignment for Continual Instruction Tuning

Deep Learning Embeddings Machine Learning Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN New Model Releases

Belief-Guided Decision Making with Uncertainty Gating in the Game of Go

Deep Learning Inference Neural Network Reinforcement Learning Transformer

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN New Model Releases

Defending Against Backdoor Attacks via Alignment Checking in Model-Contrastive Federated Learning

Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN New Model Releases

BioVLN: A Simulation Platform for Visual Language Navigation in Biomedical Laboratories

AI Agents

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN New Model Releases

Dual-Path LLM Reasoning for Multimodal Few-Shot Knowledge Graph Completion

Reinforcement Learning

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN New Model Releases

From Found to Designed: Concepts as a Design Axis for Large Language Models

Inference

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN Inference & Efficiency

FedTopo: Relation-Level Topology Sharing for Model-Heterogeneous Federated Learning

Inference Neural Network

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN Multimodal

Dual Inversion for Text-to-Image Diffusion Models: From Both Prompt and Noise Perspectives

Computer Vision

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN Safety & Evaluation

MPEcho: A Melody and Phoneme-Aware Generative Framework for Controllable Cover Song Generation

Neural Network

Read original (arXiv cs.AI (Artificial Intelligence)) ↗