推論・効率化 (4 / 6 ページ)｜AI/Tech動向まとめ

arXiv cs.CL (Computation and Language) · 2026-07-30 EN 推論・効率化

A Sparse Glimpse of the Whole: Train-Free Self-Speculative Decoding

推論 (Inference)

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-30 EN 新モデル・リリース

Recall Before You Rank: Similarity-Guided Top-$K$ Reuse for Efficient Long-Context Attention

強化学習

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-30 EN 推論・効率化

Beyond Similarity: Grounded Agentic Extraction and Expert-Adjudicated Evaluation of Intertextuality in Classical Chinese Histories

推論 (Inference) ニューラルネットワーク

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-30 EN 推論・効率化

Prox: Training-Free FFN Activation Sparsity via Approximate Intermediate-Channel Salience in LLMs

推論 (Inference) ニューラルネットワーク量子化

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-29 EN インフラ・ハードウェア

From Classification to Regression: Using a Fruitfly to Solve Equations

埋め込み (Embeddings) 推論 (Inference) 検索拡張生成 (RAG)

元記事を読む (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN 推論・効率化

Improving Item Discoverability in e-Commerce Search via Related Intent Generation

推論 (Inference) 検索拡張生成 (RAG) 強化学習

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN 新モデル・リリース

OmegaUse-OfficeVal: Benchmarking LLM Agents on Long-Horizon Office-Suite Tasks with Economic Grounding

AI エージェント推論 (Inference) 検索拡張生成 (RAG)

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-29 EN 推論・効率化

Minimal Markovization via Stable Quotients in Holonomy-Cover Decision Processes

推論 (Inference) 強化学習

元記事を読む (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-29 EN 新モデル・リリース

InferScale: GPU-Native KV Injection for Personalized LLM Serving

深層学習埋め込み (Embeddings) ファインチューニング GPT 推論 (Inference)

元記事を読む (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN 安全性・評価

On-Policy Distillation for LLM Safety: A Routing Approach to Template-Robust Realignment

ファインチューニングニューラルネットワーク

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN 推論・効率化

CoCaRS: Correlation Calibration-Based Redundancy Suppression for Heterogeneous Knowledge Distillation

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-29 EN 推論・効率化

Mitigating Compounding Error via Video Representation Regularization

推論 (Inference) ニューラルネットワーク強化学習ロボティクス

元記事を読む (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 新モデル・リリース

Generation or Judgement? A Paradigm Perspective on LLM-Based Emotion-Cause Pair Extraction in Conversation

深層学習推論 (Inference)

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN 新モデル・リリース

Belief-Guided Decision Making with Uncertainty Gating in the Game of Go

深層学習推論 (Inference) ニューラルネットワーク強化学習 Transformer

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 推論・効率化

DIRECT: Direct Decoding for Efficient and Aligned Sequence Labeling with Large Language Models

ファインチューニング推論 (Inference) 人間のフィードバックによる強化学習 (RLHF)

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 新モデル・リリース

SERPO: Self-Evolving Rubric Policy Optimization for Open-Ended Test-Time Reinforcement Learning

推論 (Inference) ニューラルネットワーク検索拡張生成 (RAG) 強化学習ソフトウェア工学

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-29 EN 推論・効率化

No Data Is Not No Risk: Visibility Aware Graph-Based Inference of Business Conduct Risk

推論 (Inference) 検索拡張生成 (RAG) 強化学習

元記事を読む (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN 新モデル・リリース

Budget-Aware LLM Discovery via Cost-Calibrated Frontier Utility

GPT 推論 (Inference)

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 新モデル・リリース

From Found to Designed: Concepts as a Design Axis for Large Language Models

推論 (Inference)

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN 推論・効率化

FedTopo: Relation-Level Topology Sharing for Model-Heterogeneous Federated Learning

推論 (Inference) ニューラルネットワーク

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN 新モデル・リリース

See2Think: Do Multimodal Models Really Use Intermediate Visual States?

推論 (Inference) ニューラルネットワーク検索拡張生成 (RAG) 強化学習ソフトウェア工学

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN 開発者ツール

MediaWiki Code2Code Search: Neural Retrieval for the Semantic Discovery of Open-Source Software Entities

深層学習

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 新モデル・リリース

Metis: Memory Foundation Model

AI エージェント推論 (Inference) 強化学習

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-29 EN インフラ・ハードウェア

AgenticCANN: Automated Ascend C Operator Generation via Knowledge-Augmented Agentic Evolution

推論 (Inference) ニューラルネットワーク

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 推論・効率化

Filesystem-Based Memory for LLM Agents: Organization, Evolution, and Sustainability

AI エージェントソフトウェア工学

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 推論・効率化

Revisiting Lossy Verification in Speculative Decoding: Mechanisms, Trade-offs, and Failure Modes

推論 (Inference)

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 学習・ファインチューニング

FedWeave: Rethinking the Unit of Specialization in Heterogeneous Federated MoE-LoRA

推論 (Inference) Mixture of Experts (MoE) 検索拡張生成 (RAG) 強化学習

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 推論・効率化

Where Detectors Fail: Closing the Tail-Domain Gap with Expert-Guided Mutual Distillation

ニューラルネットワーク検索拡張生成 (RAG) 強化学習

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 推論・効率化

Which RAG Paradigm Wins at Scale? A Scaling Study of Retrieval-Augmented Generation Paradigms

検索拡張生成 (RAG) 強化学習ソフトウェア工学

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-29 EN 推論・効率化

Voice Memory for Agentic Speech Recognition

推論 (Inference) 音声処理

元記事を読む (arXiv cs.CL (Computation and Language)) ↗