Training & Fine-tuning (Page 4 of 4)｜AI/Tech News Trends

arXiv cs.CL (Computation and Language) · 2026-07-28 EN Training & Fine-tuning

Instruction-Tuned Models Locally Reuse Human Syntax More Than Humans Do

Llama Reinforcement Learning

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-28 EN Industry Adoption

Empirical Evaluation of Out-Of-Distribution Performance of Tabular Foundation Models

Deep Learning Neural Network Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-28 EN Training & Fine-tuning

Physics-Aware End-to-End Deep Reinforcement Learning for Quadcopter Control with Actuator Dynamics

Algorithms & Theory Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-28 EN New Model Releases

Schrödinger's Cat: Probabilistic Representation and Prediction of Potential Scene Kinematics

Neural Network Reinforcement Learning

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-28 EN New Model Releases

Detecting Knowledge Inconsistencies Across Text, Tables, and Knowledge Graphs

Neural Network Retrieval-Augmented Generation (RAG) Software Engineering

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-28 EN Training & Fine-tuning

Large Language Model for Operations Research Formulation Selection in Multi-Warehouse Inventory Allocation

Fine-tuning Meta

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-28 EN Multimodal

Evaluating VLMs for Autonomous Agent-Driven Geometry Clipping Detection in Video Game QA

AI Agents Computer Vision Gemini GPT Llama

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

Publickey · 2026-07-28 JA New Model Releases extract

Google Cloud、AIが自律的にコードの脆弱性検出からサンドボックス内でのリスク検証、修正までを自動実行。「CodeMender」プレビュー公開

Google Cloud previews CodeMender, an AI agent that auto-fixes code flaws

AI Agents Google Machine Learning

Google Cloud unveiled a preview of CodeMender, an AI agent that autonomously detects code vulnerabilities, validates and reports the risk inside a sandbox, and then applies fixes. Google says it can uncover even complex flaws, aiming to automate security remediation.

Read original (Publickey) ↗

arXiv cs.LG (Machine Learning) · 2026-07-28 EN Multimodal

HiFi-UMI: Learning Deployable Manipulation Policies from High-Fidelity UMI Data Alone

Computer Vision Neural Network Reinforcement Learning

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-28 EN New Model Releases

DRIFT: Direct-Recursive Intervention-Conditioned Forecasting of ICU Physiological Trajectories

Retrieval-Augmented Generation (RAG) Reinforcement Learning from Human Feedback (RLHF) Transformer

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-28 EN Training & Fine-tuning

WALoMA: A Multitask Wireless Foundation Model via Adaptive Low-Rank Masked Autoencoders

Deep Learning Fine-tuning Neural Network Retrieval-Augmented Generation (RAG)

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-28 EN Training & Fine-tuning

Detecting CSAM Text-to-Image LoRAs From Weights

Fine-tuning Inference Meta

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-28 EN Training & Fine-tuning

Shared Voxel-Map-Based Cooperative Indoor UAV Guidance with a Multi-Agent Soft Actor-Critic Controller

Fine-tuning Neural Network Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-28 EN New Model Releases

Localized Adaptation Reveals Distinct Learning Signatures in Transformers

Deep Learning Neural Network Reinforcement Learning Transformer

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-28 EN Training & Fine-tuning

MemSFT: Mitigating Alignment Tax with an External Parametric Memory

Fine-tuning

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-28 EN New Model Releases

AMPBench-MT: A Homology-Controlled Benchmark for Antimicrobial Peptide Potency, Spectrum, and Safety Prediction

Embeddings Neural Network Reinforcement Learning from Human Feedback (RLHF)

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-27 EN Training & Fine-tuning

Towards Robust Reinforcement Learning for Small-Scale Language Model Agents

AI Agents Fine-tuning Neural Network Reinforcement Learning

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-27 EN Training & Fine-tuning

DS@GT ARC at CheckThat! 2026: LLM-Based Trace Ranking and Grouped Reward Modeling for Multilingual Numerical Claim Verification

Reinforcement Learning

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-27 EN Training & Fine-tuning

DataOrchestra: Learning to Orchestrate Per-Example Curation of Pretraining Data

Machine Learning Neural Network Retrieval-Augmented Generation (RAG)

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-27 EN Training & Fine-tuning

Beyond Scale and Generation: Understanding Language Model-based Entity Matching

Embeddings Fine-tuning Neural Network Reinforcement Learning

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-27 EN Industry Adoption

Artificial Intelligence and Innovation Ecosystem: Evolutionary Developments, Challenges, and Future Directions

Neural Network Retrieval-Augmented Generation (RAG)

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-27 EN Inference & Efficiency

Evaluating Fuzz Testing for Reinforcement Learning Agents

AI Agents Retrieval-Augmented Generation (RAG) Reinforcement Learning Robotics

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-27 EN Training & Fine-tuning

The Visual Bottleneck: Sparse-Frame Adaptation of MLLMs for Joint Spatial-Temporal Video Grounding

Fine-tuning Machine Learning Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-27 EN New Model Releases

EgoPlay: Event-Triggered Video Editing for Egocentric Streams

Deep Learning Fine-tuning Inference Neural Network Transformer

Read original (arXiv cs.AI (Artificial Intelligence)) ↗