New Model Releases (Page 3 of 11)｜AI/Tech News Trends

arXiv cs.CL (Computation and Language) · 2026-07-31 EN Inference & Efficiency

TransMem: Transforming Hidden States into Memory for Large Language Models

AI Agents Deep Learning Inference Retrieval-Augmented Generation (RAG)

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN New Model Releases

GoldenRetriever: Non-Interactive Homomorphic Encrypted Retrieval for Privacy-Preserving RAG

Retrieval-Augmented Generation (RAG)

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN New Model Releases

Mixture-of-Translators: Translating KV Caches Across Heterogeneous Large Language Models

Deep Learning GPT Neural Network Retrieval-Augmented Generation (RAG) Reinforcement Learning

Read original (arXiv cs.CL (Computation and Language)) ↗

ITmedia AI+ · 2026-07-31 JA Training & Fine-tuning extract

Thinking Machines、軽量モデル「Inkling-Small」正式公開　サイズ4分の1で「Inkling」に匹敵する性能

Thinking Machines releases Inkling-Small, matching Inkling at 1/4 the size

Reinforcement Learning

Thinking Machines Lab released the final version of Inkling-Small, an open-weight AI model. At a quarter the size of its predecessor, the company says data improvements and reinforcement learning let it match the larger Inkling on tasks such as code generation.

Read original (ITmedia AI+) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN New Model Releases

FairFund-Bench: Evaluating Distributive Bias in LLM Resource Allocation

Meta

Read original (arXiv cs.CL (Computation and Language)) ↗

ITmedia AI+ · 2026-07-31 JA New Model Releases extract

Google、ロボット向けAI「Gemini Robotics 2」発表　ヒューマノイドの全身制御や指先作業を実現

Google unveils Gemini Robotics 2 for whole-body and fine fingertip control

Gemini Google Inference Robotics

Google and Google DeepMind announced Gemini Robotics 2, a family of robotics AI models supporting humanoid whole-body control, fine fingertip manipulation, and multi-robot collaboration. The lineup includes the ER 2 reasoning model that acts as a high-level brain, plus lighter variants.

Read original (ITmedia AI+) ↗

ITmedia AI+ · 2026-07-31 JA New Model Releases extract

Claudeが評価環境から実在企業に不正アクセス――Anthropic、3件のインシデントを公表

Anthropic: Claude mistakenly accessed three real companies' infra during eval

Anthropic Claude

Anthropic disclosed that, during a cybersecurity evaluation, its Claude model reached the open internet through a misconfigured path and mistakenly accessed the production infrastructure of three real organizations. It published the three incidents, where an exercise environment unexpectedly touched live systems.

Read original (ITmedia AI+) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN New Model Releases

Token-Level Diagnosis of Sycophancy in LLMs with Attribution-Guided Steering

Inference Neural Network

Read original (arXiv cs.CL (Computation and Language)) ↗

Cohere Blog · 2026-07-31 EN New Model Releases extract

Cohere signs EU Code of Practice on Transparency of AI-Generated Content

Cohere signs EU Code of Practice on AI content transparency

Neural Network Reinforcement Learning

Cohere said it signed the EU Code of Practice on Transparency of AI-Generated Content, joining other companies committing to clearer labeling and provenance for AI outputs. The move signals alignment with Europe's emerging AI governance framework.

Read original (Cohere Blog) ↗

Simon Willison's Weblog · 2026-07-30 EN Infrastructure & Hardware extract

Advancing the price-performance frontier with GPT‑5.6

OpenAI slashes GPT-5.6 prices: Luna down 80%, Terra down 20%

Anthropic Gemini GPT Inference OpenAI

OpenAI announced steep price cuts for GPT-5.6, with Luna dropping 80% and Terra 20%. The company credits GPT-5.6 Sol for enabling the reduction by optimizing load balancing and even the model's forward pass, the computation that turns inputs into next-token predictions.

Read original (Simon Willison's Weblog) ↗

ITmedia AI+ · 2026-07-30 JA New Model Releases extract

OpenAI、「GPT-5.6 Luna」を80％値下げ　モデル自身による効率化でコスト削減

OpenAI cuts 'GPT-5.6 Luna' price by 80% via model-driven efficiency

GPT OpenAI

OpenAI cut the price of 'Luna' in its GPT-5.6 family by 80%, saying efficiency gains achieved by the model itself lowered costs. The move makes a high-performance model considerably cheaper, reflecting OpenAI's recent emphasis on price-performance.

Read original (ITmedia AI+) ↗

Simon Willison's Weblog · 2026-07-30 EN New Model Releases extract

llm 0.32rc2

llm 0.32rc2 switches its default model to GPT-5.6 Luna

GPT Machine Learning Neural Network OpenAI Reinforcement Learning from Human Feedback (RLHF)

Simon Willison released llm 0.32rc2, fixing a dependency issue and changing the default model for users who have not set one from GPT-4o mini to the newer, more capable GPT-5.6 Luna. Luna is slightly more expensive but a notable upgrade.

Read original (Simon Willison's Weblog) ↗