New Model Releases｜AI/Tech News Trends

Publickey · 2026-08-03 JA New Model Releases

Rust製のフルスタックWebアプリフレームワーク「Topcoat」登場。非同期ランタイム「Tokio」上でサーバサイドレンダリング、ルーティング、コンポーネントライブラリなど

Reinforcement Learning

Read original (Publickey) ↗

Publickey · 2026-08-03 JA New Model Releases

クラウドインフラのシェア、AWSが28％と変わらず、Google Cloudは1ポイント上昇して15％に。市場全体が年42％と過去最高の成長率に。2026年第2四半期、Synergy Research

Google Microsoft

Read original (Publickey) ↗

ITmedia AI+ · 2026-08-03 JA New Model Releases extract

「Qwen3.8-Max」登場、オープン化は「来週」　一部「Fable 5」「GPT-5.6 Sol」超えの性能うたう

Alibaba Cloud releases Qwen3.8-Max; open weights due next week

GPT

Alibaba Cloud, part of China's Alibaba, released its large AI model Qwen3.8-Max on Aug 3, claiming it beats Fable 5 and GPT-5.6 Sol on some benchmarks. The company says it will publish the model weights next week, continuing its open-weight strategy.

Read original (ITmedia AI+) ↗

Simon Willison's Weblog · 2026-08-02 EN New Model Releases extract

condense-json 1.0

Simon Willison ships condense-json 1.0 for compact JSON

Simon Willison released condense-json 1.0, a small Python library that shrinks JSON by replacing repeated strings with a short replacements map (e.g. mapping a key to a recurring phrase). Now a year and a half old, it graduates to a stable 1.0 with sensible, non-disruptive fixes.

Read original (Simon Willison's Weblog) ↗

ITmedia AI+ · 2026-08-02 JA New Model Releases extract

OpenAI、次期主力モデル「Astra」の存在を明らかに――未解決の数学問題10件を「解決」と発表

OpenAI unveils 'Astra,' says it solved 10 open math problems

Algorithms & Theory OpenAI

OpenAI disclosed 'Astra,' its next flagship model, saying an internal version produced new results on 10 unsolved problems in mathematics and theoretical computer science—the first public use of the name. Compute cost was held to about $2,000 in 'Sol' terms, and formal proofs via the Lean proof assistant were published on GitHub, signaling potential for advanced math research.

Read original (ITmedia AI+) ↗

Publickey · 2026-08-02 JA New Model Releases extract

アトラシアン、AI時代の仕事用ブラウザ「Diaブラウザ」のWindows版リリースへ。ウェイトリストへの登録開始

Atlassian brings its AI work browser Dia to Windows this fall

Atlassian announced that Dia, its AI-era productivity browser currently available only on Mac, will get a Windows version this fall. A waitlist is now open, extending availability to Windows users.

Read original (Publickey) ↗

Publickey · 2026-08-02 JA New Model Releases extract

日本におけるクラウドネイティブコミュニティの開発者数が約100万人に、CNCFが調査結果を発表

CNCF: Japan's cloud-native developer community nears 1 million

The Cloud Native Computing Foundation and analyst firm SlashData released survey findings estimating that Japan's cloud-native developer community has reached roughly 1 million, underscoring the growing domestic adoption of Kubernetes and related cloud-native technologies.

Read original (Publickey) ↗

Sakana AI Blog (ja) · 2026-08-02 JA New Model Releases extract

Sakana AI、日本語特化のLLM API「Sakana Namazu」を提供開始

Sakana AI launches Namazu, a Japanese-focused OpenAI-compatible LLM API

AI Agents Inference Machine Learning Meta OpenAI

Sakana AI released Namazu, an LLM API tuned for Japanese and local business use. Built on Moonshot AI's open Kimi K2.6 and refined with in-house data, it adds built-in web search and code execution. Being OpenAI-compatible, existing code works by swapping the base_url, filling the gap between costly frontier models and raw open ones.

Read original (Sakana AI Blog (ja)) ↗

Simon Willison's Weblog · 2026-08-02 EN New Model Releases extract

July 2026 newsletter

Simon Willison publishes his latest monthly newsletter

Anthropic Claude DeepSeek GPT Model Context Protocol (MCP)

Developer Simon Willison released the latest edition of his sponsors-only monthly newsletter. It rounds up recent developments across AI models and tooling—spanning GPT, Claude, DeepSeek, Anthropic, and MCP—offering an individual's closely watched view of the fast-moving AI landscape.

Read original (Simon Willison's Weblog) ↗

ITmedia AI+ · 2026-08-02 JA New Model Releases extract

Google、パーソナルAI「Gemini Spark」を日本でも利用可能に　Chrome統合は米国から

Google expands Gemini Spark personal AI to 160+ countries incl. Japan

AI Agents Gemini Google

Google extended its Gemini Spark personal AI agent to more than 160 countries, including Japan. Running on Google's cloud, it can act even when a PC is off or a phone is locked, handling tasks based on triggers. Chrome integration will roll out first in the US.

Read original (ITmedia AI+) ↗

ITmedia AI+ · 2026-08-01 JA New Model Releases extract

OpenAI、アクティブユーザー10億人超に　導入企業は200万社超

OpenAI passes 1 billion active users and 2 million business customers

GPT Inference OpenAI

OpenAI said it surpassed one billion active users and two million business customers. It cited efficiency gains from retained reasoning, better context management, and production optimization that cut costs and improved token throughput, alongside price cuts on some GPT-5.6 models.

Read original (ITmedia AI+) ↗

Simon Willison's Weblog · 2026-08-01 EN New Model Releases extract

datasette-apps 0.2a0

Simon Willison releases datasette-apps 0.2a0

Simon Willison released datasette-apps 0.2a0, an update to the Datasette extension. The release includes changes that improve building and editing Datasette Apps directly within Datasette, continuing steady development of the open-source data-exploration ecosystem.

Read original (Simon Willison's Weblog) ↗

Simon Willison's Weblog · 2026-08-01 EN New Model Releases extract

Ten advances in mathematics and theoretical computer science

Simon Willison weighs in on OpenAI and Anthropic's math results

Anthropic Claude GPT OpenAI

Simon Willison discussed OpenAI's 'ten advances in mathematics and theoretical computer science,' noting that just days earlier Anthropic had reported similar discoveries. The post reflects on a growing trend of frontier AI models contributing to open problems in mathematics.

Read original (Simon Willison's Weblog) ↗

Simon Willison's Weblog · 2026-07-31 EN New Model Releases extract

Stateless MCP has recaptured my interest (and inspired mcp-explorer and datasette-mcp)

Simon Willison: stateless MCP (MCP 2.0) has recaptured my interest

Anthropic Claude Model Context Protocol (MCP) OpenAI Reinforcement Learning

Simon Willison wrote that the rollout of stateless MCP—the MCP 2.0 or 2026-07-28 Model Context Protocol specification—has renewed his interest in the protocol. He says it inspired him to build tools such as mcp-explorer and datasette-mcp on top of the new stateless design.

Read original (Simon Willison's Weblog) ↗

Simon Willison's Weblog · 2026-07-31 EN New Model Releases extract

llm-mcp-client 0.1a0

Simon Willison releases llm-mcp-client 0.1a0

Model Context Protocol (MCP)

Simon Willison released llm-mcp-client 0.1a0, a tool for connecting his LLM utility to Model Context Protocol (MCP) servers. Detailed in an accompanying blog post, the release adds to the growing set of tooling built around the MCP ecosystem.

Read original (Simon Willison's Weblog) ↗

Simon Willison's Weblog · 2026-07-31 EN New Model Releases extract

smevals - a small eval suite for evaluating models, prompts, and harnesses

Simon Willison introduces 'smevals,' a small eval suite for models

Claude GPT Machine Learning Neural Network Software Engineering

Simon Willison introduced smevals, a small evaluation suite for testing models, prompts, and harnesses. Built in collaboration with Jesse Vincent's Prime Radiant applied AI research lab, the framework aims to help answer questions about the capabilities of different AI models.

Read original (Simon Willison's Weblog) ↗

arXiv cs.LG (Machine Learning) · 2026-07-31 EN Inference & Efficiency

GQ-FSL: Green Quantized Federated Split Learning

Neural Network Quantization

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN New Model Releases

Evolving language compositionality in a frequency-structured meaning space

Deep Learning

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

AgentHPOBench: A Benchmark For Evaluating LLM Agents as Sequential Hyperparameter Optimizers

AI Agents Machine Learning Software Engineering

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

The Theoretical Foundation of Socratic Tests: Dynamic, Multimodal, Conversational Examinations

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

When Does On-Policy Interaction Help? Representational Tradeoffs in Value-Based Imitation Learning

Neural Network Reinforcement Learning Robotics

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-31 EN New Model Releases

QASP: Query-Adaptive Robust Vector Search Policy

Inference Retrieval-Augmented Generation (RAG)

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

FriendBench: Benchmarking Dyadic Familiarity Inference in Humans and Multimodal Large Language Models

Inference Neural Network Software Engineering Speech Processing

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN Infrastructure & Hardware

TraceViT: Grounded Trace Supervision for Visual Abstract Reasoning

Neural Network

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

DungeonBench: A Benchmark for Rules-Rich Tactical Reasoning in Dungeons & Dragons Combat

AI Agents Neural Network Retrieval-Augmented Generation (RAG)

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

AMTFV: Agentic Mathematical Tool-Flow Verification for LLM Self-Correction

DeepSeek Gemini GPT Software Engineering

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

ARB: A Matched Authorship-Rewriting Benchmark Dataset for AI-Text Detector Evaluation

GPT Llama Mistral

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-31 EN New Model Releases

A Neurosymbolic Approach for Explainable Early Diagnosis of Alzheimer's Disease

Reinforcement Learning

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

TerraNova: A Foundation Model for the Anthropocene

Embeddings Neural Network Retrieval-Augmented Generation (RAG) Transformer

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

From Code Review to Code Critique: Intent, Drift, and Spotlight for AI-Generated Diffs at Scale

AI Agents Meta Neural Network

Read original (arXiv cs.AI (Artificial Intelligence)) ↗