新モデル・リリース｜AI/Tech動向まとめ

Publickey · 2026-08-03 JA 新モデル・リリース

Rust製のフルスタックWebアプリフレームワーク「Topcoat」登場。非同期ランタイム「Tokio」上でサーバサイドレンダリング、ルーティング、コンポーネントライブラリなど

強化学習

元記事を読む (Publickey) ↗

Publickey · 2026-08-03 JA 新モデル・リリース

クラウドインフラのシェア、AWSが28％と変わらず、Google Cloudは1ポイント上昇して15％に。市場全体が年42％と過去最高の成長率に。2026年第2四半期、Synergy Research

Google Microsoft

元記事を読む (Publickey) ↗

ITmedia AI+ · 2026-08-03 JA 新モデル・リリース抜粋

「Qwen3.8-Max」登場、オープン化は「来週」　一部「Fable 5」「GPT-5.6 Sol」超えの性能うたう

Alibaba Cloud、AIモデル「Qwen3.8-Max」を公開、重みは来週

GPT

中国Alibaba傘下のAlibaba Cloudが8月3日、大規模AIモデル「Qwen3.8-Max」を正式リリースした。一部ベンチマークで「Fable 5」や「GPT-5.6 Sol」を上回る性能をうたう。モデルの重みは来週公開予定で、オープン化を進める方針を示した。

元記事を読む (ITmedia AI+) ↗

Simon Willison's Weblog · 2026-08-02 EN 新モデル・リリース抜粋

condense-json 1.0

Simon Willison、JSONを圧縮する「condense-json 1.0」を公開

Simon Willison氏が、繰り返し出現する文字列を置換マップに置き換えてJSONを圧縮する小さなライブラリ「condense-json」のバージョン1.0を公開した。1年半前から存在するライブラリに非破壊的な修正を加え、安定版として正式リリースしたもの。

元記事を読む (Simon Willison's Weblog) ↗

ITmedia AI+ · 2026-08-02 JA 新モデル・リリース抜粋

OpenAI、次期主力モデル「Astra」の存在を明らかに――未解決の数学問題10件を「解決」と発表

OpenAI、次期主力モデル「Astra」公表──未解決の数学問題10件を「解決」

アルゴリズム・理論 OpenAI

OpenAIは次期主力モデル「Astra」の社内版で、数学や理論計算機科学の未解決問題10件について新たな結果を得たと発表した。「Astra」の名称公表は初とみられる。計算コストは「Sol」換算で約2000ドルに抑え、証明支援系「Lean」による形式証明もGitHubで公開し、高度な数学研究への応用可能性を示した。

元記事を読む (ITmedia AI+) ↗

Publickey · 2026-08-02 JA 新モデル・リリース抜粋

アトラシアン、AI時代の仕事用ブラウザ「Diaブラウザ」のWindows版リリースへ。ウェイトリストへの登録開始

アトラシアン、AI仕事用ブラウザ「Dia」のWindows版を今秋リリースへ

アトラシアンは、現在Mac版のみ提供しているAI時代の仕事用ブラウザ「Dia」のWindows版を今秋リリースすると発表した。すでにウェイトリストの登録受付を開始しており、Windowsユーザーへの提供拡大を進める。

元記事を読む (Publickey) ↗

Publickey · 2026-08-02 JA 新モデル・リリース抜粋

日本におけるクラウドネイティブコミュニティの開発者数が約100万人に、CNCFが調査結果を発表

CNCF調査、日本のクラウドネイティブ開発者が約100万人に

CNCFと調査会社SlashDataは、日本国内のクラウドネイティブ技術の開発者数が約100万人に達したとの調査結果を発表した。Kubernetesをはじめとするクラウドネイティブ技術の国内での普及拡大を示す内容となっている。

元記事を読む (Publickey) ↗

Sakana AI Blog (ja) · 2026-08-02 JA 新モデル・リリース抜粋

Sakana AI、日本語特化のLLM API「Sakana Namazu」を提供開始

Sakana AI、日本語特化LLM「Namazu」をOpenAI互換APIで提供開始

AI エージェント推論 (Inference) 機械学習 Meta OpenAI

Sakana AIが、日本語と日本の商習慣に特化したLLM API「Sakana Namazu」の提供を開始した。Sakana Chat搭載モデルを更新したもので、Moonshot AIのオープンモデル「Kimi K2.6」をベースに社内データで日本語・業務文脈への適合を進めた。Web検索とコード実行のビルトインツールを備え、OpenAI互換のためbase_urlの変更だけで既存コードから利用できる。高コストなフロンティアモデルと素のオープンモデルの中間を埋める選択肢として位置づける。

元記事を読む (Sakana AI Blog (ja)) ↗

Simon Willison's Weblog · 2026-08-02 EN 新モデル・リリース抜粋

July 2026 newsletter

Simon Willison、月刊ニュースレター最新号を公開

Anthropic Claude DeepSeek GPT Model Context Protocol (MCP)

開発者Simon Willison氏が、スポンサー向けの月刊ニュースレター最新号を公開した。GPTやClaude、DeepSeek、Anthropic、MCPなど、最近のAIモデルやツールを巡る動向をまとめている。個人によるAI業界ウォッチとして注目される内容だ。

元記事を読む (Simon Willison's Weblog) ↗

ITmedia AI+ · 2026-08-02 JA 新モデル・リリース抜粋

Google、パーソナルAI「Gemini Spark」を日本でも利用可能に　Chrome統合は米国から

Google、パーソナルAI「Gemini Spark」を日本含む160カ国以上に拡大

AI エージェント Gemini Google

Googleは、パーソナルAIエージェント「Gemini Spark」の提供を日本を含む160カ国以上に拡大した。PC停止時やスマホのロック時もGoogleのクラウド基盤上で動作し、トリガーに応じてタスクを自動処理する。Chrome統合は米国から先行提供される。

元記事を読む (ITmedia AI+) ↗

ITmedia AI+ · 2026-08-01 JA 新モデル・リリース抜粋

OpenAI、アクティブユーザー10億人超に　導入企業は200万社超

OpenAI、アクティブユーザー10億人・導入企業200万社を突破

GPT 推論 (Inference) OpenAI

OpenAIは、アクティブユーザーが10億人、導入企業が200万社を超えたと公表した。推論の保持やコンテキスト管理の改善、本番ソフトウェアの最適化によりコスト削減とトークン生成効率の向上を実現し、GPT-5.6の一部モデルは値下げした。

元記事を読む (ITmedia AI+) ↗

Simon Willison's Weblog · 2026-08-01 EN 新モデル・リリース抜粋

datasette-apps 0.2a0

Simon Willison、datasette-apps 0.2a0をリリース

Simon Willison氏は、Datasetteの拡張「datasette-apps」バージョン0.2a0をリリースした。Datasette上でアプリを作成・編集する際の使い勝手を改善する変更が含まれている。オープンソースのデータ探索ツールの機能拡充が続いている。

元記事を読む (Simon Willison's Weblog) ↗

Simon Willison's Weblog · 2026-08-01 EN 新モデル・リリース抜粋

Ten advances in mathematics and theoretical computer science

Simon Willison、OpenAI・Anthropicの数学的成果を論評

Anthropic Claude GPT OpenAI

Simon Willison氏が、OpenAIの「数学・理論計算機科学における10の進展」を取り上げた記事。数日前にはAnthropicも同様の発見を報告していたと触れ、フロンティアAIが数学の未解決問題に相次いで貢献し始めている流れを論じている。

元記事を読む (Simon Willison's Weblog) ↗

Simon Willison's Weblog · 2026-07-31 EN 新モデル・リリース抜粋

Stateless MCP has recaptured my interest (and inspired mcp-explorer and datasette-mcp)

Simon Willison、ステートレスMCP（MCP 2.0）への関心を再燃と語る

Anthropic Claude Model Context Protocol (MCP) OpenAI 強化学習

Simon Willison氏は、2026年7月28日に公開されたModel Context Protocolの新仕様、いわゆるステートレスMCP（MCP 2.0）のロールアウトに関心を再び高めたと述べた。これに触発され、mcp-explorerやdatasette-mcpといったツールの開発にも取り組んでいるという。

元記事を読む (Simon Willison's Weblog) ↗

Simon Willison's Weblog · 2026-07-31 EN 新モデル・リリース抜粋

llm-mcp-client 0.1a0

Simon Willison、llm-mcp-client 0.1a0をリリース

Model Context Protocol (MCP)

Simon Willison氏は、LLMツールからModel Context Protocol（MCP）サーバーを利用するための「llm-mcp-client」バージョン0.1a0をリリースした。ブログで詳細を紹介しており、MCPエコシステムに対応するツールの整備が進んでいる。

元記事を読む (Simon Willison's Weblog) ↗

Simon Willison's Weblog · 2026-07-31 EN 新モデル・リリース抜粋

smevals - a small eval suite for evaluating models, prompts, and harnesses

Simon Willison、モデル評価用の小型スイート「smevals」を紹介

Claude GPT 機械学習ニューラルネットワークソフトウェア工学

Simon Willison氏は、モデルやプロンプト、実行ハーネスを評価するための小型評価スイート「smevals」を紹介した。Jesse Vincent氏のPrime Radiant応用AI研究ラボと協力して構築しており、異なるモデルの能力を検証する疑問に答えるためのフレームワークだという。

元記事を読む (Simon Willison's Weblog) ↗

arXiv cs.LG (Machine Learning) · 2026-07-31 EN 推論・効率化

GQ-FSL: Green Quantized Federated Split Learning

ニューラルネットワーク量子化

元記事を読む (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN 新モデル・リリース

Evolving language compositionality in a frequency-structured meaning space

深層学習

元記事を読む (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN 新モデル・リリース

AgentHPOBench: A Benchmark For Evaluating LLM Agents as Sequential Hyperparameter Optimizers

AI エージェント機械学習ソフトウェア工学

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN 新モデル・リリース

The Theoretical Foundation of Socratic Tests: Dynamic, Multimodal, Conversational Examinations

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN 新モデル・リリース

When Does On-Policy Interaction Help? Representational Tradeoffs in Value-Based Imitation Learning

ニューラルネットワーク強化学習ロボティクス

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-31 EN 新モデル・リリース

QASP: Query-Adaptive Robust Vector Search Policy

推論 (Inference) 検索拡張生成 (RAG)

元記事を読む (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN 新モデル・リリース

FriendBench: Benchmarking Dyadic Familiarity Inference in Humans and Multimodal Large Language Models

推論 (Inference) ニューラルネットワークソフトウェア工学音声処理

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN インフラ・ハードウェア

TraceViT: Grounded Trace Supervision for Visual Abstract Reasoning

ニューラルネットワーク

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN 新モデル・リリース

DungeonBench: A Benchmark for Rules-Rich Tactical Reasoning in Dungeons & Dragons Combat

AI エージェントニューラルネットワーク検索拡張生成 (RAG)

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN 新モデル・リリース

AMTFV: Agentic Mathematical Tool-Flow Verification for LLM Self-Correction

DeepSeek Gemini GPT ソフトウェア工学

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN 新モデル・リリース

ARB: A Matched Authorship-Rewriting Benchmark Dataset for AI-Text Detector Evaluation

GPT Llama Mistral

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-31 EN 新モデル・リリース

A Neurosymbolic Approach for Explainable Early Diagnosis of Alzheimer's Disease

強化学習

元記事を読む (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN 新モデル・リリース

TerraNova: A Foundation Model for the Anthropocene

埋め込み (Embeddings) ニューラルネットワーク検索拡張生成 (RAG) Transformer

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN 新モデル・リリース

From Code Review to Code Critique: Intent, Drift, and Spotlight for AI-Generated Diffs at Scale

AI エージェント Meta ニューラルネットワーク

元記事を読む (arXiv cs.AI (Artificial Intelligence)) ↗