Infrastructure & Hardware (Page 2 of 8)｜AI/Tech News Trends

arXiv cs.CL (Computation and Language) · 2026-07-31 EN Infrastructure & Hardware

Zero-Mem: Zero-Token Memory Operations for LLM Agents

AI Agents Neural Network Software Engineering

Read original (arXiv cs.CL (Computation and Language)) ↗

Data Center Dynamics · 2026-07-31 EN Infrastructure & Hardware extract

AWS looks to build 800,000 sq ft data center campus at GWU site in Ashburn, Virginia

AWS eyes 800,000 sq ft data center campus in Ashburn, Virginia

AWS is looking to build an 800,000 sq ft data center campus on a former GWU site in Ashburn, Virginia, DatacenterDynamics reported. A county supervisor vowed to use 'every tool available' to fight the development, underscoring growing local pushback against data center expansion.

Read original (Data Center Dynamics) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN Infrastructure & Hardware

DualDiT: A Conditional Dual-Output Diffusion Transformer for Joint OCT Image and Segmentation Mask Generation

Neural Network Reinforcement Learning Transformer

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

Data Center Dynamics · 2026-07-31 EN Infrastructure & Hardware extract

CyrusOne files to develop three buildings for $1.5bn data center campus in Fairfield, Texas

CyrusOne files for $1.5bn, three-building data center campus in Texas

Meta

CyrusOne filed to develop three buildings for a $1.5bn data center campus in Fairfield, Texas. Construction has already begun on two of the buildings, with the third slated to start next month, reflecting the ongoing surge in large-scale data center investment across the US.

Read original (Data Center Dynamics) ↗

Data Center Dynamics · 2026-07-31 EN Multimodal extract

Veolia to operate 350MW gas-powered microgrid for Ohio data center campus

Veolia to run 350MW gas-powered microgrid for Ohio data center

Veolia will operate a 350MW microgrid for an Ohio data center campus, according to DatacenterDynamics. The system will be anchored by natural gas generation and supplemented with a battery energy storage (BESS) unit to provide reliable power for the large facility.

Read original (Data Center Dynamics) ↗

Data Center Dynamics · 2026-07-31 EN Infrastructure & Hardware extract

Legacy Investing buys printing press in Minneapolis, Minnesota, for data center development

Legacy Investing buys Minneapolis printing press for data center

Neural Network

Legacy Investing acquired a printing press site in Minneapolis, Minnesota, for data center development, DatacenterDynamics reported. The firm is planning a 20MW facility despite an ongoing moratorium on new data centers in the city.

Read original (Data Center Dynamics) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

OsteoCAD: A Human-in-the-Loop Cloud-Edge Framework for Bone Tumor Segmentation

Deep Learning Inference Neural Network Reinforcement Learning

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN New Model Releases

CalibratedRubric: Task-Adaptive Rubric Banks for Open-Ended LLM Evaluation

Neural Network Retrieval-Augmented Generation (RAG)

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN Infrastructure & Hardware

Small Is Enough: Per-User Style Rewriting of AI-Edited Text via LoRA Adapters

Inference

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-31 EN Inference & Efficiency

MOSAIC: Masked Outsourcing of Secure AI Computations

Inference Quantization Transformer

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN New Model Releases

Knowing When to Quit: Diagnosing and Training LLMs to Abort Futile Reasoning

Reinforcement Learning

Read original (arXiv cs.CL (Computation and Language)) ↗

ITmedia AI+ · 2026-07-31 JA Infrastructure & Hardware extract

キオクシアQ1決算、純利益は前年比4500％増　AIデータセンター向け需要がけん引

Kioxia Q1 net profit up 4,506% YoY, driven by AI data-center demand

Semiconductor maker Kioxia Holdings said net profit for its fiscal Q1 (April–June 2026, IFRS) reached ¥842.165bn, up 4,506% year on year. The company attributed the jump to surging memory demand from AI data centers.

Read original (ITmedia AI+) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN Multimodal

Faster but Different: Diagnosing and Controlling Content Drift in Accelerated Multimodal Diffusion Language Models

Deep Learning Machine Learning Neural Network

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN Inference & Efficiency

Adjudicated Captioning: Multi-Agent Alignment Scoring and Consensus-Distilled Beam Arbitration for Strict Zero-Shot Image Captioning

Deep Learning Inference Transformer

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN Infrastructure & Hardware

PARALLEL: A Prefrontal-Aligned Reinforcement inspired Approach for Language-Model Learning under Explicit Limits

Deep Learning Machine Learning Neural Network

Read original (arXiv cs.CL (Computation and Language)) ↗

arXiv cs.CL (Computation and Language) · 2026-07-31 EN New Model Releases

Token-Level Diagnosis of Sycophancy in LLMs with Attribution-Guided Steering

Inference Neural Network

Read original (arXiv cs.CL (Computation and Language)) ↗

Cohere Blog · 2026-07-31 EN New Model Releases extract

Cohere signs EU Code of Practice on Transparency of AI-Generated Content

Cohere signs EU Code of Practice on AI content transparency

Neural Network Reinforcement Learning

Cohere said it signed the EU Code of Practice on Transparency of AI-Generated Content, joining other companies committing to clearer labeling and provenance for AI outputs. The move signals alignment with Europe's emerging AI governance framework.

Read original (Cohere Blog) ↗

Simon Willison's Weblog · 2026-07-30 EN Infrastructure & Hardware extract

Advancing the price-performance frontier with GPT‑5.6

OpenAI slashes GPT-5.6 prices: Luna down 80%, Terra down 20%

Anthropic Gemini GPT Inference OpenAI

OpenAI announced steep price cuts for GPT-5.6, with Luna dropping 80% and Terra 20%. The company credits GPT-5.6 Sol for enabling the reduction by optimizing load balancing and even the model's forward pass, the computation that turns inputs into next-token predictions.

Read original (Simon Willison's Weblog) ↗

arXiv cs.CL (Computation and Language) · 2026-07-30 EN Developer Tools

TORUS: A Test of Rendering-Understanding Self-Coherence for Unified Audio Models

Deep Learning Neural Network Software Engineering Speech Processing

Read original (arXiv cs.CL (Computation and Language)) ↗

Simon Willison's Weblog · 2026-07-30 EN New Model Releases extract

llm 0.32rc2

llm 0.32rc2 switches its default model to GPT-5.6 Luna

GPT Machine Learning Neural Network OpenAI Reinforcement Learning from Human Feedback (RLHF)

Simon Willison released llm 0.32rc2, fixing a dependency issue and changing the default model for users who have not set one from GPT-4o mini to the newer, more capable GPT-5.6 Luna. Luna is slightly more expensive but a notable upgrade.

Read original (Simon Willison's Weblog) ↗

NVIDIA Developer Blog · 2026-07-30 EN Infrastructure & Hardware extract

Run High-Performance Core Math at Scale with NVIDIA nvmath-python

NVIDIA introduces nvmath-python for high-performance math at scale

Generative AI NVIDIA

NVIDIA presented nvmath-python, a library bridging the Python scientific community with CUDA-X math libraries. It lets developers run high-performance core math at scale from Python, making GPU acceleration easier to adopt in numerical workloads.

Read original (NVIDIA Developer Blog) ↗

NVIDIA Developer Blog · 2026-07-30 EN Agents & Tool Use extract

Four Ways to Deploy More Secure AI Agents

NVIDIA outlines four ways to deploy more secure AI agents

AI Agents Generative AI NVIDIA

NVIDIA outlined four approaches to deploying AI agents more securely in production, covering access controls, guardrails, and monitoring. The guidance targets security risks that arise as autonomous agents take on real workloads.

Read original (NVIDIA Developer Blog) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN Inference & Efficiency

ReToken: One Token to Improve Vision-Language Models for Visual Retrieval

Computer Vision Embeddings Inference

Read original (arXiv cs.AI (Artificial Intelligence)) ↗

arXiv cs.LG (Machine Learning) · 2026-07-30 EN Inference & Efficiency

MixFrag: Fragility-Guided Mixed-Precision Post-Training Quantization for Vision Transformers

Computer Vision Quantization Retrieval-Augmented Generation (RAG) Reinforcement Learning Transformer

Read original (arXiv cs.LG (Machine Learning)) ↗

arXiv cs.AI (Artificial Intelligence) · 2026-07-30 EN Infrastructure & Hardware

Sample More, Reflect Less: Self-Refine and Reflexion Lose to Repeated Sampling at Equal Token Cost, from 1.5B to 7B

Deep Learning Reinforcement Learning Software Engineering