Frontier AI Watch

ИССЛЕДОВАНИЯ ИИ

Исследования ИИ

Исследовательские статьи, бенчмарки, методы и сигналы оценки в сфере ИИ.

Исследовательская лента AI Briefing из arXiv, Google Research и Google DeepMind.

Материалы поступают из англоязычных RSS-источников. Заголовки и описания показаны на языке оригинала.

Исследования

Новые материалы из исследовательских лент.

Иллюстрация: исследования ИИ
Google Research BlogИсследования

A New Era of Innovation: Google Research at I/O 2026

General Science

Открыть оригинал
Иллюстрация: исследования ИИ
arXiv cs.AI Atom FeedИсследования

Physics Is All You Need? A Case Study in Physicist-Supervised AI Development of Scientific Software

Are AI agents tools, co-authors, or researchers? We present a quantified case study ($N=1$): a physicist supervising an AI coding agent (Claude Code, Sonnet and Opus models) over 12 work days and 57 sessions to build CLAX-PT, a differentiable one-loop perturbation theory module in JAX. We documented and classified 15 supervision events by intervention level.…

Открыть оригинал
Иллюстрация: исследования ИИ
arXiv cs.LG Atom FeedИсследования

DynaFLIP: Rethinking Robotics Perception via Tri-Modal-Dynamics Guided Representation

Robot manipulation critically depends on perception that preserves the action-relevant aspects of a scene. Yet most robot learning pipelines are built upon visual encoders pre-trained for static recognition or vision-language alignment, leaving motion understanding to downstream policies. We introduce DynaFLIP, a dynamics-aware multimodal pre-training…

Открыть оригинал
Иллюстрация: исследования ИИ
arXiv cs.CL Atom FeedИсследования

Unlocking the Working Memory of Large Language Models for Latent Reasoning

To improve the reasoning capabilities of large language models, test-time compute is typically scaled by generating intermediate tokens before the final answer. However, this couples reasoning to autoregressive generation and thereby conflates internal computation with external communication. In contrast, human cognition can use working memory to hold and…

Открыть оригинал
Иллюстрация: исследования ИИ
Microsoft Research AIИсследования

Data Formulator 0.7: AI-powered data analytics for enterprise data

Data Formulator introduces AI-powered analytics for enterprise data workflows. Data teams can easily bring enterprise data into an AI-ready workspace where users can explore, analyze, and visualize data with AI agents to turn raw data into actionable insights. The post Data Formulator 0.7: AI-powered data analytics for enterprise data appeared first on…

Открыть оригинал
Иллюстрация: исследования ИИ
Google DeepMind NewsИсследования

We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks

We’re launching the Google DeepMind Accelerator program in Asia Pacific to tackle environmental risks

Открыть оригинал
Иллюстрация: исследования ИИ
Google Research BlogИсследования

Private analytics via zero-trust aggregation

Security, Privacy and Abuse Prevention

Открыть оригинал
Иллюстрация: исследования ИИ
arXiv cs.AI Atom FeedИсследования

VideoMLA: Low-Rank Latent KV Cache for Minute-Scale Autoregressive Video Diffusion

Long-rollout causal video diffusion has converged on a fixed-size sliding-window KV cache, with recent progress innovating within this layout by changing which tokens occupy the window or how their positions are encoded. The per-head KV layout itself, a dominant contributor to streaming memory and latency, has been mostly left unchanged. In this paper, we…

Открыть оригинал
Иллюстрация: исследования ИИ
arXiv cs.LG Atom FeedИсследования

Efficient Test-Time Finetuning of LLMs via Convex Reconstruction and Gradient Caching

Test-time finetuning (TTFT) is a rapidly evolving paradigm that adapts a language model to each prompt by retrieving related sequences, updating the model on them, and then evaluating the prompt. However, TTFT is only practical if it is fast: selection and finetuning both happen per query, making each a direct bottleneck. Existing methods trade speed for…

Открыть оригинал
Иллюстрация: исследования ИИ
arXiv cs.CL Atom FeedИсследования

Locally Coherent, Globally Incoherent: Bounding Compositional Incoherence in Multi-Component LLM Agents

Multi-component LLM agents assemble probabilistic claims from components that each see only part of a joint problem; the composition can violate basic probability axioms even when every component is locally coherent. We formalise this locally coherent, globally incoherent failure via the compositional residual eps*, the L2 distance from the composed quote to…

Открыть оригинал
Иллюстрация: исследования ИИ
Microsoft Research AIИсследования

Extending Human Intelligence Through AI

Understanding AI as an extension of human intelligence—not a replacement for it—offers a more grounded path for building trustworthy AI systems. The post Extending Human Intelligence Through AI appeared first on Microsoft Research .

Открыть оригинал
Иллюстрация: исследования ИИ
Google DeepMind NewsИсследования

Fast-tracking genetic leads to reverse cellular aging

Biologists use Co-Scientist to find novel factors that successfully rejuvenate human cells.

Открыть оригинал
Иллюстрация: исследования ИИ
Google Research BlogИсследования

Empirical Research Assistance (ERA): From Nature publication to catalyzing Computational Discovery

General Science

Открыть оригинал
Иллюстрация: исследования ИИ
arXiv cs.AI Atom FeedИсследования

LLMSurgeon: Diagnosing Data Mixture of Large Language Models

The pretraining data mixture of Large Language Models (LLMs) constitutes their "digital DNA", shaping model behaviors, capabilities, and failure modes. Yet this composition is rarely disclosed, making post-hoc auditing of data combination or provenance difficult. In this work, we formalize $\textbf{Data Mixture Surgery (DMS)}$: given only generated text from…

Открыть оригинал
Иллюстрация: исследования ИИ
arXiv cs.LG Atom FeedИсследования

Fairness-Aware Federated Learning with Trajectory Shapley Value

Federated learning is an emerging distributed paradigm that addresses the challenges posed by heterogeneous, privacy-sensitive data. It enables multiple clients to train a model collaboratively by aggregating their local updates at a server. However, conventional aggregation schemes typically use fixed weights that fail to reflect unequal and time-varying…

Открыть оригинал
Иллюстрация: исследования ИИ
arXiv cs.CL Atom FeedИсследования

Demystifying Data Organization for Enhanced LLM Training

Large Language Models (LLMs) have revolutionized various fields, yet their training efficiency is heavily reliant on effective data curation. While data selection has been widely studied, the strategic data organization for enhanced training remains an underexplored area, particularly since current LLMs are often trained for only one or a few epochs. This…

Открыть оригинал
Иллюстрация: исследования ИИ
Microsoft Research AIИсследования

MagenticLite, MagenticBrain, Fara1.5: An agentic experience optimized for small models

MagenticLite is an agentic system for small models that works across the browser and local file system in a single workflow. It combines specialized models and orchestration to support efficient agentic performance on everyday tasks. The post MagenticLite, MagenticBrain, Fara1.5: An agentic experience optimized for small models appeared first on Microsoft…

Открыть оригинал
Иллюстрация: исследования ИИ
Google DeepMind NewsИсследования

Simulate real-world places with Project Genie and Street View

We’re expanding access to Google AI Ultra subscribers globally and introducing a new capability powered by Street View.

Открыть оригинал