Allen Institute for AI update hub for open science, research results, and model releases.
AI2 · research · 2026-05-13
Score 14
AIMIP is a new open benchmark and dataset for evaluating AI climate models, showing they can match or beat conventional models on some historical climate metrics while still struggling to generalize reliably to long-term warming trends and...
High signal Matched: benchmark, introducing, model, evaluating
AI2 · research · 2026-05-08
Score 12
EMO is a new mixture-of-experts model trained so modular expert groups emerge from data, enabling users to select small task-specific expert subsets while preserving near full-model performance.
High signal Matched: mixture of experts, performance, model, pretraining
AI2 · research · 2026-04-23
Score 8
OlmoEarth Studio now lets users export custom Earth-observation embeddings from our OlmoEarth foundation models and use them for tasks like similarity search, few-shot mapping, change detection, and unsupervised exploration.
High signal Matched: introducing
AI2 · research · 2026-04-07
Score 12
WildDet3D is an open model that predicts 3D bounding boxes from a single image. It generalizes across cameras and object categories and folds in depth signals when available. It is released alongside a new dataset of verified 3D annotations.
High signal Matched: introducing, model, open model
AI2 · research · 2026-03-18
Score 8
MolmoPoint is a new vision-language model architecture that replaces text-based coordinate outputs with a more natural, token-based pointing mechanism that directly selects regions from visual features.
High signal Matched: model
AI2 · research · 2026-03-05
Score 10
Olmo Hybrid is a fully open 7B language model that combines transformer attention with linear RNN layers to achieve greater expressivity and significantly improved data and compute efficiency compared to pure transformer models.
High signal Matched: introducing, model
AI2 · research · 2026-02-27
Score 8
The Asta Interaction Dataset (AID) contains real researcher queries revealing how scientists actually use AI-powered research tools, and where their habits diverge from what tool builders expect.
High signal Matched: research
AI2 · research · 2026-05-21
Score 0
PointCheck, an independent project, uses Molmo, MolmoWeb, and Olmo 3 to test web accessibility the way a keyboard user would—by navigating real pages and inspecting what's actually on screen.
Watchlist Matched: none
AI2 · research · 2026-05-19
Score 6
OlmoEarth v1.1 is a more efficient family of remote-sensing models that cuts compute costs by up to 3x while maintaining similar performance, making large-scale satellite mapping faster and cheaper to run.
Watchlist Matched: performance
AI2 · research · 2026-05-11
Score 0
Artificial Analysis uses Ai2’s open IFBench eval because it captures a stubborn, real-world capability many benchmarks miss: whether models can reliably follow complex, multi-part user instructions.
Watchlist Matched: eval, benchmarks
AI2 · research · 2026-05-07
Score 6
Ai2 is bringing NSF OMAI compute online to power a fully open AI research ecosystem, turning national infrastructure investment into reusable models, data, methods, and tools that can accelerate scientific discovery.
Watchlist Matched: research
AI2 · research · 2026-05-05
Score 6
MolmoAct 2 is a fully open robotics foundation model that brings faster, stronger 3D action reasoning to real-world robot tasks, alongside a major new bimanual manipulation dataset for researchers to study, reproduce, and build on.
Watchlist Matched: model
AI2 · research · 2026-05-01
Score 0
Interim CEO Peter Clark shares his thoughts on this moment for Ai2, our commitment to open science, and where the institute is headed next.
Watchlist Matched: none
AI2 · research · 2026-04-30
Score 6
AstaBench’s latest update adds new frontier-model results, including GPT-5.5, and highlights growing adoption from groups including the UK AISI, General Reasoning, Elicit, SciSpace, Distyl AI, and EvoScientist.
Watchlist Matched: model, frontier-model
AI2 · research · 2026-04-29
Score 0
MolmoPoint and MolmoWeb extend the Molmo family from visual understanding to visual action, giving researchers open tools for models that can point, navigate, and interact with the world they see.
Watchlist Matched: none
AI2 · research · 2026-04-23
Score 0
OlmPool is a controlled suite of 26 models showing how small architecture choices can compound to make long-context extension much harder, even when training data and extension recipes are held constant.
Watchlist Matched: training, long context, long-context
AI2 · research · 2026-04-22
Score 0
For the past 10 years, Ai2 has built open, real-time tools that help people protect wildlife, oceans, and ecosystems around the world.
Watchlist Matched: none
AI2 · research · 2026-04-20
Score 6
BAR is a recipe for post-training language models one capability at a time—train domain experts independently, merge them into a single mixture-of-experts model, and upgrade any expert without impacting the others.
Watchlist Matched: model, training, post-training
AI2 · research · 2026-04-13
Score 0
Two benchmarks developed at Ai2 – ScienceWorld and DiscoveryWorld – reveal that even very strong AI science agents struggle with problems human scientists solve routinely.
Watchlist Matched: evaluating, benchmarks, agents
AI2 · research · 2026-03-24
Score 6
Introducing MolmoWeb, an open visual web agent that navigates and completes tasks in a browser using screenshots alone, along with MolmoWebMix, the largest public dataset for training web agents.
Watchlist Matched: introducing, training, agent, agents
AI2 · research · 2026-03-23
Score 0
A recap of Ai2's week at NVIDIA GTC 2026, covering panels on open models, live demos of Olmo Hybrid and Asta AutoDiscovery, and conversations on coding agents, hybrid architectures, and robotics.
Watchlist Matched: agents
AI2 · research · 2026-03-11
Score 6
MolmoBot is an open robotic manipulation model suite trained entirely in simulation—demonstrating zero-shot transfer to real-world robots without any real-world data collection or fine-tuning.
Watchlist Matched: model, training, fine-tuning
AI2 · research · 2026-03-11
Score 6
Introducing MolmoBot and MolmoSpaces, an open foundation for training real-world robots to advance science.
Watchlist Matched: introducing, training
AI2 · research · 2026-02-25
Score 0
PreScience is a new benchmark that evaluates whether AI can forecast how science unfolds end-to-end, from team formation through eventual impact.
Watchlist Matched: benchmark
AI2 · research · 2026-02-13
Score 6
Olmix is a framework for language model data mixing that provides empirically grounded defaults and efficient reuse techniques.
Watchlist Matched: model