NVIDIA Dynamo · open-source · 2026-06-03
Full-Stack Optimizations for Agentic Inference
No feed summary available yet.
High signal Matched: inference, agentic
NVIDIA Dynamo · open-source · 2026-06-03
No feed summary available yet.
High signal Matched: inference, agentic
VESSL AI · korea · 2026-06-03
No feed summary available yet.
High signal Matched: gpu, agent
NVIDIA Dynamo · open-source · 2026-06-03
No feed summary available yet.
High signal Matched: agentic
Nebius · cloud · 2026-06-03
No feed summary available yet.
High signal Matched: cloud, agent
FuriosaAI · hardware · 2026-06-03
No feed summary available yet.
High signal Matched: inference, generation, agentic
LightSeek Foundation · research · 2026-06-03
No feed summary available yet.
High signal Matched: inference, kernel, performance, agentic
FriendliAI · inference-infra · 2026-06-03
No feed summary available yet.
High signal Matched: model, agentic
Anthropic · model-lab · 2026-06-03
No feed summary available yet.
High signal Matched: introducing, tool use
Cohere · model-lab · 2026-06-03
No feed summary available yet.
High signal Matched: performance, agentic
Upstage · korea · 2026-06-03
No feed summary available yet.
High signal Matched: upstage, agent
AWS Machine Learning Blog · cloud · 2026-06-03
This post walks through how Baz built their Spec Review agent using Amazon Bedrock and Amazon Bedrock AgentCore. We'll cover the architecture decisions, implementation details, and the business outcomes they achieved by leveraging these AW...
High signal Matched: bedrock, agent
NVIDIA Technical Blog · hardware · 2026-06-02
AI agents are a powerful tool for synthesizing data to accelerate research, summarize information, and help teams make decisions faster. But combining internal...
High signal Matched: research, agent, agents
AWS Machine Learning Blog · cloud · 2026-06-02
GPT-5.5, GPT-5.4, and Codex are now generally available on Amazon Bedrock. Deploy them in production applications and agents today, on Bedrock’s high-performance inference engine.
High signal Matched: inference, performance, bedrock, agents
AWS Machine Learning Blog · cloud · 2026-06-02
While deploying Model Context Protocol (MCP) servers in production, enterprises need fine-grained access control across servers, observability into which teams use which tools, security guarantees against data exfiltration, and centralized...
High signal Matched: model, bedrock, mcp
AWS Machine Learning Blog · cloud · 2026-06-02
In this post, we use a lakehouse data agent to demonstrate how you can use Policy for deterministic access control and Lambda interceptors for dynamic validation. We then show how to combine Lambda interceptors and Policy to implement a ge...
High signal Matched: bedrock, agent, agents
AWS Machine Learning Blog · cloud · 2026-06-02
In this post, we cover several key risks that surface when designing an agentic payment system, and show how to mitigate them with the capabilities of AgentCore payments.
High signal Matched: bedrock, agentic
AWS Machine Learning Blog · cloud · 2026-06-02
When you build agentic AI solutions, you face unique operational challenges. Agents make unpredictable decisions, costs spiral unexpectedly, and debugging non-deterministic failures seems impossible. Agentic AI applications don't just exec...
High signal Matched: bedrock, agents, agentic
vLLM Project · open-source · 2026-06-02
Long-horizon LLM agents create a routing problem that single-turn prompt routers were not designed to solve. A router still needs to know which model is best for the current request, but it also...
High signal Matched: router, model, agents, agentic
NVIDIA Technical Blog · hardware · 2026-06-01
The rise of autonomous, long-running AI agents has introduced a new class of compute demand, namely tasks that maintain large context windows, spawn concurrent...
High signal Matched: multi-node, agents
Lambda · cloud · 2026-06-01
When we design large GPU clusters, the network is no longer a background system. It's part of the compute envelope. At the 800G and NVIDIA GB300 NVL72 scale, the back-end fabric accounts for 86% of networking power in a three-layer cluster...
High signal Matched: generation, token generation, throughput, infiniband, gpu, model, retrieval, agentic
NVIDIA Technical Blog · hardware · 2026-06-01
Each wave of AI has created a new scaling law. Pretraining scaled intelligence through larger datasets, more parameters, and massively parallel GPU systems....
High signal Matched: gpu, pretraining, agentic
AMD ROCm Blogs · hardware · 2026-06-01
Reinforcement learning (RL) is rapidly becoming a foundational technology for Large Language Models (LLMs)—powering key abilities such as reasoning and agentic behaviors. As RL workloads grow more complex and computationally intensive, the...
High signal Matched: performance, gpu, agentic
AWS Machine Learning Blog · cloud · 2026-05-29
This post combines learnings from LangChain’s work on evaluating deep agents and Anthropic’s guide to demystifying evals for AI agents into a practical guide. In this post, you will learn how to: 1) apply five evaluation patterns for deep...
High signal Matched: evaluation, bedrock, evals, evaluating, agent, agents
AWS Machine Learning Blog · cloud · 2026-05-29
Datasets in AgentCore is in public preview. Agent evaluation is most powerful when you combine fast-moving online signals with stable offline baselines. To understand whether your agent is truly improving over time, you need a fixed benchm...
High signal Matched: benchmark, evaluation, bedrock, agent
AWS Machine Learning Blog · cloud · 2026-05-29
This post covers Opus 4.8's improvements and practical guidance for AI engineers integrating the model into agentic systems and production inference workloads on Amazon Bedrock.
High signal Matched: inference, model, bedrock, agentic
PyTorch Foundation · open-source · 2026-05-28
TL;DR: The TokenSpeed inference engine achieved a record-breaking 580 tps running the Qwen3.5-397B-A17B model on GPUs. This extreme performance for agentic workloads is driven by systematic elimination of memory copies,...
High signal Matched: inference, performance, gpu, model, agentic
vLLM Project · open-source · 2026-05-28
As organizations increasingly adopt AI-powered development tools, the need for high-performance agentic models that deliver both accuracy and operational efficiency has become critical. Laguna...
High signal Matched: inference, performance, agentic
Modal · inference-infra · 2026-05-27
Introducing Role-Based Access Control for humans and agents, now available for all users on Teams and Enterprise plans.
High signal Matched: introducing, agents
NVIDIA Technical Blog · hardware · 2026-05-20
Agent harnesses like Claude Code, Codex, and LangChain Deep Agents are excellent orchestrators. They manage sessions, chain tools, execute code, and respond to...
High signal Matched: research, agent, agents
NVIDIA Technical Blog · hardware · 2026-05-19
Autonomous AI agents are becoming more capable. Open models, Model Context Protocol (MCP)-connected tools, and portable skills are also making agents easier to...
High signal Matched: model, agent, agents, mcp
NVIDIA Technical Blog · hardware · 2026-05-19
Evaluating an AI model and evaluating an AI agent are related—but they answer fundamentally different questions. A model benchmark tests the capability of a...
High signal Matched: benchmark, model, evaluation, evaluating, agent, agentic
Together AI · inference-infra · 2026-05-19
Real-world inference benchmarks for coding agents: 31% more TPS than TensorRT-LLM, 2× better TTFT at saturation, and 76% lower cost than Claude Opus 4.6.
High signal Matched: inference, ttft, cost, benchmarks, agents
Modal · inference-infra · 2026-05-19
No feed summary available yet.
High signal Matched: introducing, agents
NVIDIA Technical Blog · hardware · 2026-05-14
Agentic inference has fundamentally changed the runtime dynamics of inference workloads by introducing non-deterministic trajectories—actions, observations,...
High signal Matched: inference, introducing, agentic
LMCache · open-source · 2026-05-13
A practitioner’s guide to KV-cache tiering on ROCm: what works, what doesn’t, and the regime where it actually matters. Key Summary: We benchmarked multi-turn agentic workloads using 739 anonymized Claude Code conversation trac...
High signal Matched: lmcache, moe, mi300x, rocm, fp8, agentic
Nota AI · korea · 2026-05-11
Jaehoon Lee, Technical Content Manager, Nota AI. NetsPresso® now embraces AI agents. An easy-to-use interface sits on top of the validated pipeline that handles everything from model compression to device deployment. When a user...
High signal Matched: inference, endpoint, kernel, verification, moe, benchmark, latency, cost, gpu, release, model, evaluation, quantization, quantized, int4, evaluate, benchmarks, swe-bench, mmlu, agent, agents, api
BAIR · research · 2026-05-08
No feed summary available yet.
High signal Matched: inference, decoding, prefill, generation, serve, throughput, kv cache, verification, performance, latency, cost, model, paper, research, evaluation, training, pretraining, sft, benchmarks, long context, context window, agentic, reasoning model
NVIDIA Technical Blog · hardware · 2026-05-08
Bash is one of the most flexible and powerful interfaces exposed to AI agents. In the right system, a model that emits grep, curl, tar, or a shell pipeline is...
High signal Matched: decoding, generation, model, agents
vLLM Project · open-source · 2026-05-06
TL;DR: Agentic workloads generate massive shared prefixes that are often recomputed across turns. By integrating Mooncake's distributed KV cache store into vLLM, we achieve 3.8x higher throughput,...
High signal Matched: serving, throughput, distributed, kv cache, agentic
NVIDIA Technical Blog · hardware · 2026-05-05
The automotive cockpit is undergoing a fundamental shift from rule-based interfaces to agentic, multimodal AI systems capable of reasoning, planning, and...
High signal Matched: cloud, agents, agentic
NVIDIA Technical Blog · hardware · 2026-04-30
NVIDIA CUDA Tile (cuTile) is a tile-based programming model that enables developers to write GPU kernels in terms of tile-level operations—loads, stores, and...
High signal Matched: kernel, cuda, gpu, model, agents
Nota AI · korea · 2026-04-29
Hancheol Park, Ph.D., AI Research Engineer, NetsPresso Tech, Nota AI; Geonmin Kim, Ph.D., AI Research Engineer, NetsPresso Tech, Nota AI; Geonho Lee, Edge AI Engineer Intern, NetsPresso Tech, Nota AI; Jaehoon Lee, Technical Content Manager,...
High signal Matched: generation, moe, performance, model, weights, paper, research, evaluation, korea, korean, seoul, naver, training, fine-tuning, quantization, agent, agents, agentic
Together AI · inference-infra · 2026-04-29
DeepSeek-V4 Pro is now available on Together AI with 512K context, controllable reasoning modes, and cached-input pricing for long-context reasoning workloads like code agents, document intelligence, and research synthesis.
High signal Matched: research, long-context, agents
Hugging Face · open-source · 2026-04-29
No feed summary available yet.
High signal Matched: introducing, long-context, agents
NVIDIA Technical Blog · hardware · 2026-04-28
Agentic systems often reason across screens, documents, audio, video, and text within a single perception‑to‑action loop. However, they still rely on...
High signal Matched: model, open model, agent, agentic
Together AI · inference-infra · 2026-04-28
NVIDIA Nemotron 3 Nano Omni is now on Together AI: a single open model that reasons across video, images, audio, and text, built for agentic workloads at scale.
High signal Matched: model, open model, agentic
vLLM Project · open-source · 2026-04-28
We are excited to support the newly released NVIDIA Nemotron 3 Nano Omni model on vLLM.
High signal Matched: model, agentic
Sakana AI · model-lab · 2026-04-24
No feed summary available yet.
High signal Matched: model, agent
NVIDIA Technical Blog · hardware · 2026-04-20
AI tools are significantly accelerating software development and changing how developers work with code. These tools serve as real-time copilots, automating...
High signal Matched: serve, agents, agentic
NVIDIA Technical Blog · hardware · 2026-04-17
Coding agents are starting to write production code at scale. Stripe’s agents generate 1,300+ PRs per week. Ramp attributes 30% of merged PRs to agents....
High signal Matched: inference, agents, agentic
Modal · inference-infra · 2026-04-14
Autoresearch automates AI research. Modal automates AI infrastructure.
High signal Matched: research, agents
NVIDIA Technical Blog · hardware · 2026-04-12
The release of MiniMax M2.7 adds enhancements to the popular MiniMax M2.5 model, built for agentic harnesses,...
High signal Matched: release, model, agentic
SkyPilot · open-source · 2026-04-10
With the SkyPilot Agent Skill, your AI coding agent can launch clusters, run training jobs and manage cloud resources across any infrastructure using natural language.
High signal Matched: launch, cloud, training, agent, agents
Google Research · big-tech · 2026-04-09
Generative AI
High signal Matched: introducing, agents
SkyPilot · open-source · 2026-04-09
Coding agents working from code alone generate shallow hypotheses. Adding a research phase — arxiv papers, competing forks, other backends — produced 5 kernel fusions that made llama.cpp CPU inference 15% faster.
High signal Matched: inference, kernel, arxiv, research, agent, agents
LMCache · open-source · 2026-04-04
Modern LLM serving workloads are defined by strict latency requirements, high concurrency, and rapidly growing context lengths. Applications such as multi-turn chat, AI agents, and retrieval-augmented generation continuously build on prior...
High signal Matched: inference, serving, decoding, generation, throughput, lmcache, moe, performance, latency, ttft, retrieval-augmented generation, retrieval, agents
Together AI · inference-infra · 2026-04-02
Production STT and TTS from Deepgram, available on Together AI Dedicated Model Inference for real-time voice agents.
High signal Matched: inference, model, agents
Nota AI · korea · 2026-03-31
Jaehoon Lee, Technical Content Manager, Nota AI. In March, a single official announcement from Google Research rocked trillions of won in the market capitalization of U.S. infrastructure and semiconductor stocks. The catalyst:...
High signal Matched: inference, serving, generation, throughput, kv cache, benchmark, performance, cost, b200, blackwell, introducing, model, fp8, research, training, fine-tuning, quantization, quantized, agent, agentic, frontier model
Nota AI · korea · 2026-03-23
Jaehoon Lee, Technical Content Manager, Nota AI. GTC has evolved far beyond a technology conference, drawing attention from global economies and financial markets alike. This year, CEO Jensen Huang took the stage in his tradema...
High signal Matched: inference, prefill, generation, throughput, cuda, kv cache, performance, latency, cost, gpu, npu, launch, model, research, cloud, training, long-context, context window, agent, agents, agentic, open-source
SkyPilot · open-source · 2026-03-19
Karpathy's autoresearch runs one experiment at a time. We gave it access to our GPU infra and let it run experiments in parallel.
High signal Matched: gpu, agent
Hugging Face · open-source · 2026-03-17
No feed summary available yet.
High signal Matched: throughput, agent, computer use
NVIDIA Technical Blog · hardware · 2026-03-16
Reasoning models are growing rapidly in size and are increasingly being integrated into agentic AI workflows that interact with other models and external tools....
High signal Matched: inference, multi-node, agentic
NVIDIA Technical Blog · hardware · 2026-03-16
AI‑native organizations increasingly face scaling challenges as agentic AI workflows drive context windows to millions of tokens and models scale toward...
High signal Matched: introducing, agentic
Together AI · inference-infra · 2026-03-16
Together AI arrives at NVIDIA GTC 2026 with new launches in inference, agents, voice AI, and open models — plus technical sessions from its research and engineering leaders.
High signal Matched: inference, research, agents
Together AI · inference-infra · 2026-03-12
Build real-time voice agents on Together AI with co-located STT, LLM, and TTS infrastructure, native Deepgram and Cartesia support, and end-to-end latency under 500ms.
High signal Matched: latency, agents
SqueezeBits · korea · 2026-03-11
Explore why Physical AI deployment needs synthetic data at scale with SqueezeBits' research and discover how to overcome inference bottlenecks to accelerate RoBoost Agent.
High signal Matched: inference, research, agent
Together AI · inference-infra · 2026-03-11
NVIDIA Nemotron 3 Super is now available on Together AI Dedicated Inference, delivering efficient multi-agent reasoning, a 1M-token context window, and production-grade deployment on managed infrastructure.
High signal Matched: inference, context window, agent
vLLM Project · open-source · 2026-03-11
We are excited to support the newly released NVIDIA Nemotron 3 Super model on vLLM.
High signal Matched: model, agent
SkyPilot · open-source · 2026-02-27
OpenClaw gives an AI agent full access to your system. Here's why you should run it on an isolated cloud VM, and how to set that up.
High signal Matched: cloud, agent
SqueezeBits · korea · 2026-02-25
Scaling Physical AI requires reliable synthetic data. Learn how RoBoost Agent integrates NVIDIA Cosmos to transform world models into trustworthy data engines for robotics and autonomous driving.
High signal Matched: agent
Together AI · inference-infra · 2026-01-26
Introducing DSGym, a holistic evaluation and training framework for LLM-based data science agents. Features 90+ bioinformatics tasks, 92 Kaggle competitions, and synthetic trajectory generation. Our 4B model achieves state-of-the-art perform...
High signal Matched: generation, performance, introducing, model, evaluation, training, evaluating, agents, open-source
Together AI · inference-infra · 2026-01-13
Together AI teamed with Cursor to build the real-time inference stack that keeps in-editor agents fast and reliable. They productionized NVIDIA Blackwell (B200/GB200), tuning ARM hosts, kernels, and FP4/TensorRT quantization for low latenc...
High signal Matched: inference, latency, b200, gb200, blackwell, model, quantization, agents
vLLM Project · open-source · 2025-12-15
Jan 28th Update: NVIDIA just released their Nemotron 3 Nano model in NVFP4 precision. This model is supported by vLLM out of the box and it uses a new method called Quantization-Aware Distillation...
High signal Matched: model, quantization, agents
Together AI · inference-infra · 2025-12-03
Build, train, and deploy advanced AI agents with integrated reinforcement learning on the Together platform.
High signal Matched: cloud, agents
Together AI · inference-infra · 2025-11-04
Together AI launches the fastest voice AI stack: streaming Whisper STT, serverless open-source TTS (Orpheus & Kokoro), and Voxtral transcription. Sub-second latency for production voice agents.
High signal Matched: inference, latency, agents, open-source
Hugging Face · open-source · 2025-10-23
No feed summary available yet.
High signal Matched: introducing, agent
Google Research · big-tech · 2025-09-25
Generative AI
High signal Matched: research, agent
Together AI · inference-infra · 2025-08-21
Build AI agents for complex, long-running engineering tasks. Learn key patterns from a case study: accelerating LLM inference with speculative decoding.
High signal Matched: inference, decoding, speculative decoding, agents
Hugging Face · open-source · 2025-08-18
No feed summary available yet.
High signal Matched: research, mcp
SkyPilot · open-source · 2025-08-12
Your AI writes code. Now what? If you’re building AI agents in 2025, you probably wondered that as well. Your LLM generates some Python code that analyzes data, manipulates files, or calls APIs. But where does it run? Most people eit...
High signal Matched: cloud, agent, agents, open-source
Together AI · inference-infra · 2025-07-25
Unlock agentic coding with Qwen3-Coder on Together AI: 256K context, SWE-bench rivaling Claude Sonnet 4, zero-setup instant deployment.
High signal Matched: model, swe-bench, agentic
Together AI · inference-infra · 2025-07-14
Run Kimi K2 (1T params) on Together AI—frontier open model for agentic reasoning and coding, serverless deployment, 99.9% SLA, lower cost and instant scaling.
High signal Matched: cost, model, open model, agentic, open-source
BAIR · research · 2025-07-01
No feed summary available yet.
High signal Matched: inference, generation, performance, model, paper, arxiv, evaluation, training, evaluate, agent, agents
Hugging Face · open-source · 2025-06-06
No feed summary available yet.
High signal Matched: evaluation, agents
AIBrix · open-source · 2025-02-21
Open-source large language models (LLMs) such as LLaMA, DeepSeek, Qwen, and Mistral have surged in popularity, offering enterprises greater flexibility, cost savings, and control over their AI deployments. These models have empowered organ...
High signal Matched: inference, generation, latency, cost, introducing, model, agents, open-source
AIBrix · open-source · 2025-02-19
We’re excited to announce the v0.2.0 release of AIBrix! Building on feedback from v0.1.0 production adoption and user interest, this release introduces several new features to enhance performance and usability. Extend the vLLM Prefix...
High signal Matched: inference, serving, prefill, throughput, distributed, multi-node, kv cache, prefix cache, performance, cost, gpu, accelerator, release, agent
Hugging Face · open-source · 2025-02-04
No feed summary available yet.
High signal Matched: benchmark, agent
Modular · inference-infra · 2025-01-30
Agentic Building Blocks: Creating AI Agents with MAX Serve and OpenAI Function Calling
High signal Matched: serve, agents, agentic, function calling
Hugging Face · open-source · 2024-12-31
No feed summary available yet.
High signal Matched: introducing, agents
Hugging Face · open-source · 2024-07-01
No feed summary available yet.
High signal Matched: benchmark, agent
Hugging Face · open-source · 2024-05-13
No feed summary available yet.
High signal Matched: introducing, agents
Hugging Face · open-source · 2023-07-24
No feed summary available yet.
High signal Matched: introducing, agents
Hugging Face · open-source · 2023-02-07
No feed summary available yet.
High signal Matched: introducing, agents
Hugging Face · open-source · 2021-12-02
No feed summary available yet.
High signal Matched: introducing, agents
Prime Intellect · inference-infra · 2026-06-03
No feed summary available yet.
Watchlist Matched: agent
Prime Intellect · inference-infra · 2026-06-03
No feed summary available yet.
Watchlist Matched: agentic
Prime Intellect · inference-infra · 2026-06-03
No feed summary available yet.
Watchlist Matched: training, agents
Runpod · cloud · 2026-06-03
No feed summary available yet.
Watchlist Matched: agents
Nebius · cloud · 2026-06-03
No feed summary available yet.
Watchlist Matched: agentic
Nebius · cloud · 2026-06-03
No feed summary available yet.
Watchlist Matched: agents
FriendliAI · inference-infra · 2026-06-03
No feed summary available yet.
Watchlist Matched: agentic
FriendliAI · inference-infra · 2026-06-03
No feed summary available yet.
Watchlist Matched: agents
Moonshot AI Kimi · model-lab · 2026-06-03
No feed summary available yet.
Watchlist Matched: agents
Moonshot AI Kimi · model-lab · 2026-06-03
No feed summary available yet.
Watchlist Matched: agentic
Mistral AI · model-lab · 2026-06-03
No feed summary available yet.
Watchlist Matched: agents
Mistral AI · model-lab · 2026-06-03
No feed summary available yet.
Watchlist Matched: agent
Mistral AI · model-lab · 2026-06-03
No feed summary available yet.
Watchlist Matched: agents
Anthropic · model-lab · 2026-06-03
No feed summary available yet.
Watchlist Matched: agents
Anthropic · model-lab · 2026-06-03
No feed summary available yet.
Watchlist Matched: agents
Anthropic · model-lab · 2026-06-03
No feed summary available yet.
Watchlist Matched: agentic
Stanford CRFM · research · 2026-06-03
No feed summary available yet.
Watchlist Matched: agent
Hugging Face · open-source · 2026-06-03
No feed summary available yet.
Watchlist Matched: mcp
Hugging Face · open-source · 2026-06-02
No feed summary available yet.
Watchlist Matched: agents, computer use
NVIDIA Technical Blog · hardware · 2026-06-02
AI agents are changing how you interact with your PC. Creators, developers, and AI enthusiasts are already using these agents extensively to assist with...
Watchlist Matched: agents
AWS Machine Learning Blog · cloud · 2026-06-02
This post demonstrates how to implement Open Authorization (OAuth) Code flow as an inbound authorization mechanism for MCP servers hosted on Amazon Bedrock AgentCore Gateway. By the end of this guide, you will have a production-ready setup...
Watchlist Matched: bedrock, mcp
NVIDIA Technical Blog · hardware · 2026-06-02
As AI agents move from the digital world to the physical environment, they can readily use NVIDIA Jetson to accelerate real-world deployment with optimized...
Watchlist Matched: agents, agentic
AWS Machine Learning Blog · cloud · 2026-06-02
In this post, we walk through a practical implementation using KDB-X MCP server integration with Amazon Quick, demonstrating how traders and analysts can ask questions using conversational language and receive actionable insights from data...
Watchlist Matched: performance, mcp
Hugging Face · open-source · 2026-06-01
No feed summary available yet.
Watchlist Matched: agent
NVIDIA Technical Blog · hardware · 2026-06-01
The AI era is driving a new class of infrastructure: AI factories that transform data into intelligence for autonomous AI agents operating at unprecedented...
Watchlist Matched: agents, agentic
Microsoft Research · big-tech · 2026-05-29
Data Formulator introduces AI-powered analytics for enterprise data workflows. Data teams can easily bring enterprise data into an AI-ready workspace where users can explore, analyze, and visualize data with AI agents to turn raw data into...
Watchlist Matched: research, agents
Cloudflare Blog · cloud · 2026-05-28
Here’s how we built Town Lake, Cloudflare's unified analytics platform, alongside Skipper, an internal AI agent running on top of it.
Watchlist Matched: agent
LY Corporation Tech Blog · korea · 2026-05-26
Hi, I'm Jeongwoo, a security platform engineer at LY Corporation developing and operating Athenz. In ...
Watchlist Matched: agent
Hugging Face · open-source · 2026-05-25
No feed summary available yet.
Watchlist Matched: agent
Microsoft Research · big-tech · 2026-05-22
MagenticLite is an agentic system for small models that works across the browser and local file system in a single workflow. It combines specialized models and orchestration to support efficient agentic performance on everyday tasks.
Watchlist Matched: performance, research, agentic
NVIDIA Technical Blog · hardware · 2026-05-21
In quantitative finance, researchers build algorithms to trade assets, derivatives, and other financial instruments. A key part of that work is finding signals:...
Watchlist Matched: agent
NVIDIA Technical Blog · hardware · 2026-05-20
Autonomous AI agents are taking on all types of work for businesses: routing logistics fleets, triaging support tickets, generating code, and orchestrating...
Watchlist Matched: agent, agents, agentic
Modal · inference-infra · 2026-05-20
How Applied Compute trains custom agents with Reinforcement Learning for enterprises like DoorDash, Cognition, and Mercor on Modal.
Watchlist Matched: agents
Cloudflare Blog · cloud · 2026-05-19
Cloudflare has integrated with Anthropic's Claude Managed Agents to provide a fast, isolated execution environment for autonomous code delivery. This means builders can scale agent workflows globally while strictly controlling access to pr...
Watchlist Matched: agent, agents
Modular · inference-infra · 2026-05-19
How I built a pure Mojo app (and 10 libraries) with AI agents
Watchlist Matched: agents
NVIDIA Technical Blog · hardware · 2026-05-13
In today’s data-driven world, organizations increasingly rely on video to capture critical information, yet extracting meaningful, real-time insights from...
Watchlist Matched: agents
Modular · inference-infra · 2026-05-13
Translating to Mojo via AI Agents
Watchlist Matched: agents
Microsoft Research · big-tech · 2026-05-12
Using SocialReasoning Bench, we observed a stable pattern across models: agents execute competently, but fail to consistently improve the user’s position, even with explicit instructions to optimize for user interest.
Watchlist Matched: research, agents
NVIDIA Technical Blog · hardware · 2026-05-08
An agentic exchange must preserve a structured interaction: assistant turns interleave reasoning with one or more tool calls, and subsequent user turns return...
Watchlist Matched: agentic
NVIDIA Technical Blog · hardware · 2026-05-05
Generative AI’s explosive first chapter was defined by humans sending requests and models responding. The agentic chapter is different. Agents don't...
Watchlist Matched: agents, agentic
NVIDIA Technical Blog · hardware · 2026-05-04
Modern supply chains operate under the constant pressures of fluctuating demand, volatile costs, constrained capacity, and interdependent decision-making....
Watchlist Matched: agent
Cloudflare Blog · cloud · 2026-04-30
Starting today, agents can now be Cloudflare customers. They can create a Cloudflare account, start a paid subscription, register a domain, and get back an API token to deploy code right away. Humans can be in the loop to grant permission,...
Watchlist Matched: agents, api
Lambda · cloud · 2026-04-30
Harnesses: If you've used Claude Code or Codex, you've used a harness. A harness is the infrastructure layer that wraps an AI coding agent and decides how it operates, what it can touch, and how you measure whether it worked. It's how most...
Watchlist Matched: gpu, training, post-training, agent, agents, open-source
NVIDIA Technical Blog · hardware · 2026-04-29
The next wave of enterprise productivity is being built on AI factories. As organizations deploy agentic AI systems capable of reasoning, automation, and...
Watchlist Matched: agentic
NVIDIA Technical Blog · hardware · 2026-04-28
The subsurface industry is at a critical point in its digital evolution. For decades, unlocking reservoir potential has relied on experts performing essential...
Watchlist Matched: agentic
Sakana AI · model-lab · 2026-04-27
No feed summary available yet.
Watchlist Matched: agents
Hugging Face · open-source · 2026-04-24
No feed summary available yet.
Watchlist Matched: agents
NVIDIA Technical Blog · hardware · 2026-04-23
In March 2026, three LLM agents generated over 600,000 lines of code, ran 850 experiments, and helped secure a first-place finish in a Kaggle playground...
Watchlist Matched: agents
Google Research · big-tech · 2026-04-22
Generative AI
Watchlist Matched: agents
Cloudflare Blog · cloud · 2026-04-20
Agents Week 2026 is a wrap. Let’s take a look at everything we announced, from compute and security to the agent toolbox, platform tools, and the emerging agentic web. Everything we shipped for the agentic cloud.
Watchlist Matched: cloud, agent, agents, agentic
NVIDIA Technical Blog · hardware · 2026-04-17
Agents are evolving from question-and-answer systems into long-running autonomous assistants that read files, call APIs, and drive multi-step workflows....
Watchlist Matched: agent, agents
LY Corporation Tech Blog · korea · 2026-04-17
As of 2026, the AI paradigm is steadily shifting from mere chat interfaces to action-centric executi...
Watchlist Matched: agent
NVIDIA Technical Blog · hardware · 2026-04-16
Developing real-time vision AI applications presents a significant challenge for developers, often demanding intricate data pipelines, countless lines of code,...
Watchlist Matched: agents
Modular · inference-infra · 2026-04-16
How Frontier Coding Agents Built a Video Diffusion Pipeline on MAX
Watchlist Matched: agents
Hugging Face · open-source · 2026-04-16
No feed summary available yet.
Watchlist Matched: agents
Hugging Face · open-source · 2026-04-15
No feed summary available yet.
Watchlist Matched: agents, tool use
Modal · inference-infra · 2026-04-15
Modal is an official sandbox provider for the OpenAI Agents SDK.
Watchlist Matched: agents, sdk
Together AI · inference-infra · 2026-04-13
EinsteinArena is a platform where AI agents collaborate and compete on open math problems. AI agents on EinsteinArena have already set 11 new state-of-the-art results on open math problems — including pushing the kissing number lower bound...
Watchlist Matched: agents
AI2 · research · 2026-04-13
Two benchmarks developed at Ai2 – ScienceWorld and DiscoveryWorld – reveal that even incredibly strong AI science agents struggle with problems human scientists solve routinely.
Watchlist Matched: evaluating, benchmarks, agents
NVIDIA Technical Blog · hardware · 2026-03-24
Agentic AI is an ecosystem where specialized models work together to handle planning, reasoning, retrieval, and safety guardrailing. As these systems scale,...
Watchlist Matched: rag, retrieval, agents, agentic
Hugging Face · open-source · 2026-03-24
No feed summary available yet.
Watchlist Matched: evaluating, agents
AI2 · research · 2026-03-24
Introducing MolmoWeb, an open visual web agent that navigates and completes tasks in a browser using screenshots alone, along with MolmoWebMix, the largest public dataset for training web agents.
Watchlist Matched: introducing, training, agent, agents
AI2 · research · 2026-03-23
A recap of Ai2's week at NVIDIA GTC 2026, covering panels on open models, live demos of Olmo Hybrid and Asta AutoDiscovery, and conversations on coding agents, hybrid architectures, and robotics.
Watchlist Matched: agents
NVIDIA Technical Blog · hardware · 2026-03-18
While consumer AI offers powerful capabilities, workplace tools often suffer from disjointed data and limited context. Built with LangChain, the NVIDIA AI-Q...
Watchlist Matched: agents
NVIDIA Technical Blog · hardware · 2026-03-17
AI-native services are exposing a new bottleneck in AI infrastructure: As millions of users, agents, and devices demand access to intelligence, the challenge is...
Watchlist Matched: agents
NVIDIA Technical Blog · hardware · 2026-03-16
Autonomous AI agents are driving the next wave of AI innovation. These agents must often manage long-running tasks that use multiple communication channels and...
Watchlist Matched: agents
NVIDIA Technical Blog · hardware · 2026-03-16
AI has evolved from assistants following your directions to agents that act independently. Called claws, these agents can take a goal, figure out how to achieve...
Watchlist Matched: agents
NVIDIA Technical Blog · hardware · 2026-03-16
Artificial intelligence is token-driven. Every prompt, reasoning step, and agent interaction generates tokens. Over the past year, token consumption has grown...
Watchlist Matched: agent
Together AI · inference-infra · 2026-02-25
No feed summary available yet.
Watchlist Matched: training, agents, sota
Hugging Face · open-source · 2026-02-19
No feed summary available yet.
Watchlist Matched: agents
Modal · inference-infra · 2026-02-19
No feed summary available yet.
Watchlist Matched: agent
Hugging Face · open-source · 2026-02-12
No feed summary available yet.
Watchlist Matched: evaluating, agents
Google Research · big-tech · 2026-01-28
Generative AI
Watchlist Matched: agent
Hugging Face · open-source · 2026-01-27
No feed summary available yet.
Watchlist Matched: training, agentic, oss
Hugging Face · open-source · 2026-01-21
No feed summary available yet.
Watchlist Matched: benchmarks, agent
LY Corporation Tech Blog · korea · 2026-01-05
This post is a follow-up to Creating a domain-specific NL-to-SQL MCP server, which introduced our MC...
Watchlist Matched: agent, mcp
Hugging Face · open-source · 2026-01-05
No feed summary available yet.
Watchlist Matched: agents
SkyPilot · open-source · 2025-12-17
Train a tool-calling agent with VeRL and use SkyPilot to scale it up with an independent RL trainer and env rollout.
Watchlist Matched: agent
Hugging Face · open-source · 2025-12-16
No feed summary available yet.
Watchlist Matched: agents
Hugging Face · open-source · 2025-12-04
No feed summary available yet.
Watchlist Matched: agent
LY Corporation Tech Blog · korea · 2025-11-28
Enterprise data analysis faces a fundamental challenge: the gap between business questio...
Watchlist Matched: mcp
Modal · inference-infra · 2025-11-20
Turns out, good devex for agents looks a lot like good devex for humans.
Watchlist Matched: agents
Google Research · big-tech · 2025-11-07
Data Mining & Modeling
Watchlist Matched: agent
Hugging Face · open-source · 2025-10-30
No feed summary available yet.
Watchlist Matched: agent
Together AI · inference-infra · 2025-10-28
Test AI agents in the real world with Collinear TraitMix and Together Evals: dynamic persona simulations, multi-turn dialogs, and LLM-as-judge scoring.
Watchlist Matched: evals, agent, agents
SkyPilot · open-source · 2025-10-14
Want to train an AI agent with RL that can solve math problems or write code? This tutorial walks you through building your own math and coding agents with step-by-step examples with plenty of screenshots to help you along the way. We use...
Watchlist Matched: training, post-training, agent, agents
Google Research · big-tech · 2025-09-30
Generative AI
Watchlist Matched: agent
Hugging Face · open-source · 2025-09-29
No feed summary available yet.
Watchlist Matched: agent
Hugging Face · open-source · 2025-09-23
No feed summary available yet.
Watchlist Matched: training, post-training, agents, computer use
Hugging Face · open-source · 2025-09-22
No feed summary available yet.
Watchlist Matched: agents
Google Research · big-tech · 2025-09-19
Human-Computer Interaction and Visualization
Watchlist Matched: agent, agents
Hugging Face · open-source · 2025-09-10
No feed summary available yet.
Watchlist Matched: training, agents
LY Corporation Tech Blog · korea · 2025-08-20
Hello. I'm Sumin Shin, a developer working on services related to LLM agents at LINE AI LAB, LINE Pl...
Watchlist Matched: agents
Replicate · inference-infra · 2025-08-10
Use our MCP to discover, compare, and run models from apps like Claude, Cursor, and VS Code.
Watchlist Matched: mcp
Google Research · big-tech · 2025-08-01
Machine Intelligence
Watchlist Matched: agent
Hugging Face · open-source · 2025-07-31
No feed summary available yet.
Watchlist Matched: mcp
Hugging Face · open-source · 2025-07-17
No feed summary available yet.
Watchlist Matched: mcp
Hugging Face · open-source · 2025-07-17
No feed summary available yet.
Watchlist Matched: evaluating, agents
Modular · inference-infra · 2025-07-16
AI Agents for AWS Marketplace
Watchlist Matched: agents
Hugging Face · open-source · 2025-07-10
No feed summary available yet.
Watchlist Matched: agent
Hugging Face · open-source · 2025-07-10
No feed summary available yet.
Watchlist Matched: mcp
Hugging Face · open-source · 2025-07-09
No feed summary available yet.
Watchlist Matched: mcp
Together AI · inference-infra · 2025-07-02
No feed summary available yet.
Watchlist Matched: training, agent
Together AI · inference-infra · 2025-06-12
Build a data scientist agent using Together’s open-source models and Code Interpreter—easy to implement, solid benchmarks, and full code on GitHub.
Watchlist Matched: benchmarks, agent, open-source
Hugging Face · open-source · 2025-06-03
No feed summary available yet.
Watchlist Matched: agent
Together AI · inference-infra · 2025-05-28
No feed summary available yet.
Watchlist Matched: training, post-training, agents, open-source
Hugging Face · open-source · 2025-05-23
No feed summary available yet.
Watchlist Matched: agent, agents, mcp
Hugging Face · open-source · 2025-04-30
No feed summary available yet.
Watchlist Matched: mcp
Hugging Face · open-source · 2025-04-25
No feed summary available yet.
Watchlist Matched: agent, agents, mcp
BAIR · research · 2025-03-25
We deployed 100 reinforcement learning (RL)-controlled cars into rush-hour highway traffic to smooth congestion and reduce fuel consumption for everyone. Our goal is to tackle "stop-and...
Watchlist Matched: throughput, kernel, performance, model, paper, training, agent, agents
Hugging Face · open-source · 2025-02-28
No feed summary available yet.
Watchlist Matched: evaluate, agent
Hugging Face · open-source · 2025-02-04
No feed summary available yet.
Watchlist Matched: agents, open-source
Hugging Face · open-source · 2025-01-13
No feed summary available yet.
Watchlist Matched: agents
Hugging Face · open-source · 2024-08-12
No feed summary available yet.
Watchlist Matched: tool use
Hugging Face · open-source · 2024-04-22
No feed summary available yet.
Watchlist Matched: agent
Hugging Face · open-source · 2024-01-24
No feed summary available yet.
Watchlist Matched: agents, open-source
Hugging Face · open-source · 2023-01-24
No feed summary available yet.
Watchlist Matched: agent