Archive - MLSys Radar

AI agents are changing how you interact with your PC. Creators, developers, and AI enthusiasts are already using these agents extensively to assist with...

agents

Open

Watchlist Matched: agents

AWS Machine Learning Blog · cloud · 2026-06-02

Building a secure auth code flow setup using AgentCore Gateway with MCP clients

Score 7

This post demonstrates how to implement Open Authorization (OAuth) Code flow as an inbound authorization mechanism for MCP servers hosted on Amazon Bedrock AgentCore Gateway. By the end of this guide, you will have a production-ready setup...

cloud agents

Open

Watchlist Matched: bedrock, mcp

NVIDIA Technical Blog · hardware · 2026-06-02

Deploy Agentic-Ready AI at the Edge with Memory Efficiency in NVIDIA JetPack 7.2

Score 4

As AI agents move from the digital world to the physical environment, they can readily use NVIDIA Jetson to accelerate real-world deployment with optimized...

agents

Open

Watchlist Matched: agents, agentic

Cloudflare Blog · cloud · 2026-06-02

Score 4

Physical AI systems must understand the real world before they can act within it. Robots, autonomous vehicles, and smart spaces need to understand what's...

Open

Watchlist Matched: none

NVIDIA Technical Blog · hardware · 2026-06-01

Advancing AI Infrastructure for Agentic AI with NVIDIA DOCA In-Silicon Security

Score 4

The AI era is driving a new class of infrastructure: AI factories that transform data into intelligence for autonomous AI agents operating at unprecedented...

agents

Open

Watchlist Matched: agents, agentic

NVIDIA Technical Blog · hardware · 2026-06-01

NVIDIA DSX OS Delivers Open, Modular Software for Operating AI Factories at Scale

Score 4

AI is now essential infrastructure, powered by AI factories that generate intelligence in the form of tokens. As demand grows, these factories must scale...

Open

Watchlist Matched: none

Modal · inference-infra · 2026-06-01

Reinforcement learning is an infrastructure problem

Score 2

What we've seen helping teams run Reinforcement Learning at scale on Modal. Plus an open-source library to skip the scaffolding.

open-source

Open

Watchlist Matched: open-source

Sakana AI · model-lab · 2026-06-01

金融領域の業務をAIエージェントで変える：Sakana AI、Software Engineerインタビュー

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Modular · inference-infra · 2026-05-29

Score 0

Cloudflare Radar data confirms early indications of a partial Internet restoration in Iran, nearly three months after the shutdown began. Traffic spikes and DNS queries have risen, but network activity is currently just 40% of pre-shutdown...

Open

Watchlist Matched: none

Google Research · big-tech · 2026-05-28

Score 2

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-05-27

Score 1

A little over a year ago, the PyTorch Foundation launched the Ambassador Program, an initiative that recognizes and supports independent, trusted voices in the PyTorch community who are passionate about...

Open

Watchlist Matched: none

NVIDIA Technical Blog · hardware · 2026-05-22

Score 1

Thank you to everyone who participated in the PyTorch Docathon 2026! Once again, the community showed up with incredible energy and dedication to make PyTorch documentation better for developers everywhere....

Open

Watchlist Matched: none

AI2 · research · 2026-05-21

Score 0

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-05-20

Score 3

In today’s data-driven world, organizations increasingly rely on video to capture critical information, yet extracting meaningful, real-time insights from...

agents

Open

Watchlist Matched: agents

NVIDIA Technical Blog · hardware · 2026-05-13

Accelerated X-Ray Analysis for Nanoscale Imaging (XANI) of Novel Materials

Score 3

A massive-scale X-ray free-electron laser (XFEL) enables tracking structural and electron dynamics in novel systems, including fusion materials, semiconductors,...

Open

Watchlist Matched: none

Modular · inference-infra · 2026-05-13

Translating to Mojo via AI Agents

Score 1

Translating to Mojo via AI Agents

agents

Open

Watchlist Matched: agents

Modal · inference-infra · 2026-05-12

Score 0

No feed summary available yet.

Open

Watchlist Matched: none

Microsoft Research · big-tech · 2026-05-09

Building realistic electric transmission grid dataset at scale: a pipeline from open dataset

Score 6

Microsoft Research is excited to release an open dataset of approximate transmission topology of the U.S. power grid derived from publicly available data. The ability to study transmission-level power grid behavior is essential for modern...

model-release research

Open

Watchlist Matched: release, research

Sakana AI · model-lab · 2026-05-09

Sparser, Faster, Lighter Transformer Language Models

Score 0

No feed summary available yet.

Open

Watchlist Matched: none

NVIDIA Technical Blog · hardware · 2026-05-08

Streaming Tokens and Tools: Multi-Turn Agentic Harness Support in NVIDIA Dynamo

Score 3

An agentic exchange must preserve a structured interaction: assistant turns interleave reasoning with one or more tool calls, and subsequent user turns return...

agents

Open

Watchlist Matched: agentic

Cloudflare Blog · cloud · 2026-05-08

Building for the future

Score 0

This afternoon, we sent the following email to our global team. One of our core values at Cloudflare is transparency, and we believe it's important that you hear this directly from us because it’s a major moment at Cloudflare.

Open

Watchlist Matched: none

Lambda · cloud · 2026-05-07

Lambda closes $1 billion senior secured credit facility to meet gigawatt-scale AI infrastructure demand

Score 0

Upsized financing builds on August 2025 credit facility, supporting continued expansion of Lambda's AI factory footprint

Open

Watchlist Matched: none

Cloudflare Blog · cloud · 2026-05-07

How Cloudflare responded to the “Copy Fail” Linux vulnerability

Score 4

When a critical Linux kernel privilege escalation was publicly disclosed, Cloudflare's security and engineering teams detected, investigated, and mitigated the threat across our global fleet, confirming zero customer impact and no maliciou...

kernel

Open

Watchlist Matched: kernel

Modular · inference-infra · 2026-05-07

Modular 26.3: Mojo 1.0 Beta, MAX Video Gen, and more

Score 1

Modular 26.3: Mojo 1.0 Beta, MAX Video Gen, and more

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-05-07

Score 0

Co-founder Stephen Balaban to lead technology vision full-time as CTO; global infrastructure operator Michel Combes named CEO; former AT&T CEO John Donovan appointed Chairman of the Board

Open

Watchlist Matched: none

NVIDIA Technical Blog · hardware · 2026-05-05

Building for the Rising Complexity of Agentic Systems with Extreme Co-Design

Score 3

Generative AI’s explosive first chapter was defined by humans sending requests and models responding. The agentic chapter is different.  Agents don't...

agents

Open

Watchlist Matched: agents, agentic

AI2 · research · 2026-05-05

MolmoAct 2: An open foundation for robots that work in the real world

Score 6

MolmoAct 2 is a fully open robotics foundation model that brings faster, stronger 3D action reasoning to real-world robot tasks, alongside a major new bimanual manipulation dataset for researchers to study, reproduce, and build on.

model-release

Open

Watchlist Matched: model

NVIDIA Technical Blog · hardware · 2026-05-04

Optimize Supply Chain Decision Systems Using NVIDIA cuOpt Agent Skills

Score 3

Modern supply chains operate under the constant pressures of fluctuating demand, volatile costs, constrained capacity, and interdependent decision-making....

agents

Open

Watchlist Matched: agent

Lambda · cloud · 2026-05-04

Most AI teams treat compute as a commodity. It's not.

Score 6

Score 3

Creative and visualization teams today produce more assets, in more formats, with leaner teams. Generative AI can accelerate that work – compressing tasks...

Open

Watchlist Matched: none

Together AI · inference-infra · 2026-04-30

Score 0

No feed summary available yet.

agents

Open

Watchlist Matched: agents

Sakana AI · model-lab · 2026-04-26

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Google Research · big-tech · 2026-04-23

It's all about the angle: Your photos, re-composed

Score 0

Generative AI

Open

Watchlist Matched: none

AI2 · research · 2026-04-23

OlmPool: How small architectural choices compound to undermine long context extension

Score 0

OlmPool is a controlled suite of 26 models showing how small architecture choices can compound to make long-context extension much harder, even when training data and extension recipes are held constant.

training long-context

Open

Watchlist Matched: training, long context, long-context

NVIDIA Technical Blog · hardware · 2026-04-22

Simplify Sparse Deep Learning with Universal Sparse Tensor in nvmath-python

Score 3

In a previous post, we introduced the Universal Sparse Tensor (UST), enabling developers to decouple a tensor’s sparsity from its memory layout for greater...

Open

Watchlist Matched: none

Cloudflare Blog · cloud · 2026-04-22

Score 0

For the past 10 years, Ai2 has built open, real-time tools that help people protect wildlife, oceans, and ecosystems around the world.

Open

Watchlist Matched: none

Cloudflare Blog · cloud · 2026-04-21

Moving past bots vs. humans

Score 0

As AI assistants and privacy proxies challenge the capabilities of traditional bot detection, the Web needs new models for accountability. We believe that control should remain with the client, and that an open ecosystem of anonymous crede...

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-04-21

QIMMA قِمّة ⛰: A Quality-First Arabic LLM Leaderboard

Score 1

No feed summary available yet.

evals

Open

Watchlist Matched: leaderboard

Hugging Face · open-source · 2026-04-21

Score 3

The development of socially acceptable nuclear reactors requires that they are safe, clean, efficient, economical, and sustainable. Meeting these requirements...

Open

Watchlist Matched: none

Google Research · big-tech · 2026-04-16

Designing synthetic datasets for the real world: Mechanism design and reasoning from first principles

Score 0

Generative AI

Open

Watchlist Matched: none

Google Research · big-tech · 2026-04-16

AI-generated synthetic neurons speed up brain mapping

Score 0

General Science

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2026-04-16

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-04-16

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Score 1

No feed summary available yet.

training fine-tuning

Open

Watchlist Matched: training, finetuning

Hugging Face · open-source · 2026-04-15

Inside VAKRA: Reasoning, Tool Use, and Failure Modes of Agents

Using AI to visualize critical paths on LINE app for Android

Score 0

I work in the LINE Official Account (OA) team as an Android developer. One of our jobs is to maintai...

Open

Watchlist Matched: none

Google Research · big-tech · 2026-04-03

Evaluating alignment of behavioral dispositions in LLMs

Score 0

Granite 4.0 3B Vision: Compact Multimodal Intelligence for Enterprise Documents

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

NVIDIA Technical Blog · hardware · 2026-03-31

Build and Stream Browser-Based XR Experiences with NVIDIA CloudXR.js

Score 3

Delivering high-fidelity VR and AR experiences to enterprise users has typically required native application development, custom device management, and complex...

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-03-31

Training mRNA Language Models Across 25 Species for $165

Score 1

No feed summary available yet.

training

Open

Watchlist Matched: training

Google Research · big-tech · 2026-03-31

Safeguarding cryptocurrency by disclosing quantum vulnerabilities responsibly

Score 0

Algorithms & Theory

Open

Watchlist Matched: none

Modular · inference-infra · 2026-03-31

Modverse #54: From GTC to Edinburgh, a Community Building Momentum

Score 1

Modverse #54: From GTC to Edinburgh, a Community Building Momentum

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-03-31

TRL v1.0: Post-Training Library Built to Move with the Field

Score 1

A New Framework for Evaluating Voice Agents (EVA)

Score 1

No feed summary available yet.

evals agents

Open

Watchlist Matched: evaluating, agents

AI2 · research · 2026-03-24

MolmoWeb: An open agent for automating web tasks

Score 6

Introducing MolmoWeb, an open visual web agent that navigates and completes tasks in a browser using screenshots alone, along with MolmoWebMix, the largest public dataset for training web agents.

model-release training agents

Open

Watchlist Matched: introducing, training, agent, agents

AI2 · research · 2026-03-23

Highlights from Ai2 at NVIDIA GTC 2026

Score 0

A recap of Ai2's week at NVIDIA GTC 2026, covering panels on open models, live demos of Olmo Hybrid and Asta AutoDiscovery, and conversations on coding agents, hybrid architectures, and robotics.

agents

Open

Watchlist Matched: agents

NVIDIA Technical Blog · hardware · 2026-03-18

How to Build Deep Agents for Enterprise Search with NVIDIA AI-Q and LangChain

Score 3

While consumer AI offers powerful capabilities, workplace tools often suffer from disjointed data and limited context. Built with LangChain, the NVIDIA AI-Q...

agents

Open

Watchlist Matched: agents

LY Corporation Tech Blog · korea · 2026-03-18

Unification of Group Chat on the LINE App

Score 0

This article was originally published on the pre-merger blog (first published on February 24, 2022) ...

Open

Watchlist Matched: none

Google Research · big-tech · 2026-03-18

Score 3

Healthcare faces a structural demand–capacity crisis: a projected global shortfall of ~10 million clinicians by 2030, billions of diagnostic exams annually...

Open

Watchlist Matched: none

NVIDIA Technical Blog · hardware · 2026-03-16

Scaling Autonomous AI Agents and Workloads with NVIDIA DGX Spark

Score 3

Autonomous AI agents are driving the next wave of AI innovation. These agents must often manage long-running tasks that use multiple communication channels and...

agents

Open

Watchlist Matched: agents

NVIDIA Technical Blog · hardware · 2026-03-16

Score 3

Physics forms the foundation of robotic simulation, enabling realistic modeling of motion and interaction. For tasks like locomotion and manipulation,...

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2026-03-13

Improving code quality - Session 69: My tips for code quality

Score 0

Hello, I'm Munetoshi Ishikawa, a mobile client developer for the LINE messaging app.This article is ...

Open

Watchlist Matched: none

Google Research · big-tech · 2026-03-12

Protecting cities with AI-driven flash flood forecasting

Score 0

Climate & Sustainability

Open

Watchlist Matched: none

Google Research · big-tech · 2026-03-12

Exploring the feasibility of conversational diagnostic AI in a real-world clinical study

Score 0

Generative AI

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2026-03-11

Distributed mobile team collaboration: Code & design reviews, architecture discussions, and continuous practice

Score 6

In November 2025, mobile engineers from our Tokyo and Ho Chi Minh City (HCMC) Development Centers ca...

distributed

Open

Watchlist Matched: distributed

Modular · inference-infra · 2026-03-11

PreScience: Forecasting the future of science end-to-end

Score 0

PreScience is a new benchmark that evaluates whether AI can forecast how science unfolds end-to-end, from team formation through eventual impact.

benchmark

Open

Watchlist Matched: benchmark

Replicate · inference-infra · 2026-02-24

How to prompt Seedream 5.0

Score 6

Seedream 5.0 brings multi-step reasoning, example-based editing, and deep domain knowledge to image generation. Here's what you should know.

inference

Open

Watchlist Matched: generation

LY Corporation Tech Blog · korea · 2026-02-20

Improving code quality - Session 67: Excessive errors are like insufficient ones

Score 0

The original article was published on April 17, 2025.Hello, I'm Munetoshi Ishikawa, a mobile client ...

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-02-20

GGML and llama.cpp join HF to ensure the long-term progress of Local AI

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-02-20

Score 1

The Claude C Compiler: What It Reveals About the Future of Software

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-02-18

One-Shot Any Web App with Gradio's gr.HTML

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Google Research · big-tech · 2026-02-18

Teaching AI to read a map

Score 0

Machine Perception

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2026-02-13

Improving code quality - Session 66: Assertive but still worried

Score 0

The original article was published on April 10, 2025.Hello, I'm Munetoshi Ishikawa, a mobile client ...

Open

Watchlist Matched: none

Hugging Face · open-source · 2026-02-13

Score 0

Human-Computer Interaction and Visualization

Open

Watchlist Matched: none

Modal · inference-infra · 2026-02-11

Try GLM-5.1, the new frontier of open intelligence, on Modal

Score 1

Score 6

Score 0

Hiring Alon Gavrielov further deepens Together AI’s commitment to building AI factories that deliver the most reliable, efficient, and scalable infrastructure for AI-native teams.

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2026-01-30

Improving code quality - Session 64: Keep primary constructors simple

Score 0

The original article was published on March 27, 2025.Hello, I'm Masakuni Ōishi, an engineer working ...

Open

Watchlist Matched: none

NC AI · korea · 2026-01-29

Safety is a given, cost savings are a bonus: why AI services need dedicated guardrails

Score 6

Score 1

Train a tool-calling agent with VeRL and use SkyPilot to scale it up with independent RL trainer and env rollout

agents

Open

Watchlist Matched: agent

Hugging Face · open-source · 2025-12-16

CUGA on Hugging Face: Democratizing Configurable AI Agents

Score 1

No feed summary available yet.

agents

Open

Watchlist Matched: agents

vLLM Project · open-source · 2025-12-14

Token-Level Truth: Real-Time Hallucination Detection for Production LLMs

Score 3

Your LLM just called a tool, received accurate data, and still got the answer wrong. Welcome to the world of extrinsic hallucination—where models confidently ignore the ground truth sitting right...

Open

Watchlist Matched: none

Google Research · big-tech · 2025-12-12

Spotlight on innovation: Google-sponsored Data Science for Health Ideathon across Africa

Score 0

Conferences & Events

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2025-12-12

Improving code quality - Session 57: Seeing is believing

Score 0

The original article was published on February 6, 2025.Hello, I'm Munetoshi Ishikawa, a mobile clien...

Open

Watchlist Matched: none

Together AI · inference-infra · 2025-12-12

Announcing Together Python SDK v2.0

Score 3

Score 1

Modverse #52: Advancing AI Together — Community Projects & Platform Milestones

Open

Watchlist Matched: none

NC AI · korea · 2025-12-02

VARCO 3D 크리에이터 프로그램 시작— 뜨거웠던 킥오프 현장 스케치

Score 0

No feed summary available yet.

Open

Watchlist Matched: none

NC AI · korea · 2025-12-01

VARCO 3D 1.0 정식 런칭

Score 0

No feed summary available yet.

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2025-11-28

Creating a domain-specific NL-to-SQL MCP server

Score 0

IntroductionEnterprise data analysis faces a fundamental challenge: the gap between business questio...

agents

Open

Watchlist Matched: mcp

LY Corporation Tech Blog · korea · 2025-11-28

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-11-25

Continuous batching from first principles

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2025-11-21

PyTorch and LLVM in 2025 — Keeping up With AI Innovation

Score 1

PyTorch and LLVM in 2025 — Keeping up With AI Innovation

Open

Watchlist Matched: none

Google Research · big-tech · 2025-11-06

Forecasting the future of forests with AI: From counting losses to predicting risk

Score 0

Climate & Sustainability

Open

Watchlist Matched: none

Google Research · big-tech · 2025-11-05

Exploring a space-based, scalable AI infrastructure system design

Score 0

General Science

Open

Watchlist Matched: none

BAIR · research · 2025-11-01

RL without TD learning

Score 4

In this post, I’ll introduce a reinforcement learning (RL) algorithm based on an “alternative” paradigm: divide and conquer. Unlike traditional methods, this algorithm is not based on temporal difference (TD) learning (which has scalabilit...

benchmark model-release research training

Open

Watchlist Matched: benchmark, performance, model, paper, training

Modal · inference-infra · 2025-10-31

Product updates: Updates to Volumes, JS and Go SDKs, and more

Score 1

Welcome to another round of Modal Product Updates! Here's what's new this month.

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2025-10-31

Improving code quality - Session 51: Convincing questions

Score 0

The original article was published on November 21, 2024.Hello, I'm Munetoshi Ishikawa, a mobile clie...

Open

Watchlist Matched: none

Google Research · big-tech · 2025-10-30

Toward provably private insights into AI use

Score 0

Generative AI

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-10-30

Aligning to What? Rethinking Agent Generalization in MiniMax M2

Score 1

Score 3

Score 1

Improving code quality - Session 49: Dependency between reality and illusion

Score 0

The original article was published on November 7, 2024.Hello, I'm Munetoshi Ishikawa, a mobile clien...

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-10-17

AI for Food Allergies

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Google Research · big-tech · 2025-10-17

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Google Research · big-tech · 2025-10-09

XR Blocks: Accelerating AI + XR innovation

Score 0

Generative AI

Open

Watchlist Matched: none

Google Research · big-tech · 2025-10-08

Speech-to-Retrieval (S2R): A new approach to voice search

Score 0

Machine Intelligence

rag

Open

Watchlist Matched: retrieval

Hugging Face · open-source · 2025-10-07

BigCodeArena: Judging code generations end to end with code executions

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

LY Corporation Tech Blog · korea · 2025-10-02

Behind the scenes: Supporting engineers and designers during Tech Week 2025

Score 0

Hello! I’m Yoshidumi from developer relations (DevRel), and I oversaw Tech Week 2025.Tech Week 2025,...

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-10-02

SOTA OCR with Core ML and dots.ocr

Score 1

No feed summary available yet.

frontier-model

Open

Watchlist Matched: sota

Replicate · inference-infra · 2025-10-02

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Modal · inference-infra · 2025-09-22

Justin Dignelli joins Modal as VP of Sales

Score 1

We're excited to welcome Justin Dignelli to Modal. As VP of Sales, he will be leading our GTM efforts.

Open

Watchlist Matched: none

Modular · inference-infra · 2025-09-22

Modular 25.6: Unifying the latest GPUs from NVIDIA, AMD, and Apple

Score 1

Modular 25.6: Unifying the latest GPUs from NVIDIA, AMD, and Apple

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-09-22

Gaia2 and ARE: Empowering the community to study agents

Score 1

No feed summary available yet.

agents

Open

Watchlist Matched: agents

Modal · inference-infra · 2025-09-22

Build an AI coding platform that scales to millions of monthly sessions

Score 1

Modal Sandboxes: impeccable vibes meet incredible scale.

Open

Watchlist Matched: none

Google Research · big-tech · 2025-09-20

Announcing Replicate's remote MCP server

Score 6

We've partnered with Bria to bring a suite of commercial-grade image generation and editing models to Replicate. Built entirely on licensed data, Bria’s tools are designed for enterprises and developers building safely with visual AI.

inference

Open

Watchlist Matched: generation

Hugging Face · open-source · 2025-07-17

Score 0

A deep-dive into the Taylor Seer optimization technique

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-07-15

Migrating the Hub from Git LFS to Xet

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-07-10

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Modal · inference-infra · 2025-07-10

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-07-08

Efficient MultiModal Data Pipeline

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Modal · inference-infra · 2025-07-07

How Modal powered 250,000 Lovable app creations in a weekend

Score 0

During a single weekend event, Lovable users built 250,000 new applications, all running in isolated development environments. Lovable used Modal to generate 1 million code sandboxes—with 20,000 running concurrently at peak—over just 48 ho...

Open

Watchlist Matched: none

Modular · inference-infra · 2025-07-03

Score 0

We hosted a hackathon with BFL for FLUX.1 Kontext. Here were the winners.

Open

Watchlist Matched: none

Modal · inference-infra · 2025-06-30

How Quora uses Modal to run thousands of Python sandboxes simultaneously

Score 1

Quora is building Poe, a platform where anyone can deploy a public AI chatbot. Quora uses Modal Sandboxes at scale to safely run LLM-generated code in the context of user chats.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-06-28

Welcome the NVIDIA Llama Nemotron Nano VLM to Hugging Face Hub

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-06-26

Gemma 3n fully available in the open-source ecosystem!

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Hugging Face · open-source · 2025-06-23

Transformers backend integration in SGLang

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Modular · inference-infra · 2025-06-20

FLUX.1 Kontext models: Character consistency and precise image editing without fine-tuning

Score 3

No feed summary available yet.

fine-tuning

Open

Watchlist Matched: fine-tuning

Modal · inference-infra · 2025-05-28

Twirl is joining Modal

Score 1

Twirl, a Stockholm-based data orchestration platform, is joining Modal.

Open

Watchlist Matched: none

Together AI · inference-infra · 2025-05-28

Mixture-of-Agents Alignment: Harnessing the Collective Intelligence of Open-Source LLMs to Improve Post-Training

Score 3

Run OpenAI’s latest models on Replicate

Score 0

OpenAI's latest models are now available on Replicate, including GPT-4.1, GPT-4o, and the o-series.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-05-21

Falcon-Arabic: A Breakthrough in Arabic Language Models

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-05-21

Exploring Quantization Backends in Diffusers

Score 1

No feed summary available yet.

quantization

Open

Watchlist Matched: quantization

Hugging Face · open-source · 2025-05-21

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Modal · inference-infra · 2025-05-20

Modal's serverless KV store gets its limit raised to infinity

Score 1

We've supercharged our Dicts to support new caching and locking workflows—oh, and unlimited items.

Open

Watchlist Matched: none

Together AI · inference-infra · 2025-05-20

Together Code Sandbox: the most robust infrastructure for building AI coding products at scale

Score 3

No feed summary available yet.

Open

Watchlist Matched: none

Together AI · inference-infra · 2025-05-20

Together Code Interpreter: execute LLM-generated code seamlessly with a simple API call

Score 3

No feed summary available yet.

api

Open

Watchlist Matched: api

Hugging Face · open-source · 2025-05-19

Microsoft and Hugging Face expand collaboration

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-05-15

Falcon-Edge: A series of powerful, universal, fine-tunable 1.58bit language models.

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Replicate · inference-infra · 2025-05-15

Run 30,000+ LoRAs on Hugging Face with Replicate

Score 6

Score 1

Today we're releasing lightweight client libraries for JavaScript and Go, making it easier to start sandboxes and call serverless functions — no Python required.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-04-30

How to Build an MCP Server with Gradio

Score 1

No feed summary available yet.

agents

Open

Watchlist Matched: mcp

Hugging Face · open-source · 2025-04-30

The 4 Things Qwen-3’s Chat Template Teaches Us

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-04-29

Score 0

Advanced face swap and AI avatars from Easel AI are now on Replicate.

Open

Watchlist Matched: none

Modal · inference-infra · 2025-04-15

Our first brand campaign

Score 1

Behind the scenes of updating our visual identity and launching our first-ever out-of-home campaign in San Francisco.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-04-14

4M Models Scanned: Protect AI + Hugging Face 6 Months In

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-04-14

Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition 🤖

Score 0

Score 0

We take a quick look at the latest creative models, experiments, and community projects.

fine-tuning

Open

Watchlist Matched: lora

Hugging Face · open-source · 2025-03-27

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-02-28

Trace & Evaluate your Agent with Arize Phoenix

Score 1

Score 1

The Rise and Fall of ONNX (feat. PyTorch 2.0)

Score 1

This article explores the rise and fall of ONNX, from its early success as a unifying stasndard for AI frameworks to its gradual shift into a niche tool in the era of PyTorch 2.0.

Open

Watchlist Matched: none

Hugging Face · open-source · 2025-02-04

Open-source DeepResearch – Freeing our search agents

Score 1

Mastering Long Contexts in LLMs with KVPress

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Modular · inference-infra · 2025-01-23

Use MAX with Open WebUI for RAG and Web Search

Score 1

Score 1

Modal is excited to announce its SOC 2 Type II certification.

Open

Watchlist Matched: none

Modal · inference-infra · 2024-12-28

Product updates: memory snapshotting, OIDC, async job queues & more

Score 1

Welcome to another round of Modal Product Updates! Here's what's new this month.

Open

Watchlist Matched: none

SqueezeBits · korea · 2024-12-23

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-12-16

AI video is having its Stable Diffusion moment

Score 0

There are lots of models that are as good as OpenAI's Sora now.

Open

Watchlist Matched: none

Modal · inference-infra · 2024-12-10

What is LLM fine-tuning?

Score 1

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open source

SqueezeBits · korea · 2024-11-26

[vLLM vs TensorRT-LLM] #9. Parallelism Strategies

Score 1

This article provides a comparative analysis of different parallelism strategies on vLLM and TensorRT-LLM frameworks.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-11-26

Score 1

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-08-30

Fine-tune FLUX.1 to create images of yourself

Score 6

Create your own fine-tuned Flux model to generate new images of yourself.

model-release

Open

Watchlist Matched: model

Hugging Face · open-source · 2024-08-27

Scaling robotics datasets with video encoding

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-08-23

Replicate Intelligence #12

Score 0

Flux LoRAs, Hot Zuck, and Replicate on Lex Fridman

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-08-22

The 5 Most Under-Rated Tools on Hugging Face

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-08-21

Improving Hugging Face Training Efficiency Through Packing with Flash Attention 2

Score 1

No feed summary available yet.

training

Open

Watchlist Matched: training

Modal · inference-infra · 2024-08-16

Inside the Modal code playground

Score 1

How we built an in-browser code playground using Modal Sandboxes.

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-08-16

Replicate Intelligence #11

Score 0

Fine tune FLUX.1, generative video games, a vision for the metaverse

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-08-15

Fine-tune FLUX.1 with your own images

Score 6

We've added fine-tuning (LoRA) support to FLUX.1 image generation models. You can train FLUX.1 on your own images with one line of code using Replicate's API.

inference fine-tuning api

Open

Watchlist Matched: generation, fine-tuning, lora, api

Hugging Face · open-source · 2024-08-14

A failed experiment: Infini-Attention, and why we should keep trying?

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-08-13

Introduction to ggml

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-08-12

Tool Use, Unified

Score 1

No feed summary available yet.

agents

Open

Watchlist Matched: tool use

Replicate · inference-infra · 2024-08-09

Replicate Intelligence #10

Score 0

Flux developments, Minecraft bot, Streamlit cookbook with Zeke

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-08-08

Score 6

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-06-21

Replicate Intelligence #5

Score 6

Score 6

Copy and paste a few commands into terminal to play with Stable Diffusion 3 on your own GPU-powered machine.

hardware

Open

Watchlist Matched: gpu

Replicate · inference-infra · 2024-06-14

Replicate Intelligence #4

Score 0

Find concepts in GPT models, real-time speech to text in the browser, H100s are coming

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-06-13

From DeepSpeed to FSDP and Back Again with Hugging Face Accelerate

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-06-12

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Modal · inference-infra · 2024-06-06

Score 1

Deep dive into ownership in Mojo

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-05-31

Replicate Intelligence #2

Score 6

Faster image generation, AI-powered world simulator, insights on AI dataset complexity

inference

Open

Watchlist Matched: generation

Hugging Face · open-source · 2024-05-31

The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare

Score 1

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-03-22

Total noob’s intro to Hugging Face Transformers

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-03-22

Binary and Scalar Embedding Quantization for Significantly Faster & Cheaper Retrieval

Score 1

No feed summary available yet.

quantization rag

Open

Watchlist Matched: quantization, retrieval

Hugging Face · open-source · 2024-03-20

🪆 Introduction to Matryoshka Embedding Models

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-02-21

Welcome Gemma - Google’s new open LLM

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-02-19

🤗 PEFT welcomes new merging methods

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-02-16

Synthetic data: save money, time and carbon with open source

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open source

Modal · inference-infra · 2024-02-15

Product updates: WebSocket support, interactive commands & more

Score 1

We've been busy in 2024 so far, bringing you WebSockets, interactive commands, H100s and more. Learn about what's new at Modal.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-02-14

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-01-18

Preference Tuning LLMs with Direct Preference Optimization Methods

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2024-01-14

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-12-06

SetFitABSA: Few-Shot Aspect Based Sentiment Analysis using SetFit

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Replicate · inference-infra · 2023-12-06

Score 0

We've added a CLI command that makes it easy to get started with Replicate.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-11-09

SDXL in 4 steps with Latent Consistency LoRAs

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Replicate · inference-infra · 2023-11-08

Generate music from chord progressions and text prompts with MusicGen-Chord

Score 6

We’ve added chord conditioning to Meta’s MusicGen model, so you can create automatic backing tracks in any style using text prompts and chord progressions.

model-release

Open

Watchlist Matched: model

Hugging Face · open-source · 2023-10-27

Personal Copilot: Train Your Own Coding Assistant

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-10-25

Interactively explore your Huggingface dataset with one line of code

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-10-24

The N Implementation Details of RLHF with PPO

Score 1

No feed summary available yet.

training

Open

Watchlist Matched: rlhf

Hugging Face · open-source · 2023-10-24

Exploring simple optimizations for SDXL

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-10-19

Gradio-Lite: Serverless Gradio Running Entirely in Your Browser

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Replicate · inference-infra · 2023-10-13

Fine-tune MusicGen to generate music in any style

Score 0

Score 1

No feed summary available yet.

evals

Open

Watchlist Matched: leaderboard

Hugging Face · open-source · 2023-09-15

We're cutting our prices in half

Score 0

The price of public models is being cut in half, and soon we'll start charging new users for setup and idle time on private models.

Open

Watchlist Matched: none

Replicate · inference-infra · 2023-08-14

A guide to prompting Llama 2

Score 0

Learn the art of the Llama prompt.

Open

Watchlist Matched: none

Replicate · inference-infra · 2023-08-14

Streaming output for language models

Score 0

Our API now supports server-sent event streams for language models. Learn how to use them to make your apps more responsive.

api

Open

Watchlist Matched: api

Hugging Face · open-source · 2023-08-10

Hugging Face Hub on the AWS Marketplace: Pay with your AWS Account

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-08-09

Optimizing Bark using 🤗 Transformers

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-08-09

Score 1

Hugging Face · open-source · 2023-06-22

Panel on Hugging Face

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-06-20

AI Policy @🤗: Response to the U.S. NTIA's Request for Comment on AI Accountability

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-06-19

Fine-Tune MMS Adapter Models for low-resource ASR

Score 1

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open source

Hugging Face · open-source · 2023-05-25

Optimizing Stable Diffusion for Intel CPUs with NNCF and 🤗 Optimum

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-05-24

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

Score 1

Score 1

No feed summary available yet.

api

Open

Watchlist Matched: api

Hugging Face · open-source · 2023-04-26

Running IF with 🧨 diffusers on a Free Tier Google Colab

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2023-04-26

Databricks ❤️ Hugging Face: up to 40% faster training and tuning of Large Language Models

Score 1

Fine-tune LLaMA to speak like Homer Simpson

Score 0

Score 1

Hugging Face · open-source · 2022-11-03

Fine-Tune Whisper For Multilingual ASR with 🤗 Transformers

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2022-11-02

Accelerate your models with 🤗 Optimum Intel and OpenVINO

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2022-10-13

🧨 Stable Diffusion in JAX / Flax !

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2022-10-05

Japanese Stable Diffusion

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2022-10-03

Very Large Language Models and How to Evaluate Them

Score 1

Score 0

The basics of using the API to create your own images from text.

api

Open

Watchlist Matched: api

Hugging Face · open-source · 2022-07-14

The Technology Behind BLOOM Training

Score 1

Hugging Face · open-source · 2021-11-30

Getting Started with Hugging Face Transformers for IPUs with Optimum

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2021-11-15

Fine-Tune XLSR-Wav2Vec2 for low-resource ASR with 🤗 Transformers

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2021-10-26

Course Launch Community Event

Score 2

No feed summary available yet.

model-release

Open

Watchlist Matched: launch

Hugging Face · open-source · 2021-10-26

Large Language Models: A New Moore's Law?

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2021-10-20

The Age of Machine Learning As Code Has Arrived

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2021-10-13

Fine tuning CLIP with Remote Sensing (Satellite) images and captions

Score 1

Score 1

No feed summary available yet.

Open

Watchlist Matched: none

Hugging Face · open-source · 2020-07-03

The Reformer - Pushing the limits of language modeling

Score 1

No feed summary available yet.

Open

Watchlist Matched: none