open-source - MLSys Blogs

Fireworks AI · inference-infra · 2026-06-03

Introducing Fireworks on Microsoft Foundry: Bringing Best-in-Class Open Model inference to Azure

Score 15

No feed summary available yet.

inference model-release open-source

Open

High signal Matched: inference, model, open model

PyTorch Foundation · open-source · 2026-05-27

Alibaba Cloud Joins the PyTorch Foundation as a Platinum Member

Score 11

The PyTorch Foundation, a community-driven hub for open source AI under the Linux Foundation, is announcing today that Alibaba Cloud has joined as a Platinum member. Alibaba Cloud is a...

cloud open-source

Open

High signal Matched: cloud, open source

LMCache · open-source · 2026-05-27

When Open Source Meets Open Source: A Joint Effort Between LMCache and Mooncake

Score 11

A collaboration story about LMCache multiprocess mode + MooncakeStore — From 0 to 1, from functional to optimized. 1. Before We Begin Recently, the LMCache community and the Mooncake community carried out a series of valuable open-source c...

kv-cache fine-tuning open-source

Open

High signal Matched: lmcache, adapter, open-source, open source

Lambda · cloud · 2026-05-22

DeepSeek V4: the most expected open-source model ever released, and the quietest landing

Score 18

After 15 months of incremental updates, leaks, and rumored leaks, DeepSeek released version 4. It arrived without the fanfare R1 and R1-preview commanded in early 2025. That quiet reception is the most interesting thing about the release....

inference serving benchmark model-release open-source

Open

High signal Matched: inference, serving, performance, cost, release, model, open-source

AMD ROCm Blogs · hardware · 2026-05-22

From Build to Benchmark: ONNX Model Serving with Triton Inference Server on AMD GPUs

Score 30

Triton Inference Server is an open-source platform designed to streamline AI inferencing. It supports the deployment, scaling, and inference of trained models from multiple frameworks, including ONNX Runtime, TensorFlow, PyTorch, and other...

inference serving kernel triton benchmark model-release cloud open-source

Open

High signal Matched: inference, inferencing, serving, triton, benchmark, model, cloud, open-source

AMD ROCm Blogs · hardware · 2026-05-20

ROCm 7.13: Expanding Hardware, Tools, and Reach

Score 14

AMD released ROCm Core 7.13, the AMD GPU Driver 31.30, and AMD GPU Virtualization 9.0. With these releases, ROCm software expands hardware support across enterprise datacenters. The platform introduces AMD’s latest Instinct accelerators, e...

benchmark hardware open-source

Open

High signal Matched: performance, gpu, rocm, open-source

Microsoft Research · big-tech · 2026-05-14

mimalloc: A new, high-performance, scalable memory allocator for the modern era

Score 8

mimalloc is an open-source, modern, scalable memory allocator that is a drop-in replacement for malloc and free. It is relatively small (~12K lines), with clear internal data structures, and is easy to build and integrate into other projec...

benchmark research open-source

Open

High signal Matched: performance, research, open-source

NVIDIA Technical Blog · hardware · 2026-04-28

NVIDIA Nemotron 3 Nano Omni Powers Multimodal Agent Reasoning in a Single Efficient Open Model

Score 16

Agentic systems often reason across screens, documents, audio, video, and text within a single perception‑to‑action loop. However, they still rely on...

model-release agents open-source

Open

High signal Matched: model, open model, agent, agentic

Together AI · inference-infra · 2026-04-28

Together AI Brings NVIDIA Nemotron 3 Nano Omni to Developers on Day 0

Score 12

NVIDIA Nemotron 3 Nano Omni is now on Together AI: a single open model that reasons across video, images, audio, and text, built for agentic workloads at scale.

model-release agents open-source

Open

High signal Matched: model, open model, agentic

Nota AI · korea · 2026-04-22

[Deep Dive: NetsPresso®] From Quantization to Graph Optimization: A Step-by-Step Model Deployment Pipeline

Score 54

  Jaehoon Lee Technical Content Manager, Nota AI   Series Notice: NetsPresso® Technical Blog, Part 2In Part 1, we walked through a scenario of deploying Llama 3.2 1B on an edge device to illustrate the NetsPresso® workflow. The f...

inference kernel cuda benchmark hardware model-release research korea training quantization evals api open-source

Open

High signal Matched: inference, kernel, cuda, matmul, benchmark, performance, latency, cost, npu, model, weights, paper, research, evaluation, furiosa, training, quantization, int8, int4, awq, gptq, sdk, open-source

LMCache · open-source · 2026-04-18

LMCache: A Journey

Score 12

GTC wrapped up a month ago. Our open-source KV cache management library, LMCache, was shown in Jensen Huang’s keynote, was spotlighted by NVIDIA SVP Kevin Deierling, I was invited to speak at the first-ever industry KV cache tutorial...

kv-cache open-source

Open

High signal Matched: kv cache, lmcache, open-source

SqueezeBits · korea · 2026-04-14

Recap: 2nd vLLM Korea Meetup 2026

Score 12

Check out highlights from the 2nd vLLM Korea Meetup! open-source use cases and real-world production examples that showcase vLLM's technical maturity!

korea open-source

Open

High signal Matched: korea, open-source

NVIDIA Technical Blog · hardware · 2026-04-09

Running Large-Scale GPU Workloads on Kubernetes with Slurm

Score 12

Slurm is an open source cluster management and job scheduling system for Linux. It manages job scheduling for over 65% of TOP500 systems. Most organizations...

hardware open-source

Open

High signal Matched: gpu, open source

AI2 · research · 2026-04-07

Introducing WildDet3D: Open-world 3D detection from a single image

Score 12

WildDet3D is an open model that predicts 3D bounding boxes from a single image. It generalizes across cameras and object categories, and folds in depth signals when available—alongside a new dataset of verified 3D annotations.

model-release open-source

Open

High signal Matched: introducing, model, open model

vLLM Project · open-source · 2026-04-02

Announcing Gemma 4 on vLLM: Byte for byte, the most capable open models

Score 16

With the debut of Gemma 4, vLLM introduces immediate support for Google's most sophisticated open model lineup, spanning multiple hardware backends, with first-ever Day 0 support on Google TPUs,...

model-release open-source

Open

High signal Matched: model, open model

Together AI · inference-infra · 2026-03-31

Aurora

Score 12

1.25x over a well-trained static speculator. Aurora is an open-source RL framework that turns speculative decoding from a one-time offline setup into a self-improving system that learns from every request it serves.

inference speculative-decoding open-source

Open

High signal Matched: decoding, speculative decoding, open-source

Nota AI · korea · 2026-03-23

[GTC 2026 Recap] The Trillion-Dollar Inference Race Begins: How Nota AI Fills the Gap

Score 42

  Jaehoon Lee Technical Content Manager, Nota AI   GTC has evolved far beyond a technology conference, drawing attention from global economies and financial markets alike. This year, CEO Jensen Huang took the stage in his tradema...

inference serving kernel cuda kv-cache benchmark hardware model-release research cloud training long-context agents open-source

Open

High signal Matched: inference, prefill, generation, throughput, cuda, kv cache, performance, latency, cost, gpu, npu, launch, model, research, cloud, training, long-context, context window, agent, agents, agentic, open-source

Together AI · inference-infra · 2026-03-17

Mamba-3

Score 10

Meet Mamba-3: the SSM built for inference. Faster than Transformers at decode, stronger than Mamba-2, and open-source from day one.

inference open-source

Open

High signal Matched: inference, open-source

Nota AI · korea · 2026-03-13

NotaMoEQuantization: An MoE-Specific Quantization Method for Solar-Open-100B

Score 62

  Hancheol Park, Ph. D. AI Research Engineer, Nota AI Tairen PiaoAI Research Engineer, Nota AI Tae-Ho KimCTO & Co-Founder, Nota AI ✔️ Resource : The official quantized model of Solar-Open-100B, which passed the first round of Sout...

inference serving moe benchmark hardware model-release research korea training quantization evals long-context open-source

Open

High signal Matched: inference, serving, prefill, generation, throughput, moe, router, benchmark, performance, latency, ttft, tpot, blackwell, release, model, weights, open model, research, evaluation, korea, korean, upstage, training, post-training, quantization, quantized, int4, evaluate, benchmarks, mmlu, long-context

Together AI · inference-infra · 2026-03-02

Introducing Together AI’s new look

Score 14

We've refreshed our visual identity — designed with Pentagram to express how Together AI connects open-source innovation, systems research, and builders to unlock new possibilities.

model-release research open-source

Open

High signal Matched: introducing, research, open-source

Together AI · inference-infra · 2026-02-02

Fine-tuning open LLM judges to outperform GPT-5.2

Score 14

Fine-tuned open-source LLM judges can outperform GPT-5.2 at evaluating model outputs. Using Direct Preference Optimization on just 5,400 preference pairs, we trained GPT-OSS 120B to beat GPT-5.2 on human preference alignment—at 15x lower c...

inference benchmark model-release fine-tuning evals open-source

Open

High signal Matched: inference, cost, model, fine-tuning, evaluating, open-source, oss

Together AI · inference-infra · 2026-02-02

Together Evaluations now supports comparing top commercial APIs vs. open source models

Score 12

Together Evaluations now supports OpenAI, Anthropic, and Google models for cross-provider benchmarking. Compare open-source, fine-tuned, and proprietary models side-by-side to make data-driven decisions on quality, cost, and performance—al...

benchmark open-source

Open

High signal Matched: performance, cost, open-source, open source

vLLM Project · open-source · 2026-02-01

GPT-OSS Performance Optimizations on NVIDIA Blackwell: Pushing the Pareto Frontier

Score 18

TL;DR: In collaboration with the open-source community, vLLM + NVIDIA has achieved significant performance milestones on the gpt-oss-120b model running on NVIDIA's Blackwell GPUs. Through deep...

benchmark hardware model-release open-source

Open

High signal Matched: performance, blackwell, model, open-source, oss

Together AI · inference-infra · 2026-01-26

DSGym: A holistic framework for evaluating and training data science agents

Score 18

Introducing DSGym—a holisti evaluation and training framework for LLM-based data science agents. Features 90+ bioinformatics tasks, 92 Kaggle competitions, and synthetic trajectory generation. Our 4B model achieves state-of-the-art perform...

inference benchmark model-release research training evals agents open-source

Open

High signal Matched: generation, performance, introducing, model, evaluation, training, evaluating, agents, open-source

Together AI · inference-infra · 2026-01-08

How to choose the right open model for production

Score 20

Learn how to choose the right open-source model for production by evaluating model quality, benchmarking performance, and deploying open models that balance cost, speed, and accuracy.

benchmark model-release evals open-source

Open

High signal Matched: performance, cost, model, open model, evaluating, open-source

Together AI · inference-infra · 2025-12-01

Together AI delivers fastest inference for the top open-source models

Score 20

Together AI achieves up to 2x faster inference for top open-source models like Qwen, DeepSeek, and Kimi through GPU optimization, advanced speculative decoding, and FP4 quantization—ranking #1 in speed benchmarks on NVIDIA Blackwell archit...

inference speculative-decoding hardware quantization evals open-source

Open

High signal Matched: inference, decoding, speculative decoding, gpu, blackwell, quantization, benchmarks, open-source

Together AI · inference-infra · 2025-11-04

Announcing the fastest inference for realtime voice AI agents

Score 14

Together AI launches the fastest voice AI stack: streaming Whisper STT, serverless open-source TTS (Orpheus & Kokoro), and Voxtral transcription. Sub-second latency for production voice agents.

inference benchmark agents open-source

Open

High signal Matched: inference, latency, agents, open-source

Hugging Face · open-source · 2025-10-16

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face

Score 10

No feed summary available yet.

cloud open-source

Open

High signal Matched: cloud, oss

Together AI · inference-infra · 2025-08-19

Transform OpenAI gpt-oss Models into Domain Experts with Together AI Fine-Tuning

Score 10

Customize OpenAI’s gpt-oss-20B/120B with Together AI’s fine-tuning: train, optimize, and instantly deploy domain experts with enterprise reliability and cost efficiency.

benchmark fine-tuning open-source

Open

High signal Matched: cost, fine-tuning, oss

Together AI · inference-infra · 2025-08-15

Fine-Tuning Small Open-Source LLMs to Outperform Large Closed-Source Models by 60% on Specialized Tasks

Score 12

Parsed fine-tuned a 27B open-source model to beat Claude Sonnet 4 by 60% on a real-world healthcare task—while running 10–100x cheaper.

model-release fine-tuning open-source

Open

High signal Matched: model, fine-tuning, open-source

SkyPilot · open-source · 2025-08-12

Self-host open-source LLM agent sandbox on your own cloud

Score 10

Your AI writes code. Now what? If you’re building AI agents in 2025, you probably wondered that as well. Your LLM generates some Python code that analyzes data, manipulates files, or calls APIs. But where does it run? Most people eit...

cloud agents open-source

Open

High signal Matched: cloud, agent, agents, open-source

Together AI · inference-infra · 2025-08-05

Announcing the Availability of OpenAI's Open Models on Together AI

Score 12

Access OpenAI’s gpt-oss-120B on Together AI: Apache-2.0 open-weight model with serverless & dedicated endpoints, $0.50/1M in, $1.50/1M out, 99.9% SLA.

model-release open-source

Open

High signal Matched: model, oss

Hugging Face · open-source · 2025-08-05

Welcome GPT OSS, the new open-source model family from OpenAI!

Score 10

No feed summary available yet.

model-release open-source

Open

High signal Matched: model, open-source, oss

Together AI · inference-infra · 2025-07-28

Together Evaluations: Benchmark Models for Your Tasks

Score 16

Together Evaluations is a flexible framework for benchmarking LLMs using strong open-source models as judges. Skip manual labeling and rigid metrics—get fast, customizable insights into model quality for your specific tasks.

benchmark model-release open-source

Open

High signal Matched: benchmark, model, open-source

Together AI · inference-infra · 2025-07-17

Together AI Delivers Top Speeds for DeepSeek-R1-0528 Inference on NVIDIA Blackwell

Score 18

Together AI inference is now among the world’s fastest, most capable platforms for running open-source reasoning models like DeepSeek-R1 at scale, thanks to our new inference engine designed for NVIDIA HGX B200.

inference hardware open-source

Open

High signal Matched: inference, b200, blackwell, open-source

Together AI · inference-infra · 2025-07-14

Kimi K2: Leading Open-Source Model Now Available on Together AI

Score 16

Run Kimi K2 (1T params) on Together AI—frontier open model for agentic reasoning and coding, serverless deployment, 99.9% SLA, lower cost and instant scaling.

benchmark model-release agents open-source

Open

High signal Matched: cost, model, open model, agentic, open-source

llm-d · open-source · 2025-05-20

llm-d Press Release

Score 20

Red Hat launches llm-d: Open source distributed AI inference platform backed by NVIDIA, Google Cloud, IBM. Scale generative AI with intelligent routing on Kubernetes.

inference distributed model-release cloud open-source

Open

High signal Matched: inference, distributed, release, cloud, open source

Nota AI · korea · 2025-05-07

Efficient LLaMA-3.2-Vision by Trimming Cross-attended Visual Features</span#x3E;

Score 28

inference kv-cache benchmark model-release research training evals open-source

Open

High signal Matched: inference, generation, kv cache, benchmark, performance, latency, model, weights, research, training, benchmarks, open-source

SqueezeBits · korea · 2025-03-26

TensorRT-LLM Goes Open Source!

Score 12

With TensorRT-LLM now open source, we can finally take a deep dive into the secret sauce behind its impressive performance.

benchmark open-source

Open

High signal Matched: performance, open source

Replicate · inference-infra · 2025-03-05

Wan2.1: generate videos with an API

Score 10

Wan2.1 is the most capable open-source video generation model, producing coherent and high-quality outputs. Learn how to run it in the cloud with a single line of code.

inference model-release cloud api open-source

Open

High signal Matched: generation, model, cloud, api, open-source

AIBrix · open-source · 2025-02-21

Introducing AIBrix: Cost-Effective and Scalable Control Plane for vLLM

Score 26

Open-source large language models (LLMs) like LLaMA, Deepseek, Qwen and Mistral etc have surged in popularity, offering enterprises greater flexibility, cost savings, and control over their AI deployments. These models have empowered organ...

inference benchmark model-release agents open-source

Open

High signal Matched: inference, generation, latency, cost, introducing, model, agents, open-source

SqueezeBits · korea · 2025-02-10

The Missing Piece of TensorRT-LLM

Score 8

This article is about an open-source library for direct conversion of PyTorch models to TensorRT-LLM.

open-source

Open

High signal Matched: open-source

Hugging Face · open-source · 2024-12-10

LeMaterial: an open source initiative to accelerate materials discovery and research

Score 10

No feed summary available yet.

research open-source

Open

High signal Matched: research, open source

AIBrix · open-source · 2024-11-13

Introducing AIBrix v0.1.0: Building the Future of Scalable, Cost-Effective AI Infrastructure for Large Models

Score 32

In recent years, large language models (LLMs) have revolutionized AI applications, powering solutions in areas like chatbots, automated content generation, and advanced recommendation engines. Services like OpenAI’s have gained significant...

inference kv-cache benchmark hardware model-release cloud open-source

Open

High signal Matched: decoding, prefill, generation, kv cache, performance, cost, gpu, release, introducing, cloud, open-source

Nota AI · korea · 2024-08-02

Deploying an Efficient Vision-Language Model on Mobile Devices

Score 38

  Jaeyeon KimResearch Engineer, Nota AI Geonmin KimResearch Engineer, Nota AI Hancheol ParkTeam Lead of NetsPresso Application, Nota AI   IntroductionRecent large language models (LLMs) have demonstrated unprecedented performance...

inference benchmark model-release research cloud training fine-tuning evals open-source

Open

High signal Matched: decoding, benchmark, performance, latency, tokens/sec, model, arxiv, research, technical report, evaluation, cloud, training, lora, benchmarks, leaderboard, open-source

Replicate · inference-infra · 2024-07-23

Run Meta Llama 3.1 405B with an API

Score 8

Llama 3.1 405B: is the most powerful open-source language model from Meta. Learn how to run it in the cloud with one line of code.

model-release cloud api open-source

Open

High signal Matched: model, cloud, api, open-source

Replicate · inference-infra · 2024-04-23

Run Snowflake Arctic with an API

Score 8

Arctic is a new open-source language model from Snowflake. Learn how to run it in the cloud with one line of code.

model-release cloud api open-source

Open

High signal Matched: model, cloud, api, open-source

Replicate · inference-infra · 2024-01-30

Run Code Llama 70B with an API

Score 8

Code Llama 70B is one of the powerful open-source code generation models. Learn how to run it in the cloud with one line of code.

inference cloud api open-source

Open

High signal Matched: generation, cloud, api, open-source

Replicate · inference-infra · 2023-11-10

Using open-source models for faster and cheaper text embeddings

Score 10

An interactive example showing how to embed text using a state-of-the-art embedding model that beats OpenAI's embeddings API on price and performance.

benchmark model-release api open-source

Open

High signal Matched: performance, model, api, open-source

Replicate · inference-infra · 2023-10-06

How to run Mistral 7B with an API

Score 8

Mistral 7B is an open-source large language model. Learn what it's good at and how to run it in the cloud with one line of code.

model-release cloud api open-source

Open

High signal Matched: model, cloud, api, open-source

Replicate · inference-infra · 2023-07-27

Run Llama 2 with an API

Score 8

Llama 2 is the first open source language model of the same caliber as OpenAI’s models. Learn how to run it in the cloud with one line of code.

model-release cloud api open-source

Open

High signal Matched: model, cloud, api, open source

Replicate · inference-infra · 2023-07-19

What happened with Llama 2 in the last 24 hours? 🦙

Score 8

A roundup of recent developments from the llamaverse following the second major release of Meta's open-source large language model.

model-release open-source

Open

High signal Matched: release, model, open-source

Hugging Face · open-source · 2023-07-17

Open-Source Text Generation & LLM Ecosystem at Hugging Face

Score 10

No feed summary available yet.

inference open-source

Open

High signal Matched: generation, open-source

Replicate · inference-infra · 2023-04-21

Language model roundup, April 2023

Score 8

A roundup of recent developments from the world of open-source language models.

model-release open-source

Open

High signal Matched: model, open-source

Runpod · cloud · 2026-06-03

HubThe fastest way to deploy open-source AI.

Score 2

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Modal · inference-infra · 2026-06-01

Reinforcement learning is an infrastructure problem

Score 2

What we've seen helping teams run Reinforcement Learning at scale on Modal. Plus an open-source library to skip the scaffolding.

open-source

Open

Watchlist Matched: open-source

Together AI · inference-infra · 2026-05-14

Violin: An open-source video translation skill that breaks language barriers

Score 3

Violin is an open-source AI video translation tool that combines speech recognition, LLM translation, and text-to-speech to make video content accessible across languages.

open-source

Open

Watchlist Matched: open-source

Lambda · cloud · 2026-04-30

Creating highly efficient agents: 450M tool-calling tokens distilled for post-training from top open-source models

Score 4

Harnesses If you've used Claude Code or Codex, you've used a harness. A harness is the infrastructure layer that wraps an AI coding agent and decides how it operates, what it can touch, and how you measure whether it worked. It's how most...

hardware training agents open-source

Open

Watchlist Matched: gpu, training, post-training, agent, agents, open-source

NVIDIA Technical Blog · hardware · 2026-04-20

Maximizing Memory Efficiency to Run Bigger Models on NVIDIA Jetson

Score 3

The boom in open source generative AI models is pushing beyond data centers into machines operating in the physical world. Developers are eager to deploy these...

open-source

Open

Watchlist Matched: open source

Hugging Face · open-source · 2026-03-18

State of Open Source on Hugging Face: Spring 2026

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open source

Hugging Face · open-source · 2026-03-10

Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Hugging Face · open-source · 2026-02-04

The Future of the Global Open-Source AI Ecosystem: From DeepSeek to AI+

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Hugging Face · open-source · 2026-01-28

Architectural Choices in China's Open-Source AI Ecosystem: Building Beyond DeepSeek

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Hugging Face · open-source · 2026-01-27

Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective

Score 1

No feed summary available yet.

training agents open-source

Open

Watchlist Matched: training, agentic, oss

Hugging Face · open-source · 2025-12-04

We Got Claude to Fine-Tune an Open Source LLM

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open source

Hugging Face · open-source · 2025-10-24

LeRobot v0.4.0: Supercharging OSS Robot Learning

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: oss

Hugging Face · open-source · 2025-09-11

Tricks from OpenAI gpt-oss YOU 🫵 can use with transformers

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: oss

Together AI · inference-infra · 2025-08-11

OpenAI's New Open gpt-oss Models vs o4-mini: A Real-World Comparison

Score 3

No feed summary available yet.

open-source

Open

Watchlist Matched: oss

Hugging Face · open-source · 2025-08-05

Measuring Open-Source Llama Nemotron Models on DeepResearch Bench

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Replicate · inference-infra · 2025-07-31

Open source video is back

Score 6

Wan 2.2 is our fastest, cheapest video model.

model-release open-source

Open

Watchlist Matched: model, open source

Hugging Face · open-source · 2025-07-09

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Hugging Face · open-source · 2025-06-26

Gemma 3n fully available in the open-source ecosystem!

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Together AI · inference-infra · 2025-06-12

From Zero to One: Building An Autonomous and Open Data Scientist Agent from Scratch

Score 3

Build a data scientist agent using Together’s open-source models and Code Interpreter—easy to implement, solid benchmarks, and full code on GitHub.

evals agents open-source

Open

Watchlist Matched: benchmarks, agent, open-source

Together AI · inference-infra · 2025-05-28

Mixture-of-Agents Alignment: Harnessing the Collective Intelligence of Open-Source LLMs to Improve Post-Training

Score 3

No feed summary available yet.

training agents open-source

Open

Watchlist Matched: training, post-training, agents, open-source

Modular · inference-infra · 2025-05-06

Modular Platform 25.3: 450K+ Lines of Open Source Code and pip Packaging

Score 1

Modular Platform 25.3: 450K+ Lines of Open Source Code and pip Packaging

open-source

Open

Watchlist Matched: open source

Hugging Face · open-source · 2025-04-14

Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition 🤖

Score 0

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Hugging Face · open-source · 2025-03-11

LeRobot goes to driving school: World’s largest open-source self-driving dataset

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open-source

Hugging Face · open-source · 2025-02-04

Open-source DeepResearch – Freeing our search agents

Score 1

No feed summary available yet.

agents open-source

Open

Watchlist Matched: agents, open-source

Replicate · inference-infra · 2025-01-24

You can now fine-tune open-source video models

Score 0

Train your own versions of Tencent's HunyuanVideo for style, motion, and characters on Replicate.

open-source

Open

Watchlist Matched: open-source

Hugging Face · open-source · 2024-12-02

Open Source Developers Guide to the EU AI Act

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open source

Replicate · inference-infra · 2024-11-26

FLUX fine-tunes are now fast

Score 0

We've made running fine-tunes on Replicate much faster, and the optimizations are open-source.

open-source

Open

Watchlist Matched: open-source

Replicate · inference-infra · 2024-10-10

FLUX is fast and it's open source

Score 0

FLUX is now much faster on Replicate, and we’ve made our optimizations open-source so you can see exactly how they work and build upon them.

open-source

Open

Watchlist Matched: open-source, open source

Replicate · inference-infra · 2024-08-02

Replicate Intelligence #9

Score 6

Open source frontier image model, cut objects from videos, new Python web framework from Jeremy Howard

model-release open-source

Open

Watchlist Matched: model, open source

Replicate · inference-infra · 2024-08-01

Run FLUX with an API

Score 6

FLUX.1 is a new text-to-image model from Black Forest Labs, the creators of Stable Diffusion, that exceeds the capabilities of previous open-source models.

model-release api open-source

Open

Watchlist Matched: model, api, open-source

Modular · inference-infra · 2024-07-23

Announcing stack-pr: an open source tool for managing stacked PRs on GitHub

Score 1

Announcing stack-pr: an open source tool for managing stacked PRs on GitHub

open-source

Open

Watchlist Matched: open source

Replicate · inference-infra · 2024-05-24

Replicate Intelligence #1

Score 0

DIY Llama 3 implementation, open-source smart glasses, steering language models with dictionary learning

open-source

Open

Watchlist Matched: open-source

Modular · inference-infra · 2024-04-02

What’s new in Mojo 24.2: Mojo Nightly, Enhanced Python Interop, OSS stdlib and more

Score 1

What’s new in Mojo 24.2: Mojo Nightly, Enhanced Python Interop, OSS stdlib and more

open-source

Open

Watchlist Matched: oss

Modular · inference-infra · 2024-03-28

The Next Big Step in Mojo🔥 Open Source

Score 1

The Next Big Step in Mojo🔥 Open Source

open-source

Open

Watchlist Matched: open source

Modal · inference-infra · 2024-03-26

How Ramp automated receipt processing with fine-tuned LLMs

Score 1

Find out how Ramp uses Modal to customize open source LLMs to automate receipt processing.

open-source

Open

Watchlist Matched: open source

Hugging Face · open-source · 2024-02-16

Synthetic data: save money, time and carbon with open source

Score 1

No feed summary available yet.

open-source

Open

Watchlist Matched: open source

Hugging Face · open-source · 2024-01-24

Open-source LLMs as LangChain Agents

Score 1

No feed summary available yet.

agents open-source

Open

Watchlist Matched: agents, open-source

Replicate · inference-infra · 2023-12-06

Clone your voice using open-source models

Score 0

We’ve added fine-tuning for realistic voice cloning (RVC). You can train RVC on your own dataset from a YouTube video with a few lines of code using Replicate's API.

fine-tuning api open-source

Open

Watchlist Matched: fine-tuning, api, open-source

Replicate · inference-infra · 2023-12-05