MLSys Radar

Replicate

Model hosting platform blog covering inference APIs, model optimization, GPUs, fine-tuning, and open model deployment workflows.

Country
Unknown
Category
inference-infra
Blog
https://replicate.com/blog
Feed
https://replicate.com/blog/rss
Feed discovery status
known

Replicate · inference-infra · 2026-02-18

Recraft V4: image generation with design taste

Score 8

Recraft V4 generates art-directed images — and actual editable SVGs — with strong composition, accurate text rendering, and what the Recraft team calls "design taste." Four models are available on Replicate now.

inference

Open

High signal Matched: generation

Replicate · inference-infra · 2025-07-07

Compare AI video models

Score 8

It's hard keeping up with every new video model. In this post we'll help you pick the best one for your needs.

model-release

Open

High signal Matched: model

Replicate · inference-infra · 2024-11-15

NVIDIA L40S GPUs are here

Score 8

NVIDIA L40S GPUs are here, with better performance and lower cost.

benchmark

Open

High signal Matched: performance, cost

Replicate · inference-infra · 2024-10-03

FLUX1.1 [pro] is here

Score 10

Black Forest Labs continue to push boundaries with their latest release of FLUX.1 image generation model.

inference model-release

Open

High signal Matched: generation, release, model

Replicate · inference-infra · 2024-06-12

H100s are coming to Replicate

Score 8

We'll soon support NVIDIA's H100 GPUs for predictions and training. Let us know if you want early access.

hardware training

Open

High signal Matched: h100, training

Replicate · inference-infra · 2024-06-12

Run Stable Diffusion 3 with an API

Score 8

Stable Diffusion 3 is the latest text-to-image model from Stability, with improved image quality, typography, prompt understanding, and resource efficiency. Learn how to run it in the cloud with one line of code.

model-release cloud api

Open

High signal Matched: model, cloud, api

Replicate · inference-infra · 2023-07-27

Run Llama 2 with an API

Score 8

Llama 2 is the first open source language model of the same caliber as OpenAI’s models. Learn how to run it in the cloud with one line of code.

model-release cloud api open-source

Open

High signal Matched: model, cloud, api, open source

Replicate · inference-infra · 2026-02-24

How to prompt Seedream 5.0

Score 6

Seedream 5.0 brings multi-step reasoning, example-based editing, and deep domain knowledge to image generation. Here's what you should know.

inference

Open

Watchlist Matched: generation

Replicate · inference-infra · 2025-11-25

Run FLUX.2 on Replicate

Score 6

FLUX.2 brings professional-grade image generation and editing with unprecedented detail, multi-reference support, and enterprise efficiency.

inference

Open

Watchlist Matched: generation

Replicate · inference-infra · 2025-11-20

How to prompt Nano Banana Pro

Score 6

Nano Banana Pro brings powerful new capabilities in image generation and editing. Here are the main prompt tricks you should know.

inference

Open

Watchlist Matched: generation

Replicate · inference-infra · 2025-10-16

How to prompt Veo 3.1

Score 6

Google's Veo 3.1 brings powerful new video generation capabilities including reference images, first/last frame control, and enhanced image-to-video. Here's everything you need to know.

inference

Open

Watchlist Matched: generation

Replicate · inference-infra · 2025-07-21

Generate consistent characters

Score 0

We compare the best image models for generating consistent characters from a single reference image.

Open

Watchlist Matched: none

Replicate · inference-infra · 2025-07-17

Bria is now on Replicate

Score 6

We've partnered with Bria to bring a suite of commercial-grade image generation and editing models to Replicate. Built entirely on licensed data, Bria’s tools are designed for enterprises and developers building safely with visual AI.

inference

Open

Watchlist Matched: generation

Replicate · inference-infra · 2025-07-01

The FLUX.1 Kontext hackathon

Score 0

We hosted a hackathon with BFL for FLUX.1 Kontext. Here were the winners.

Open

Watchlist Matched: none

Replicate · inference-infra · 2025-05-07

Ideogram 3.0 on Replicate

Score 0

Ideogram 3.0 is packed with powerful design, style transfer, and realism capabilities.

Open

Watchlist Matched: none

Replicate · inference-infra · 2025-05-06

Run MiniMax Speech-02 models with an API

Score 0

MiniMax's Speech-02 models give you high-quality text-to-speech with voice cloning, emotional expression, and multilingual support.

api

Open

Watchlist Matched: api

Replicate · inference-infra · 2025-04-16

Easel AI is now on Replicate

Score 0

Advanced face swap and AI avatars from Easel AI are now on Replicate.

Open

Watchlist Matched: none

Replicate · inference-infra · 2025-04-01

Stylized video with Wan2.1

Score 0

One of the most fun ways to use Wan2.1 is video style transfer. Learn how here.

Open

Watchlist Matched: none

Replicate · inference-infra · 2025-03-05

Wan2.1 parameter sweep

Score 6

We've been playing with Alibaba's WAN2.1 text-to-video model lately. What happens when you tweak those mysterious parameters? Let's find out.

model-release

Open

Watchlist Matched: model

Replicate · inference-infra · 2024-11-26

FLUX fine-tunes are now fast

Score 0

We've made running fine-tunes on Replicate much faster, and the optimizations are open-source.

open-source

Open

Watchlist Matched: open-source

Replicate · inference-infra · 2024-10-10

FLUX is fast and it's open source

Score 0

FLUX is now much faster on Replicate, and we’ve made our optimizations open-source so you can see exactly how they work and build upon them.

open-source

Open

Watchlist Matched: open-source, open source

Replicate · inference-infra · 2024-09-09

Fine-tune FLUX.1 with an API

Score 0

Create and run your own fine-tuned Flux models programmatically using Replicate's HTTP API.

api

Open

Watchlist Matched: api

Replicate · inference-infra · 2024-08-23

Replicate Intelligence #12

Score 0

Flux LoRAs, Hot Zuck, and Replicate on Lex Fridman

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-08-16

Replicate Intelligence #11

Score 0

Fine tune FLUX.1, generative video games, a vision for the metaverse

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-08-15

Fine-tune FLUX.1 with your own images

Score 6

We've added fine-tuning (LoRA) support to FLUX.1 image generation models. You can train FLUX.1 on your own images with one line of code using Replicate's API.

inference fine-tuning api

Open

Watchlist Matched: generation, fine-tuning, lora, api

Replicate · inference-infra · 2024-08-09

Replicate Intelligence #10

Score 0

Flux developments, Minecraft bot, Streamlit cookbook with Zeke

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-08-02

FLUX.1: First Impressions

Score 0

We explore FLUX.1's unique strengths and aesthetics to see what we can generate.

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-08-01

Run FLUX with an API

Score 6

FLUX.1 is a new text-to-image model from Black Forest Labs, the creators of Stable Diffusion, that exceeds the capabilities of previous open-source models.

model-release api open-source

Open

Watchlist Matched: model, api, open-source

Replicate · inference-infra · 2024-06-14

Replicate Intelligence #4

Score 0

Find concepts in GPT models, real-time speech to text in the browser, H100s are coming

Open

Watchlist Matched: none

Replicate · inference-infra · 2024-05-31

Replicate Intelligence #2

Score 6

Faster image generation, AI-powered world simulator, insights on AI dataset complexity

inference

Open

Watchlist Matched: generation

Replicate · inference-infra · 2024-05-24

Replicate Intelligence #1

Score 0

DIY Llama 3 implementation, open-source smart glasses, steering language models with dictionary learning

open-source

Open

Watchlist Matched: open-source

Replicate · inference-infra · 2023-11-23

How to run Yi chat models with an API

Score 6

The Yi series models are large language models trained from scratch by developers at 01.AI. Learn how to run them in the cloud with one line of code.

cloud api

Open

Watchlist Matched: cloud, api

Replicate · inference-infra · 2023-08-22

Painting with words: a history of text-to-image AI

Score 6

With the recent release of Stable Diffusion XL fine-tuning on Replicate, and today being the 1-year anniversary of Stable Diffusion, now feels like the perfect opportunity to take a step back and reflect on how text-to-image AI has improve...

model-release fine-tuning

Open

Watchlist Matched: release, fine-tuning

Replicate · inference-infra · 2023-08-16

We're cutting our prices in half

Score 0

The price of public models is being cut in half, and soon we'll start charging new users for setup and idle time on private models.

Open

Watchlist Matched: none

Replicate · inference-infra · 2023-08-14

Streaming output for language models

Score 0

Our API now supports server-sent event streams for language models. Learn how to use them to make your apps more responsive.

api

Open

Watchlist Matched: api

Replicate · inference-infra · 2023-08-08

Fine-tune SDXL with your own images

Score 0

We’ve added fine-tuning (Dreambooth, Textual Inversion and LoRA) support to SDXL 1.0. You can train SDXL on your own images with one line of code using the Replicate API.

fine-tuning api

Open

Watchlist Matched: fine-tuning, lora, api

Replicate · inference-infra · 2023-07-26

Run SDXL with an API

Score 0

How to run Stable Diffusion XL 1.0 using the Replicate API

api

Open

Watchlist Matched: api

Replicate · inference-infra · 2023-05-18

Status page

Score 0

We've added a status page to provide real-time updates on the health of Replicate.

Open

Watchlist Matched: none

Replicate · inference-infra · 2023-03-18

Week 3 of LLaMA 🦙

Score 0

A roundup of recent developments from the llamaverse.

Open

Watchlist Matched: none

Replicate · inference-infra · 2023-02-21

Machine learning needs better tools

Score 0

Lots of people want to build things with machine learning, but they don't have the expertise to use it.

Open

Watchlist Matched: none

Replicate · inference-infra · 2022-08-29

Run Stable Diffusion with an API

Score 0

How to use Replicate to integrate Stable Diffusion into hacks, apps, and projects

api

Open

Watchlist Matched: api

Replicate · inference-infra · 2022-08-11

Join us at Uncanny Spaces

Score 0

We're bringing people together to explore what's being created with machine learning.

Open

Watchlist Matched: none

Replicate · inference-infra · 2022-08-05

Automating image collection

Score 0

Using CLIP and LAION5B to collect thousands of captioned images.

Open

Watchlist Matched: none

Replicate · inference-infra · 2022-05-27

Constraining CLIPDraw

Score 0

An introduction to differentiable programming and the process of refining generative art models.

Open

Watchlist Matched: none

Replicate · inference-infra · 2022-05-16

Hello, world!

Score 0

We're a small team of engineers and machine learning enthusiasts working to make machine learning more accessible.

Open

Watchlist Matched: none