Model hosting platform blog covering inference APIs, model optimization, GPUs, fine-tuning, and open model deployment workflows.
Replicate · inference-infra · 2026-02-18
Score 8
Recraft V4 generates art-directed images — and actual editable SVGs — with strong composition, accurate text rendering, and what the Recraft team calls "design taste." Four models are available on Replicate now.
High signal Matched: generation
Replicate · inference-infra · 2025-09-23
Score 8
Here is the ultimate comparison post on all the latest image editing models.
High signal Matched: model
Replicate · inference-infra · 2025-09-17
Score 8
Find the best models and collections with a single API call.
High signal Matched: introducing, api
Replicate · inference-infra · 2025-09-08
Score 8
Cache your compiled models for faster boot and inference times
High signal Matched: inference
Replicate · inference-infra · 2025-07-07
Score 8
It's hard keeping up with every new video model. In this post we'll help you pick the best one for your needs.
High signal Matched: model
Replicate · inference-infra · 2025-05-22
Score 8
Google's flagship image generation model, Imagen 4, is now available for you to try on Replicate. Create images with fine detail, versatile styles, and improved typography.
High signal Matched: generation, model
Replicate · inference-infra · 2025-05-16
Score 12
NVIDIA H100 GPUs are here, with better performance and lower cost.
High signal Matched: performance, cost, h100
Replicate · inference-infra · 2025-03-05
Score 10
Wan2.1 is the most capable open-source video generation model, producing coherent and high-quality outputs. Learn how to run it in the cloud with a single line of code.
High signal Matched: generation, model, cloud, api, open-source
Replicate · inference-infra · 2024-11-15
Score 8
NVIDIA L40S GPUs are here, with better performance and lower cost.
High signal Matched: performance, cost
Replicate · inference-infra · 2024-10-22
Score 8
We've partnered with Ideogram to bring their inpainting model to Replicate's API.
High signal Matched: model, api
Replicate · inference-infra · 2024-10-03
Score 10
Black Forest Labs continue to push boundaries with their latest release of the FLUX.1 image generation model.
High signal Matched: generation, release, model
Replicate · inference-infra · 2024-07-23
Score 8
Llama 3.1 405B is the most powerful open-source language model from Meta. Learn how to run it in the cloud with one line of code.
High signal Matched: model, cloud, api, open-source
Replicate · inference-infra · 2024-06-14
Score 8
Create your own custom version of Stability's latest image generation model and run it on Replicate via the web or API.
High signal Matched: generation, model, api
Replicate · inference-infra · 2024-06-12
Score 8
We'll soon support NVIDIA's H100 GPUs for predictions and training. Let us know if you want early access.
High signal Matched: h100, training
Replicate · inference-infra · 2024-06-12
Score 8
Stable Diffusion 3 is the latest text-to-image model from Stability, with improved image quality, typography, prompt understanding, and resource efficiency. Learn how to run it in the cloud with one line of code.
High signal Matched: model, cloud, api
Replicate · inference-infra · 2024-04-23
Score 8
Arctic is a new open-source language model from Snowflake. Learn how to run it in the cloud with one line of code.
High signal Matched: model, cloud, api, open-source
Replicate · inference-infra · 2024-04-18
Score 8
Llama 3 is the latest language model from Meta. Learn how to run it in the cloud with one line of code.
High signal Matched: model, cloud, api
Replicate · inference-infra · 2024-01-30
Score 8
Code Llama 70B is a powerful open-source code generation model. Learn how to run it in the cloud with one line of code.
High signal Matched: generation, cloud, api, open-source
Replicate · inference-infra · 2023-11-10
Score 10
An interactive example showing how to embed text using a state-of-the-art embedding model that beats OpenAI's embeddings API on price and performance.
High signal Matched: performance, model, api, open-source
Replicate · inference-infra · 2023-10-25
Score 8
How to run a latent consistency model on your M1 or M2 Mac
High signal Matched: model
Replicate · inference-infra · 2023-10-17
Score 10
In this post we'll explore the basics of retrieval augmented generation by creating an example app that uses bge-large-en for embeddings, ChromaDB for the vector store, and mistral-7b-instruct for language model generation.
High signal Matched: generation, model, retrieval augmented generation, retrieval
Replicate · inference-infra · 2023-10-06
Score 8
Mistral 7B is an open-source large language model. Learn what it's good at and how to run it in the cloud with one line of code.
High signal Matched: model, cloud, api, open-source
Replicate · inference-infra · 2023-07-27
Score 8
Llama 2 is the first open source language model of the same caliber as OpenAI’s models. Learn how to run it in the cloud with one line of code.
High signal Matched: model, cloud, api, open source
Replicate · inference-infra · 2023-07-19
Score 8
A roundup of recent developments from the llamaverse following the second major release of Meta's open-source large language model.
High signal Matched: release, model, open-source
Replicate · inference-infra · 2023-05-26
Score 8
Prompt engineering and training are often the first solutions we reach for to improve language model behavior, but they're not the only way.
High signal Matched: model, training
Replicate · inference-infra · 2023-04-21
Score 8
A roundup of recent developments from the world of open-source language models.
High signal Matched: model, open-source
Replicate · inference-infra · 2023-03-23
Score 8
No feed summary available yet.
High signal Matched: model, lora
Replicate · inference-infra · 2023-02-07
Score 10
It's like DreamBooth, but much faster. And you can run it in the cloud on Replicate.
High signal Matched: introducing, cloud, lora
Replicate · inference-infra · 2022-11-21
Score 10
With just a handful of images and a single API call, you can train a model, publish it to Replicate, and run predictions on it in the cloud.
High signal Matched: model, cloud, api
Replicate · inference-infra · 2022-08-31
Score 8
How to run Stable Diffusion locally so you can hack on it
High signal Matched: gpu
Replicate · inference-infra · 2022-07-05
Score 8
Inspired by model cards, we've created templates for documenting models on Replicate.
High signal Matched: model
Replicate · inference-infra · 2026-04-15
Score 6
If you have never tried a video model before, now is the time.
Watchlist Matched: model
Replicate · inference-infra · 2026-02-24
Score 6
Seedream 5.0 brings multi-step reasoning, example-based editing, and deep domain knowledge to image generation. Here's what you should know.
Watchlist Matched: generation
Replicate · inference-infra · 2025-11-26
Score 6
Isaac 0.1 is a lightweight, grounded vision-language model built for real-world perception.
Watchlist Matched: model
Replicate · inference-infra · 2025-11-25
Score 6
FLUX.2 brings professional-grade image generation and editing with unprecedented detail, multi-reference support, and enterprise efficiency.
Watchlist Matched: generation
Replicate · inference-infra · 2025-11-20
Score 6
Nano Banana Pro brings powerful new capabilities in image generation and editing. Here are the main prompt tricks you should know.
Watchlist Matched: generation
Replicate · inference-infra · 2025-11-19
Score 0
Generate game assets, sprites, tiles, and pixel art with Retro Diffusion's suite of carefully crafted models.
Watchlist Matched: none
Replicate · inference-infra · 2025-11-17
Score 0
No feed summary available yet.
Watchlist Matched: none
Replicate · inference-infra · 2025-10-21
Score 0
Turn whole documents into markdown or grab line-level polygons with two new models from Datalab.
Watchlist Matched: none
Replicate · inference-infra · 2025-10-16
Score 6
Google's Veo 3.1 brings powerful new video generation capabilities including reference images, first/last frame control, and enhanced image-to-video. Here's everything you need to know.
Watchlist Matched: generation
Replicate · inference-infra · 2025-10-02
Score 0
No feed summary available yet.
Watchlist Matched: none
Replicate · inference-infra · 2025-08-10
Score 0
Use our MCP to discover, compare, and run models from apps like Claude, Cursor, and VS Code.
Watchlist Matched: mcp
Replicate · inference-infra · 2025-08-01
Score 0
You'll be surprised what you can do with AI video now.
Watchlist Matched: none
Replicate · inference-infra · 2025-07-31
Score 6
Wan 2.2 is our fastest, cheapest video model.
Watchlist Matched: model, open source
Replicate · inference-infra · 2025-07-21
Score 0
We compare the best image models for generating consistent characters from a single reference image.
Watchlist Matched: none
Replicate · inference-infra · 2025-07-17
Score 6
We've partnered with Bria to bring a suite of commercial-grade image generation and editing models to Replicate. Built entirely on licensed data, Bria’s tools are designed for enterprises and developers building safely with visual AI.
Watchlist Matched: generation
Replicate · inference-infra · 2025-07-16
Score 0
A deep-dive into the Taylor Seer optimization technique
Watchlist Matched: none
Replicate · inference-infra · 2025-07-01
Score 0
We hosted a hackathon with BFL for FLUX.1 Kontext. Here are the winners.
Watchlist Matched: none
Replicate · inference-infra · 2025-06-10
Score 0
Learn expert prompting techniques to create stunning videos with Google's Veo 3.
Watchlist Matched: none
Replicate · inference-infra · 2025-06-06
Score 6
We're sharing our experiments and tips on Google's new Veo 3 model.
Watchlist Matched: model
Replicate · inference-infra · 2025-06-02
Score 0
FLUX.1 Kontext is everywhere - see what folks are cooking.
Watchlist Matched: none
Replicate · inference-infra · 2025-05-29
Score 6
This is how to get the most from Black Forest Labs' new image editing model.
Watchlist Matched: model
Replicate · inference-infra · 2025-05-22
Score 0
OpenAI's latest models are now available on Replicate, including GPT-4.1, GPT-4o, and the o-series.
Watchlist Matched: none
Replicate · inference-infra · 2025-05-15
Score 6
We've partnered with Hugging Face to bring Replicate inference to their platform.
Watchlist Matched: inference
Replicate · inference-infra · 2025-05-07
Score 0
Ideogram 3.0 is packed with powerful design, style transfer, and realism capabilities.
Watchlist Matched: none
Replicate · inference-infra · 2025-05-06
Score 0
MiniMax's Speech-02 models give you high-quality text-to-speech with voice cloning, emotional expression, and multilingual support.
Watchlist Matched: api
Replicate · inference-infra · 2025-04-16
Score 0
Advanced face swap and AI avatars from Easel AI are now on Replicate.
Watchlist Matched: none
Replicate · inference-infra · 2025-04-01
Score 0
One of the most fun ways to use Wan2.1 is video style transfer. Learn how here.
Watchlist Matched: none
Replicate · inference-infra · 2025-03-28
Score 0
We take a quick look at the latest creative models, experiments, and community projects.
Watchlist Matched: lora
Replicate · inference-infra · 2025-03-05
Score 6
We've been playing with Alibaba's Wan2.1 text-to-video model lately. What happens when you tweak those mysterious parameters? Let's find out.
Watchlist Matched: model
Replicate · inference-infra · 2025-01-24
Score 0
Train your own versions of Tencent's HunyuanVideo for style, motion, and characters on Replicate.
Watchlist Matched: open-source
Replicate · inference-infra · 2025-01-17
Score 0
Create AI videos with a convenient workflow.
Watchlist Matched: none
Replicate · inference-infra · 2024-12-16
Score 0
There are lots of models that are as good as OpenAI's Sora now.
Watchlist Matched: none
Replicate · inference-infra · 2024-11-26
Score 0
We've made running fine-tunes on Replicate much faster, and the optimizations are open-source.
Watchlist Matched: open-source
Replicate · inference-infra · 2024-11-21
Score 6
A new set of image generation capabilities for FLUX models, including inpainting, outpainting, canny edge detection, and depth maps.
Watchlist Matched: generation
Replicate · inference-infra · 2024-10-22
Score 6
Stability AI's latest text-to-image model is now available on Replicate and you can run it with an API.
Watchlist Matched: model, api
Replicate · inference-infra · 2024-10-10
Score 0
FLUX is now much faster on Replicate, and we’ve made our optimizations open-source so you can see exactly how they work and build upon them.
Watchlist Matched: open-source, open source
Replicate · inference-infra · 2024-09-20
Score 0
It's easy to fine-tune Flux, but sometimes you need to do a little more work to get the best results. This post covers techniques you can use to improve your fine-tuned Flux models.
Watchlist Matched: training
Replicate · inference-infra · 2024-09-09
Score 0
Create and run your own fine-tuned Flux models programmatically using Replicate's HTTP API.
Watchlist Matched: api
Replicate · inference-infra · 2024-08-30
Score 6
Create your own fine-tuned Flux model to generate new images of yourself.
Watchlist Matched: model
Replicate · inference-infra · 2024-08-23
Score 0
Flux LoRAs, Hot Zuck, and Replicate on Lex Fridman
Watchlist Matched: none
Replicate · inference-infra · 2024-08-16
Score 0
Fine-tune FLUX.1, generative video games, a vision for the metaverse
Watchlist Matched: none
Replicate · inference-infra · 2024-08-15
Score 6
We've added fine-tuning (LoRA) support to FLUX.1 image generation models. You can train FLUX.1 on your own images with one line of code using Replicate's API.
Watchlist Matched: generation, fine-tuning, lora, api
Replicate · inference-infra · 2024-08-09
Score 0
Flux developments, Minecraft bot, Streamlit cookbook with Zeke
Watchlist Matched: none
Replicate · inference-infra · 2024-08-02
Score 6
Open source frontier image model, cut objects from videos, new Python web framework from Jeremy Howard
Watchlist Matched: model, open source
Replicate · inference-infra · 2024-08-02
Score 0
We explore FLUX.1's unique strengths and aesthetics to see what we can generate.
Watchlist Matched: none
Replicate · inference-infra · 2024-08-01
Score 6
FLUX.1 is a new text-to-image model from Black Forest Labs, the creators of Stable Diffusion, that exceeds the capabilities of previous open-source models.
Watchlist Matched: model, api, open-source
Replicate · inference-infra · 2024-07-26
Score 6
A top-tier open-ish language model, new safety classifiers, model search API
Watchlist Matched: model, api
Replicate · inference-infra · 2024-07-12
Score 6
Data curation, data generation, data data data
Watchlist Matched: generation
Replicate · inference-infra · 2024-06-28
Score 6
Google's Gemma2 models, language model leaderboard, tips for Stable Diffusion 3
Watchlist Matched: model, leaderboard
Replicate · inference-infra · 2024-06-21
Score 6
Really good coding model, AI search breakthroughs, Discord support bot
Watchlist Matched: model
Replicate · inference-infra · 2024-06-18
Score 0
We show you how to use Stable Diffusion 3 to get the best images, including new techniques for prompting.
Watchlist Matched: none
Replicate · inference-infra · 2024-06-18
Score 0
A step-by-step guide to generating images with Stable Diffusion 3 on your M-series Mac using MPS acceleration.
Watchlist Matched: none
Replicate · inference-infra · 2024-06-14
Score 6
Copy and paste a few commands into your terminal to play with Stable Diffusion 3 on your own GPU-powered machine.
Watchlist Matched: gpu
Replicate · inference-infra · 2024-06-14
Score 0
Find concepts in GPT models, real-time speech to text in the browser, H100s are coming
Watchlist Matched: none
Replicate · inference-infra · 2024-06-07
Score 6
Garden State Llama, applied LLMs guide, real-time image generation
Watchlist Matched: generation
Replicate · inference-infra · 2024-05-31
Score 6
Faster image generation, AI-powered world simulator, insights on AI dataset complexity
Watchlist Matched: generation
Replicate · inference-infra · 2024-05-24
Score 0
DIY Llama 3 implementation, open-source smart glasses, steering language models with dictionary learning
Watchlist Matched: open-source
Replicate · inference-infra · 2024-05-23
Score 0
No feed summary available yet.
Watchlist Matched: none
Replicate · inference-infra · 2023-12-06
Score 0
Or, how I met a virtual David Attenborough.
Watchlist Matched: none
Replicate · inference-infra · 2023-12-06
Score 0
We’ve added fine-tuning for realistic voice cloning (RVC). You can train RVC on your own dataset from a YouTube video with a few lines of code using Replicate's API.
Watchlist Matched: fine-tuning, api, open-source
Replicate · inference-infra · 2023-12-05
Score 0
We've raised a $40 million Series B led by a16z.
Watchlist Matched: open-source
Replicate · inference-infra · 2023-11-23
Score 6
The Yi series models are large language models trained from scratch by developers at 01.AI. Learn how to run them in the cloud with one line of code.
Watchlist Matched: cloud, api
Replicate · inference-infra · 2023-11-22
Score 0
We've added a CLI command that makes it easy to get started with Replicate.
Watchlist Matched: none
Replicate · inference-infra · 2023-11-08
Score 6
We’ve added chord conditioning to Meta’s MusicGen model, so you can create automatic backing tracks in any style using text prompts and chord progressions.
Watchlist Matched: model
Replicate · inference-infra · 2023-10-13
Score 0
We’ve added fine-tuning support to MusicGen. You can train the small, medium and melody models on your own audio files using Replicate.
Watchlist Matched: fine-tuning
Replicate · inference-infra · 2023-10-09
Score 0
How to use Llama 2 models with grammars for information extraction tasks.
Watchlist Matched: none
Replicate · inference-infra · 2023-10-04
Score 0
Combine AnimateDiff and the ST-MFNet frame interpolator to create smooth and realistic videos from a text prompt
Watchlist Matched: none
Replicate · inference-infra · 2023-09-06
Score 0
We've made some dramatic improvements to cold boots for fine-tuned models.
Watchlist Matched: none
Replicate · inference-infra · 2023-08-22
Score 6
With the recent release of Stable Diffusion XL fine-tuning on Replicate, and today being the 1-year anniversary of Stable Diffusion, now feels like the perfect opportunity to take a step back and reflect on how text-to-image AI has improve...
Watchlist Matched: release, fine-tuning
Replicate · inference-infra · 2023-08-16
Score 0
The price of public models is being cut in half, and soon we'll start charging new users for setup and idle time on private models.
Watchlist Matched: none
Replicate · inference-infra · 2023-08-14
Score 0
Learn the art of the Llama prompt.
Watchlist Matched: none
Replicate · inference-infra · 2023-08-14
Score 0
Our API now supports server-sent event streams for language models. Learn how to use them to make your apps more responsive.
Watchlist Matched: api
Replicate · inference-infra · 2023-08-08
Score 0
We’ve added fine-tuning (Dreambooth, Textual Inversion and LoRA) support to SDXL 1.0. You can train SDXL on your own images with one line of code using the Replicate API.
Watchlist Matched: fine-tuning, lora, api
Replicate · inference-infra · 2023-07-26
Score 0
How to run Stable Diffusion XL 1.0 using the Replicate API
Watchlist Matched: api
Replicate · inference-infra · 2023-07-22
Score 0
How to run Llama 2 on Mac, Linux, Windows, and your phone.
Watchlist Matched: none
Replicate · inference-infra · 2023-07-20
Score 0
So you want to train a llama...
Watchlist Matched: none
Replicate · inference-infra · 2023-05-18
Score 0
We've added a status page to provide real-time updates on the health of Replicate.
Watchlist Matched: none
Replicate · inference-infra · 2023-04-19
Score 0
Give it a machine learning directory and AutoCog will create predict.py and cog.yaml until it successfully runs a prediction
Watchlist Matched: none
Replicate · inference-infra · 2023-04-05
Score 0
No feed summary available yet.
Watchlist Matched: none
Replicate · inference-infra · 2023-03-18
Score 0
A roundup of recent developments from the llamaverse.
Watchlist Matched: none
Replicate · inference-infra · 2023-03-17
Score 0
With a small amount of data and an hour of training you can make LLaMA output text in the voice of the dataset.
Watchlist Matched: training
Replicate · inference-infra · 2023-03-16
Score 0
We'll show you how to train Alpaca, a fine-tuned version of LLaMA that can respond to instructions like ChatGPT.
Watchlist Matched: none
Replicate · inference-infra · 2023-02-21
Score 0
Lots of people want to build things with machine learning, but they don't have the expertise to use it.
Watchlist Matched: none
Replicate · inference-infra · 2022-08-29
Score 0
How to use Replicate to integrate Stable Diffusion into hacks, apps, and projects
Watchlist Matched: api
Replicate · inference-infra · 2022-08-25
Score 6
A tutorial for building a chat bot that replies to prompts with the output of a text-to-image model.
Watchlist Matched: model
Replicate · inference-infra · 2022-08-11
Score 0
We're bringing people together to explore what's being created with machine learning.
Watchlist Matched: none
Replicate · inference-infra · 2022-08-05
Score 0
Using CLIP and LAION5B to collect thousands of captioned images.
Watchlist Matched: none
Replicate · inference-infra · 2022-07-18
Score 0
The basics of using the API to create your own images from text.
Watchlist Matched: api
Replicate · inference-infra · 2022-05-27
Score 0
An introduction to differentiable programming and the process of refining generative art models.
Watchlist Matched: none
Replicate · inference-infra · 2022-05-16
Score 0
We're a small team of engineers and machine learning enthusiasts working to make machine learning more accessible.
Watchlist Matched: none