Practical ML and LLM deployment content including Bedrock, SageMaker, and agent operations.
AWS Machine Learning Blog · cloud · 2026-06-03
Score 11
Fine-tuning for domain-specific tasks means improving performance in one area without degrading the model’s general capabilities, and getting that balance right is harder than it looks. This post walks through how to navigate that balance,...
High signal Matched: performance, model, training, checkpointing, fine-tuning
AWS Machine Learning Blog · cloud · 2026-06-03
Score 9
In this post, we'll walk through implementing object detection with Amazon Nova 2 Lite. You'll learn how to deploy an object detection application using Amazon Bedrock, AWS Lambda, and Amazon API Gateway. You'll also learn how to craft eff...
High signal Matched: bedrock, api
AWS Machine Learning Blog · cloud · 2026-06-03
Score 11
This post walks through how Baz built their Spec Review agent using Amazon Bedrock and Amazon Bedrock AgentCore. We'll cover the architecture decisions, implementation details, and the business outcomes they achieved by leveraging these AW...
High signal Matched: bedrock, agent
AWS Machine Learning Blog · cloud · 2026-06-02
Score 9
Today, we’re excited to announce the ability to reference a secret in AWS Secrets Manager for AgentCore Identity, so you can reference your own preconfigured secret from Secrets Manager and retain full control over how it is managed. With...
High signal Matched: bedrock
AWS Machine Learning Blog · cloud · 2026-06-02
Score 9
In this post, we walk through how to use Amazon Quick Research to integrate biomedical data sources for rare cancer research. The walkthrough uses pediatric sarcoma as the research domain and draws on publicly available datasets from PubMe...
High signal Matched: research
AWS Machine Learning Blog · cloud · 2026-06-02
Score 13
GPT-5.5, GPT-5.4, and Codex are now generally available on Amazon Bedrock. Deploy them in production applications and agents today, on Bedrock’s high performance inference engine.
High signal Matched: inference, performance, bedrock, agents
AWS Machine Learning Blog · cloud · 2026-06-02
Score 11
While deploying Model Context Protocol (MCP) servers in production, enterprises need fine-grained access control across servers, observability into which teams use which tools, security guarantees against data exfiltration, and centralized...
High signal Matched: model, bedrock, mcp
AWS Machine Learning Blog · cloud · 2026-06-02
Score 9
In this post, we use a lakehouse data agent to demonstrate how you can use Policy for deterministic access control and Lambda interceptors for dynamic validation. We then show how to combine Lambda interceptors and Policy to implement a ge...
High signal Matched: bedrock, agent, agents
AWS Machine Learning Blog · cloud · 2026-06-02
Score 9
In this post, we address several key risks that surface when designing an agentic payment system, and how to address them with the capabilities of AgentCore payments.
High signal Matched: bedrock, agentic
AWS Machine Learning Blog · cloud · 2026-06-02
Score 9
When you build agentic AI solutions, you face unique operational challenges. Agents make unpredictable decisions, costs spiral unexpectedly, and debugging non-deterministic failures seems impossible. Agentic AI applications don't just exec...
High signal Matched: bedrock, agents, agentic
AWS Machine Learning Blog · cloud · 2026-06-02
Score 15
If you’re iterating on deploying large language models (LLMs) on AWS GPU instances, you’ve probably noticed the larger the model to be loaded into GPU High Bandwidth Memory (HBM), the longer the painful wait until the GPUs are ready for in...
High signal Matched: inference, gpu, model
AWS Machine Learning Blog · cloud · 2026-05-30
Score 17
This post demonstrates a comprehensive observability solution using Amazon Managed Grafana dashboards that provides a holistic view of both quality and quantity for LLMs served on Amazon SageMaker AI endpoints with inference components.
High signal Matched: inference, gpu, sagemaker
AWS Machine Learning Blog · cloud · 2026-05-29
Score 13
Azercell Telecom LLC, Azerbaijan's leading telecommunications provider, wanted to build an Azerbaijani large language model (LLM) on Amazon SageMaker AI for telecom use cases and a customer-facing chatbot. The challenge: adapting foundatio...
High signal Matched: model, sagemaker, training
AWS Machine Learning Blog · cloud · 2026-05-29
Score 11
In this post, you learn how to build a custom portal with embedded SageMaker AI MLflow Apps UI. You walk through the architecture pattern behind a React front end paired with a Flask reverse proxy that handles AWS Signature Version 4 (SigV...
High signal Matched: cloud, sagemaker
AWS Machine Learning Blog · cloud · 2026-05-29
Score 11
In this post, we demonstrate how to build a secure Flask-based MLflow proxy service that provides HTTPS access to Amazon SageMaker MLflow without requiring the MLflow SDK. This solution is for organizations undergoing cloud transformation...
High signal Matched: cloud, sagemaker, api, sdk
AWS Machine Learning Blog · cloud · 2026-05-29
Score 9
This post combines learnings from LangChain’s work on evaluating deep agents and Anthropic’s guide to demystifying evals for AI agents into a practical guide. In this post, you will learn how to: 1) apply five evaluation patterns for deep...
High signal Matched: evaluation, bedrock, evals, evaluating, agent, agents
AWS Machine Learning Blog · cloud · 2026-05-29
Score 13
Datasets in AgentCore is in public preview. Agent evaluation is most powerful when you combine fast-moving online signals with stable offline baselines. To understand whether your agent is truly improving over time, you need a fixed benchm...
High signal Matched: benchmark, evaluation, bedrock, agent
AWS Machine Learning Blog · cloud · 2026-05-29
Score 11
This post covers Opus 4.8's improvements and practical guidance for AI engineers integrating the model into agentic systems and production inference workloads on Amazon Bedrock.
High signal Matched: inference, model, bedrock, agentic
AWS Machine Learning Blog · cloud · 2026-06-02
Score 7
This post demonstrates how to implement Open Authorization (OAuth) Code flow as an inbound authorization mechanism for MCP servers hosted on Amazon Bedrock AgentCore Gateway. By the end of this guide, you will have a production-ready setup...
Watchlist Matched: bedrock, mcp
AWS Machine Learning Blog · cloud · 2026-06-02
Score 7
In this post, we walk through a practical implementation using KDB-X MCP server integration with Amazon Quick, demonstrating how traders and analysts can ask questions using conversational language and receive actionable insights from data...
Watchlist Matched: performance, mcp