PyTorch Foundation · open-source · 2026-05-26
Score 18
Code available at: https://github.com/facebookresearch/ads_model_kernel_library In this post, we present the design of TLX Block Attention — a Triton kernel targeting NVIDIA Blackwell GPUs that exploits compile-time knowledge of a block-di...
High signal Matched: kernel, triton, blackwell, model
AMD ROCm Blogs · hardware · 2026-05-22
Score 30
Triton Inference Server is an open-source platform designed to streamline AI inferencing. It supports the deployment, scaling, and inference of trained models from multiple frameworks, including ONNX Runtime, TensorFlow, PyTorch, and other...
High signal Matched: inference, inferencing, serving, triton, benchmark, model, cloud, open-source
vLLM Project · open-source · 2026-03-04
Score 14
This article is adapted from a Red Hat hosted vLLM Office Hours session with Burkhard Ringlein from IBM Research, featuring a deep technical walkthrough of the vLLM Triton attention backend....
High signal Matched: triton, research
Modular · inference-infra · 2025-03-26
Score 10
What about Triton and Python eDSLs? (Democratizing AI Compute, Part 7)
High signal Matched: triton