Official PyTorch ecosystem blog covering community updates and systems optimization.
PyTorch Foundation · open-source · 2026-06-01
Score 11
TL;DR: This case study demonstrates how LinkedIn re-architected its distributed linear programming solver, DuaLip, by developing a GPU-accelerated PyTorch version to handle extreme-scale optimization challenges like web applications. This...
High signal Matched: distributed, gpu
PyTorch Foundation · open-source · 2026-05-28
Score 15
When you use PyTorch’s compiler, your model runs faster, up to 10x faster. But what’s actually happening? Without compilation, the GPU runs a kernel, a function on the GPU, for...
High signal Matched: kernel, gpu, model
PyTorch Foundation · open-source · 2026-05-28
Score 17
TL;DR: The TokenSpeed inference engine achieved a record-breaking 580 tps running the Qwen3.5-397B-A17B model on GPUs. This extreme performance for agentic workloads is driven by systematic elimination of memory copies,...
High signal Matched: inference, performance, gpu, model, agentic
PyTorch Foundation · open-source · 2026-05-27
Score 11
The PyTorch Foundation, a community-driven hub for open source AI under the Linux Foundation, is announcing today that Alibaba Cloud has joined as a Platinum member. Alibaba Cloud is a...
High signal Matched: cloud, open source
PyTorch Foundation · open-source · 2026-05-26
Score 18
Code available at: https://github.com/facebookresearch/ads_model_kernel_library In this post, we present the design of TLX Block Attention — a Triton kernel targeting NVIDIA Blackwell GPUs that exploits compile-time knowledge of a block-di...
High signal Matched: kernel, triton, blackwell, model
PyTorch Foundation · open-source · 2026-05-19
Score 8
TLDR: PyTorch 2.11 makes it possible to install CUDA-enabled PyTorch wheels on aarch64 Linux directly from PyPI, eliminating the need for custom package indexes and workarounds that previously complicated deployment...
High signal Matched: cuda
PyTorch Foundation · open-source · 2026-05-19
Score 14
TL;DR: Introducing the ExecuTorch MLX Delegate The new MLX delegate enables optimized, GPU-accelerated inference for PyTorch models on Apple Silicon Macs, using Apple’s MLX framework. The delegate seamlessly integrates with...
High signal Matched: inference, gpu, introducing
PyTorch Foundation · open-source · 2026-05-14
Score 12
We are excited to announce the release of PyTorch® 2.12 (release notes)! The PyTorch 2.12 release features the following changes: Batched linalg.eigh on CUDA is up to 100x faster due...
High signal Matched: cuda, release
PyTorch Foundation · open-source · 2026-06-04
Score 4
TL;DR DeepSpeed now supports Muon Optimizer! Muon Optimizer has gained great momentum with significant adoption from frontier AI Labs. One of those AI Labs is Moonshot AI, which has adopted...
Watchlist Matched: none
PyTorch Foundation · open-source · 2026-05-23
Score 1
A little over a year ago, the PyTorch Foundation launched the Ambassador Program, an initiative that recognizes and supports independent, trusted voices in the PyTorch community who are passionate about...
Watchlist Matched: none
PyTorch Foundation · open-source · 2026-05-21
Score 1
Thank you to everyone who participated in the PyTorch Docathon 2026! Once again, the community showed up with incredible energy and dedication to make PyTorch documentation better for developers everywhere....
Watchlist Matched: none