ODBIERZ TWÓJ BONUS :: »

Distributed AI Systems. A practical guide to building scalable training, inference, and serving systems for production AI Fuheng Wu

(ebook) (audiobook) (audiobook) Język publikacji: angielski
Distributed AI Systems. A practical guide to building scalable training, inference, and serving systems for production AI Fuheng Wu - okladka książki

Distributed AI Systems. A practical guide to building scalable training, inference, and serving systems for production AI Fuheng Wu - okladka książki

Distributed AI Systems. A practical guide to building scalable training, inference, and serving systems for production AI Fuheng Wu - audiobook MP3

Distributed AI Systems. A practical guide to building scalable training, inference, and serving systems for production AI Fuheng Wu - audiobook CD

Autor:
Fuheng Wu
Ocena:
Bądź pierwszym, który oceni tę książkę
Stron:
560
Dostępne formaty:
     PDF
     ePub
Ebook
125,10 zł 139,00 zł (-10%)
125,10 zł najniższa cena z 30 dni

Dodaj do koszyka Dostępny natychmiast po opłaceniu zakupu lub Kup na prezent Kup 1-kliknięciem

Przenieś na półkę

Do przechowalni

As AI models grow to billions and trillions of parameters, distributed systems are essential for training and serving them. Many resources cover fragments of this domain, but none provide a full path from distributed training to inference and production deployment. This book fills that gap with practical, production-focused examples.
It starts with GPU and memory estimation, data preparation, and an overview of GPU architecture, interconnects, and core parallelism strategies. You’ll learn training techniques including data parallelism for single and multi-node setups, parameter sharding for memory-efficient scaling, and methods to reduce memory usage in large models.
The next section covers distributed inference and deployment. You’ll build high-performance systems using optimized attention, caching, operator fusion, and router-based designs. You’ll deploy on schedulers and container platforms with GPU-aware orchestration and assemble production stacks emphasizing reliability, scalability, and observability.
The final section covers benchmarking, performance tuning, and trends like MoE models, edge - cloud co-ordination, and advanced parallelism. Each chapter includes tested code and debugging guidance.
By the end, you’ll be able to build distributed AI systems that scale from a single GPU to large clusters.

Wybrane bestsellery

O autorze książki

Henry (Fuheng) Wu is a Principal Machine Learning Tech Lead at Oracle's Generative AI organization, specializing in distributed training, large-scale inference, and GPU systems. He has delivered core components of Oracle's Vision and Document Understanding AI services, and co-authored a Microsoft and Oracle blog on high-performance deep learning. He has contributed to open-source projects including SGLang, genai-bench, pyLLaMA, chatLLaMA, and Oracle's HiQ observability system. With hands-on experience across PyTorch Distributed, DeepSpeed, Kubernetes GPU clusters, and production LLM serving, he focuses on building practical, scalable AI systems used in real-world enterprise workloads.

Packt Publishing - inne książki

Zamknij

Przenieś na półkę
Dodano produkt na półkę
Usunięto produkt z półki
Przeniesiono produkt do archiwum
Przeniesiono produkt do biblioteki
Proszę czekać...
ajax-loader

Zamknij

Wybierz metodę płatności

Ebook
125,10 zł
Dodaj do koszyka
Zamknij Pobierz aplikację mobilną Ebookpoint