(0)

Kirjoita arvostelu

-35%

Optimizing LLM Performance Framework-Agnostic Techniques for Speed, Scalability, and Cost-Efficient Inference Across PyTorch, ONNX, VLLM, and More

Peter E Poisson

(0)

Kirjoita arvostelu

Kieli englanti

Kansi Pehmeäkantinen

Julkaistu 2025-07-26

18,61 € 28,63 €

-35% koodilla BOOKS

Pehmeäkantinen 28,63 € Kovakantinen

Loppu

30 päivän palautusoikeus

Are you struggling to scale your large language models (LLMs) without breaking the bank or sacrificing latency? This book offers a clear roadmap to optimize inference, reduce costs, and scale seamlessly across platforms like PyTorch, ONNX, vLLM, and more.Optimizing LLM Performance is your hands-on guide to boosting the efficiency of large language models in production environments. Whether you're building c ... Täydellinen kuvaus

Saatat myös pitää

-35%

TOP

If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All

Eliezer Yudkowsky, Nate Soares

16,35 € 25,15 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

The God Test

Robert Wright

16,35 € 25,15 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

How To Think About AI: A Guide For The Perplexed

Richard Susskind

13,26 € 20,40 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Gödel, Escher, Bach: An Eternal Golden Braid

Douglas R. Hofstadter

22,74 € 34,99 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

AI Engineering: Building Applications with Foundation Models

Chip Huyen

79,31 € 122,02 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Empire of AI: Dreams and Nightmares in Sam Altman's OpenAI

Karen Hao

18,02 € 27,72 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Python Crash Course: A Hands-On, Project-Based Introduction to Programming

Eric Matthes

39,40 € 60,62 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

The Rust Programming Language

Steve Klabnik, Carol Nichols, Chris Krycho

47,78 € 73,51 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

How to Talk to AI: (And How Not To)

Jamie Bartlett

12,99 € 19,99 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

The Pragmatic Programmer: journey to mastery, 20th Anniversary Edition, 2/e: your journey to mastery, 20th Anniversary Edition

Andrew Hunt, David Thomas

47,96 € 73,78 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Hackers. 25th Anniversary Edition: Heroes of the Computer Revolution

Steven Levy

29,75 € 45,77 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

The Art of Game Design: A Book of Lenses

Jesse Schell

85,29 € 131,21 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

How Linux Works: What Every Superuser Should Know

Brian Ward

39,83 € 61,27 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Hands-On Large Language Models: Language Understanding and Generation

Maarten Grootendorst, Jay Alammar

79,31 € 122,02 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Fundamentals of Software Architecture: A Modern Engineering Approach

Mark Richards, Neal Ford

79,31 € 122,02 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Speak Data: Artists, Scientists, Thinkers, and Dreamers on How We Live Our Lives in Numbers

Giorgia Lupi, Phillip Cox

30,80 € 47,38 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Clean Architecture: A Craftsman's Guide to Software Structure and Design

Robert C. Martin

34,18 € 52,58 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

World of Warcraft Chronicle, Volume 2

39,83 € 61,27 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Linux Basics for Hackers, 2nd Edition: Getting Started with Networking, Scripting, and Security in Kali

Occupytheweb

31,86 € 49,01 €

-35% koodilla BOOKS

Toimittajalla varastossa

-35%

TOP

Deep Learning: Foundations and Concepts

Christopher M. Bishop, Hugh Bishop

93,59 € 143,98 €

-35% koodilla BOOKS

Toimittajalla varastossa

Kuvaus

Optimizing LLM Performance is your hands-on guide to boosting the efficiency of large language models in production environments. Whether you're building chatbots, document summarizers, or enterprise AI tools, this book teaches proven methods to accelerate inference while maintaining accuracy. It dives deep into hardware-aware optimizations, quantization, model pruning, compiler acceleration, and memory-efficient runtime strategies without locking you into any single framework.

Written with clarity and real-world use in mind, the book features practical case studies, side-by-side performance comparisons, and up-to-date techniques from the cutting edge of AI deployment. If you're building, serving, or scaling LLMs in 2025, this is the performance engineering guide you've been waiting for.

Key Features:
- Framework-agnostic optimization techniques using PyTorch, ONNX Runtime, vLLM, llama.cpp, and more
- Deep dive into quantization (INT8/4-bit), distillation, pruning, and KV caching
- Hands-on examples with FastAPI, Hugging Face Transformers, and serverless deployment
- Covers performance profiling, streaming, batching, and cost-efficient scaling
- Future-proof insights on compiler-aware models, LoRA 2.0, and edge inference

Ready to build LLM systems that are faster, cheaper, and more scalable?
Grab your copy of Optimizing LLM Performance today and deploy smarter.

Lisätietoja

Kirjoittaja	Peter E Poisson
Julkaisija	Amazon Digital Services LLC - Kdp
Julkaisuvuosi	2025
Kannen tyyppi	Pehmeäkantinen
EAN	9798294338459

Kirjoita oma arvostelusi

Arvostelet: Optimizing LLM Performance Framework-Agnostic Techniques for Speed, Scalability, and Cost-Efficient Inference Across PyTorch, ONNX, VLLM, and More

Arvostelusi:

Goodreads-arvostelut

18,61 € 28,63 €

Optimizing LLM Performance Framework-Agnostic Techniques for Speed, Scalability, and Cost-Efficient Inference Across PyTorch, ONNX, VLLM, and More

Saatat myös pitää

If Anyone Builds It, Everyone Dies: Why Superhuman AI Would Kill Us All

The God Test

How To Think About AI: A Guide For The Perplexed

Gödel, Escher, Bach: An Eternal Golden Braid

AI Engineering: Building Applications with Foundation Models

Empire of AI: Dreams and Nightmares in Sam Altman's OpenAI

Python Crash Course: A Hands-On, Project-Based Introduction to Programming

The Rust Programming Language

How to Talk to AI: (And How Not To)

The Pragmatic Programmer: journey to mastery, 20th Anniversary Edition, 2/e: your journey to mastery, 20th Anniversary Edition

Hackers. 25th Anniversary Edition: Heroes of the Computer Revolution

The Art of Game Design: A Book of Lenses

How Linux Works: What Every Superuser Should Know

Hands-On Large Language Models: Language Understanding and Generation

Fundamentals of Software Architecture: A Modern Engineering Approach

Speak Data: Artists, Scientists, Thinkers, and Dreamers on How We Live Our Lives in Numbers

Clean Architecture: A Craftsman's Guide to Software Structure and Design

World of Warcraft Chronicle, Volume 2

Linux Basics for Hackers, 2nd Edition: Getting Started with Networking, Scripting, and Security in Kali

Deep Learning: Foundations and Concepts

Kuvaus

Lisätietoja

Goodreads-arvostelut

Olibro

Asiakaspalvelu

Tietoa

Ota yhteyttä

Optimizing LLM Performance Framework-Agnostic Techniques for Speed, Scalability, and Cost-Efficient Inference Across PyTorch, ONNX, VLLM, and More - Peter E Poisson

Optimizing LLM Performance Framework-Agnostic Techniques for Speed, Scalability, and Cost-Efficient Inference Across PyTorch, ONNX, VLLM, and More

Saatat myös pitää

Kuvaus

Lisätietoja

Goodreads-arvostelut