{"product_id":"practical-gpu-programming-high-performance-computing-with-cuda-cupy-and-python-on-modern-gpus-paperback","title":"Practical GPU Programming: High-performance computing with CUDA, CuPy, and Python on modern GPUs - Paperback","description":"\u003cp\u003eby \u003cb\u003eMaris Fenlor\u003c\/b\u003e (Author)\u003c\/p\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003eIf you're a Python pro looking to get the most out of your code with GPUs, then Practical GPU Programming is the right book for you. This book will walk you through the basics of GPU architectures, show you hands-on parallel programming techniques, and give you the know-how to confidently speed up real workloads in data processing, analytics, and engineering.\u003c\/p\u003e\u003cp\u003eThe first thing you'll do is set up the environment, install CUDA, and get a handle on using Python libraries like PyCUDA and CuPy. You'll then dive into memory management, kernel execution, and parallel patterns like reductions and histogram computations. Next come sorting and search techniques, with a focus on how GPU acceleration transforms business data processing. We'll also put a strong emphasis on linear algebra to show you how to supercharge classic vector and matrix operations with cuBLAS and CuPy. Plus, with batched computations, efficient broadcasting, custom kernels, and mixed-library workflows, you can tackle both standard and advanced problems with ease.\u003c\/p\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003eThroughout, we evaluate numerical accuracy and performance side by side, so you can understand both the strengths and limitations of GPU-based solutions. 
The book covers nearly every essential skill and modern toolkit for practical GPU programming, but it's not going to turn you into a master overnight.\u003c\/p\u003e\u003cp\u003e\u003c\/p\u003eKey Learnings\u003cul\u003e\n\u003cli\u003eBoost processing speed and efficiency for data-intensive tasks.\u003c\/li\u003e\n\u003cli\u003eUse CuPy and PyCUDA to write and execute custom CUDA kernels.\u003c\/li\u003e\n\u003cli\u003eMaximize GPU occupancy and throughput efficiency by using optimal thread block and grid configuration.\u003c\/li\u003e\n\u003cli\u003eReduce global memory bottlenecks in kernels by using shared memory and coalesced access patterns.\u003c\/li\u003e\n\u003cli\u003ePerform dynamic kernel compilation to ensure tailored performance.\u003c\/li\u003e\n\u003cli\u003eUse CuPy to carry out custom, high-speed elementwise GPU operations and expressions.\u003c\/li\u003e\n\u003cli\u003eImplement bitonic and radix sort algorithms for large or batch integer datasets.\u003c\/li\u003e\n\u003cli\u003eExecute parallel linear search kernels to detect patterns rapidly.\u003c\/li\u003e\n\u003cli\u003eScale matrix operations using Batched GEMM and high-level cuBLAS routines.\u003c\/li\u003e\n\u003c\/ul\u003e\u003cp\u003e\u003c\/p\u003eTable of Contents\u003col\u003e\n\u003cli\u003eIntroduction to GPU Fundamentals\u003c\/li\u003e\n\u003cli\u003eSetting up GPU Programming Environment\u003c\/li\u003e\n\u003cli\u003eBasic Data Transfers and Memory Types\u003c\/li\u003e\n\u003cli\u003eSimple Parallel Patterns\u003c\/li\u003e\n\u003cli\u003eIntroduction to Kernel Optimization\u003c\/li\u003e\n\u003cli\u003eWorking with PyCUDA and CuPy Features\u003c\/li\u003e\n\u003cli\u003ePractical Sorting and Search\u003c\/li\u003e\n\u003cli\u003eLinear Algebra Essentials on GPU\u003c\/li\u003e\n\u003c\/ol\u003e\n            \u003cdiv\u003e\n\u003cstrong\u003eNumber of Pages:\u003c\/strong\u003e 130\u003c\/div\u003e\n            
\u003cdiv\u003e\n\u003cstrong\u003eDimensions:\u003c\/strong\u003e 0.28 x 9.25 x 7.5 IN\u003c\/div\u003e\n            \u003cdiv\u003e\n\u003cstrong\u003ePublication Date:\u003c\/strong\u003e February 20, 2025\u003c\/div\u003e\n            ","brand":"Books by splitShops","offers":[{"title":"Default Title","offer_id":43155110887487,"sku":"9789349174795","price":75.58,"currency_code":"USD","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0105\/8226\/1823\/files\/PheYh3Ao1t9789349174795.webp?v=1776962301","url":"https:\/\/dhl-adrianne.myshopify.com\/products\/practical-gpu-programming-high-performance-computing-with-cuda-cupy-and-python-on-modern-gpus-paperback","provider":"BBB","version":"1.0","type":"link"}