Building LLMs with PyTorch

Anand Trivedi

SKU: 9789365898255

$39.95

ISBN: 9789365898255
eISBN: 9789365894158
Authors: Anand Trivedi
Rights: Worldwide
Edition: 2025
Pages: 534
Dimensions: 7.5 × 9.25 inches
Book Type: Paperback

PyTorch has become the go-to framework for building cutting-edge large language models (LLMs), enabling developers to harness the power of deep learning for natural language processing. This book serves as your practical guide to navigating the intricacies of PyTorch, empowering you to create your own LLMs from the ground up. 

You will begin by mastering PyTorch fundamentals, including tensors, autograd, and model creation, before diving into core neural network concepts like gradients, loss functions, and backpropagation. After working through regression and image classification with convolutional neural networks, you will explore advanced image processing through object detection and segmentation. The book then moves into NLP, covering RNNs, LSTMs, and attention mechanisms, and culminates in the construction of Transformer-based LLMs, including a practical mini-GPT project. You will also gain a strong understanding of generative models like VAEs and GANs.

By the end of this book, you will possess the technical proficiency to build, train, and deploy sophisticated LLMs using PyTorch, equipping you to contribute to the rapidly evolving landscape of AI.

WHAT YOU WILL LEARN
● Build and train PyTorch models for linear and logistic regression.
● Configure PyTorch environments and utilize GPU acceleration with CUDA.
● Construct CNNs for image classification and apply transfer learning techniques.
● Master PyTorch tensors and autograd, and build fundamental neural networks.
● Utilize SSD and YOLO for object detection and perform image segmentation.
● Develop RNNs and LSTMs for sequence modeling and text generation.
● Implement attention mechanisms and build Transformer-based language models.
● Create generative models using VAEs and GANs for diverse applications.
● Build and deploy your own mini-GPT language model, applying the acquired skills.

WHO THIS BOOK IS FOR
Software engineers, AI researchers, architects seeking AI insights, and professionals in finance, medicine, engineering, and mathematics will find this book a comprehensive starting point, regardless of prior deep learning expertise.

1. Introduction to Deep Learning
2. Nuts and Bolts of AI with PyTorch
3. Introduction to Convolution Neural Network
4. Model Building with Custom Layers and PyTorch 2.0
5. Advances in Computer Vision: Transfer Learning and Object Detection
6. Advanced Object Detection and Segmentation
7. Mastering Object Detection with Detectron2
8. Introduction to RNNs and LSTMs
9. Understanding Text Processing and Generation in Machine Learning
10. Transformers Unleashed
11. Introduction to GANs: Building Blocks of Generative Models
12. Conditional GANs, Latent Spaces, and Diffusion Models
13. PyTorch 2.0: New Features, Efficient CUDA Usage, and Accelerated Model Training
14. Building Large Language Models from Scratch

Anand Trivedi has spent 15 years in the IT industry solving complex problems through code, working as a Java developer until 2015 and then dedicating himself to Artificial Intelligence and Machine Learning. He has experience building technology for startups, especially those starting from scratch, and bringing their products to market. He contributed to startups such as Heaps App, Bharatnaukri, and Aavenir from their earliest stages, while also building AI at mid-size and large companies. He was one of the founding members of Aavenir and worked there for six years as it scaled from 0 to 100+ people. He has spoken at various prestigious events and was selected among the 40 under 40 AI engineers in 2024. He is now actively researching problems that LLMs cannot solve and that remain unsolved to date, specifically in microfinance and quantum finance.
