Computer Vision Projects with PyTorch - Akshay Kulkarni, Adarsha Shivananda, Nitin Ranjan Sharma

Computer Vision Projects with PyTorch (eBook)

Design and Develop Production-Grade Models
eBook Download: PDF
2022 | 1st ed.
XVI, 346 Seiten
Apress (Verlag)
978-1-4842-8273-1 (ISBN)
Systemvoraussetzungen
62,99 inkl. MwSt
  • Download sofort lieferbar
  • Zahlungsarten anzeigen
Design and develop end-to-end, production-grade computer vision projects for real-world industry problems. This book discusses computer vision algorithms and their applications using PyTorch.

The book begins with the fundamentals of computer vision: convolutional neural nets, RESNET, YOLO, data augmentation, and other regularization techniques used in the industry. And then it gives you a quick overview of the PyTorch libraries used in the book. After that, it takes you through the implementation of image classification problems, object detection techniques, and transfer learning while training and running inference. The book covers image segmentation and an anomaly detection model. And it discusses the fundamentals of video processing for computer vision tasks putting images into videos. The book concludes with an explanation of the complete model building process for deep learning frameworks using optimized techniques with highlights on model AI explainability.

After reading this book, you will be able to build your own computer vision projects using transfer learning and PyTorch.


What You Will Learn
  • Solve problems in computer vision with PyTorch.
  • Implement transfer learning and perform image classification, object detection, image segmentation, and other computer vision applications
  • Design and develop production-grade computer vision projects for real-world industry problems
  • Interpret computer vision models and solve business problems

Who This Book Is For

Data scientists and machine learning engineers interested in building computer vision projects and solving business problems


Akshay R Kulkarni is an AI and machine learning (ML) evangelist and a thought leader. He has consulted for Fortune 500 and global enterprises to drive AI and data science-led strategic transformations. He is currently the manager of data science & AI at Publicis Sapien. He is a Google developer and author of the book Natural Language Processing Recipes (Apress). He is a regular speaker at major AI and data science conferences (including Strata, O'Reilly AI Conf, and GIDS). Akshay is a visiting faculty member for some of the top graduate institutes in India. In 2019, he was featured as one of the top 40 under 40 Data Scientists in India. In his spare time, he enjoys reading, writing, coding, and helping aspiring data scientists. He lives in Bangalore with his family.

Adarsha Shivananda is a senior data scientist on Indegene's product and technology team where he works on building machine learning and artificial intelligence (AI) capabilities for pharma products. He aims to build a pool of exceptional data scientists within and outside of the organization to solve problems through training programs, and always wants to stay ahead of the curve. Previously, he worked with Tredence Analytics and IQVIA. He has worked extensively in the pharma, healthcare, retail, and marketing domains. He lives in Bangalore and loves to read and teach data science.

Nitin Ranjan Sharma is a manager at Novartis, involved in leading a team to develop products using multi-modal techniques. He has been a consultant developing solutions for Fortune 500 companies, involved in solving complex business problems using machine learning and deep learning frameworks. His major focus area and core expertise are computer vision and solving some of the challenging business problems dealing with images and video data. Before Novartis, he was part of the data science team at Publicis Sapient, EY, and TekSystems Global Services. He is a regular speaker at data science communities and meet-ups and also an open-source contributor. He has also been training and mentoring data science enthusiasts.

Design and develop end-to-end, production-grade computer vision projects for real-world industry problems. This book discusses computer vision algorithms and their applications using PyTorch.The book begins with the fundamentals of computer vision: convolutional neural nets, RESNET, YOLO, data augmentation, and other regularization techniques used in the industry. And then it gives you a quick overview of the PyTorch libraries used in the book. After that, it takes you through the implementation of image classification problems, object detection techniques, and transfer learning while training and running inference. The book covers image segmentation and an anomaly detection model. And it discusses the fundamentals of video processing for computer vision tasks putting images into videos. The book concludes with an explanation of the complete model building process for deep learning frameworks using optimized techniques with highlights on model AI explainability.After reading this book, you will be able to build your own computer vision projects using transfer learning and PyTorch.What You Will LearnSolve problems in computer vision with PyTorch.Implement transfer learning and perform image classification, object detection, image segmentation, and other computer vision applicationsDesign and develop production-grade computer vision projects for real-world industry problemsInterpret computer vision models and solve business problemsWho This Book Is ForData scientists and machine learning engineers interested in building computer vision projects and solving business problems
Erscheint lt. Verlag 18.7.2022
Zusatzinfo XVI, 346 p. 154 illus.
Sprache englisch
Themenwelt Mathematik / Informatik Informatik Programmiersprachen / -werkzeuge
Informatik Theorie / Studium Künstliche Intelligenz / Robotik
Schlagworte Artificial Intelligence • computer vision • Deep learning • Image Analysis • image classification • Image Processing • Object detection • Python • PyTorch
ISBN-10 1-4842-8273-6 / 1484282736
ISBN-13 978-1-4842-8273-1 / 9781484282731
Haben Sie eine Frage zum Produkt?
PDFPDF (Wasserzeichen)
Größe: 10,6 MB

DRM: Digitales Wasserzeichen
Dieses eBook enthält ein digitales Wasser­zeichen und ist damit für Sie persona­lisiert. Bei einer missbräuch­lichen Weiter­gabe des eBooks an Dritte ist eine Rück­ver­folgung an die Quelle möglich.

Dateiformat: PDF (Portable Document Format)
Mit einem festen Seiten­layout eignet sich die PDF besonders für Fach­bücher mit Spalten, Tabellen und Abbild­ungen. Eine PDF kann auf fast allen Geräten ange­zeigt werden, ist aber für kleine Displays (Smart­phone, eReader) nur einge­schränkt geeignet.

Systemvoraussetzungen:
PC/Mac: Mit einem PC oder Mac können Sie dieses eBook lesen. Sie benötigen dafür einen PDF-Viewer - z.B. den Adobe Reader oder Adobe Digital Editions.
eReader: Dieses eBook kann mit (fast) allen eBook-Readern gelesen werden. Mit dem amazon-Kindle ist es aber nicht kompatibel.
Smartphone/Tablet: Egal ob Apple oder Android, dieses eBook können Sie lesen. Sie benötigen dafür einen PDF-Viewer - z.B. die kostenlose Adobe Digital Editions-App.

Buying eBooks from abroad
For tax law reasons we can sell eBooks just within Germany and Switzerland. Regrettably we cannot fulfill eBook-orders from other countries.

Mehr entdecken
aus dem Bereich
der Praxis-Guide für Künstliche Intelligenz in Unternehmen - Chancen …

von Thomas R. Köhler; Julia Finkeissen

eBook Download (2024)
Campus Verlag
38,99
Wie du KI richtig nutzt - schreiben, recherchieren, Bilder erstellen, …

von Rainer Hattenhauer

eBook Download (2023)
Rheinwerk Computing (Verlag)
24,90