Practical Machine Learning for Computer Vision

Valliappa (Lak) Lakshmanan is currently a Technical Lead for Data and Machine Learning Professional Services for Google Cloud.

Meer over de auteurs

Valliappa Lakshmanan, Martin Görner, Ryan Gillard

Practical Machine Learning for Computer Vision

Name: Practical Machine Learning for Computer Vision
Author: Valliappa Lakshmanan

End-to-End Machine Learning for Images

Paperback Engels 2021 1e druk 9781098102364

100,80

In winkelwagen

Verwachte levertijd ongeveer 16 werkdagen

Samenvatting

By using machine learning models to extract information from images, organizations today are making breakthroughs in healthcare, manufacturing, retail, and other industries. This practical book shows ML engineers and data scientists how to solve a variety of image problems including classification, object detection, autoencoders, image generation, counting, and captioning with proven ML techniques.

Google engineers Valliappa Lakshmanan, Martin Görner, and Ryan Gillard show you how to develop accurate and explainable computer vision ML models and put them into large-scale production using robust ML architecture in a flexible and maintainable way. You'll learn how to design, train, evaluate, and predict with models written in TensorFlow/Keras. This book also covers best practices to improve the operationalization of the models using end-to-end ML pipelines.

You'll learn how to:
- Design ML architecture for computer vision tasks
- Select a model (such as ResNet, SqueezeNet, or EfficientNet) appropriate to your task
- Create an end-to-end ML pipeline to train, evaluate, deploy, and explain your model
- Preprocess images for data augmentation and to support learnability
- Incorporate explainability and responsible AI best practices
- Deploy image models as web services or on edge devices
- Monitor and manage ML models

Specificaties

ISBN13:9781098102364

Trefwoorden:Programmeren, machine learning, Computer vision

Taal:Engels

Bindwijze:paperback

Aantal pagina's:350

Uitgever:O'Reilly

Druk:1

Verschijningsdatum:6-8-2021

Hoofdrubriek:IT-management / ICT

Lezersrecensies

Wees de eerste die een lezersrecensie schrijft!

Schrijf een recensie

Uw cijfer

?

Log in om te stemmen

Over Valliappa Lakshmanan

Valliappa (Lak) Lakshmanan is currently a Technical Lead for Data and Machine Learning Professional Services for Google Cloud. His mission is to democratize machine learning so that it can be done by anyone anywhere using Google's amazing infrastructure, without deep knowledge of statistics or programming or ownership of a lot of hardware. Before Google, he led a team of data scientists at the Climate Corporation and was a Research Scientist at NOAA National Severe Storms Laboratory, working on machine learning applications for severe weather diagnosis and prediction. http://aisoftwarellc.weebly.com/

Andere boeken door Valliappa Lakshmanan

Bekijk alle boeken

Inhoudsopgave

1. Machine Learning for Computer Vision
Machine Learning
Deep Learning Use Cases
Organization of Book
Summary

2. ML Models for Vision
Machine Perception
Working with the dataset
Linear Model Using Keras
Keras Model
Training the Model
Neural Network Using Keras
Neural Networks
Deep neural networks
Summary

3. Image Vision
Pretrained Embeddings
Pretrained Model
Transfer Learning
Fine-tuning
Convolutional networks
Convolutional Filters
Stacking convolutional layers
Pooling layers
AlexNet
The Quest for Depth
Filter factorization
1x1 convolutions
VGG19
Global average pooling
Modular Architectures
Inception
SqueezeNet
ResNet and Skip Connections
DenseNet
Depth-separable convolutions
Xception
Neural Architecture Search Designs
NasNet
Depth-wise convolutions
Inverted Residual Bottlenecks
MobileNetV2
EfficientNet: Putting It All Together
Choosing a Model
Performance Comparison
Ensembling
Recommended Strategy
Summary

4. Creating Vision Datasets
Collecting Images
Photographs
Imaging
Proof of concept
Data Types
Channels
Geospatial data
Audio and Video
Manual Labeling
Folder structure
Metadata table
Object detection
Labeling at scale
Labeling user interface
Multiple tasks
Voting and crowdsourcing
Labeling services
Automated Labeling
Labels from related data
Noisy student
Self-supervised learning
Bias
Sources of bias
Selection bias
Measurement bias
Confirmation bias
Detecting bias
Creating a Dataset
Splitting data
TensorFlow Records
Reading TensorFlow Records
Summary