Developing High-quality ML Solutions

When a deployed ML model produces poor predictions, it can be due to a wide range of problems. It can be the result of bugs that are typical in any program—but it can also be the result of ML-specific problems. Perhaps data skews and anomalies are causing model performance to degrade over time. Or the data format is inconsistent between the model’s native interface and the serving API. If  models aren’t monitored, they can fail silently. 

When a model is embedded into an application, issues like this can create poor user experiences. If the model is part of an internal process, the issues can negatively impact business decision-making. 

Software engineering has many processes, tools, and practices to ensure software quality, all of which help make sure that the software is working in production as intended. These tools include software testing, verification and validation, and logging and monitoring. 

In ML systems, the tasks of building, deploying, and operating the systems present additional challenges that require additional processes and practices. Not only are ML systems particularly data-dependent because they inform decision-making from data automatically, but they’re also dual training-serving systems. This duality can result in training-serving skew. ML systems are also prone to staleness in automated decision-making systems.

These additional challenges mean that you need different kinds of testing and monitoring for ML models and systems than you do for other software systems—during development, during deployment, and in production. 

Based on Google work with customers, they’ve created a comprehensive collection of guidelines for each process in the MLOps lifecycle. The guidelines cover how to assess, ensure, and control the quality of your ML solutions. Google have published this complete set of guidelines on the Google Cloud site

To give you an idea of what you can learn, here’s a summary of what the guidelines cover:

To read the full list of Google recommendations, read the document Guidelines for developing high-quality ML solutions.

Related posts

Supercharging security with generative AI

by Kartika Triyanti
12 months ago

What is GCP?

by Cloud Ace Indonesia
3 years ago

Cloud Data Loss Prevention’s sensitive data intelligence service is now available in Security Command Center

by Cloud Ace Indonesia
10 months ago