Understanding End-to-End Machine Learning Process (Part 2 of 5)

By Ashwin Venugopal - January 30, 2023

To read part 1, please click here

To read part 3, please click here

To read part 4, please click here

To read part 5, please click here

Classifying ML Algorithms

There are three main types of ML algorithms:

Supervised Learning- In this one, models are trained with a so-called labeled dataset i.e. we will also know about the required output along with the required input. It's divided into two groups- classification problems works with discrete results having output as a class or group, for example- identifying fraud in money transactions or doing object detection in images; whereas regression problems works with continuous results with output as a certain value, for example- forecasting prices for houses or the stock market or predicting population growth. However, this learning requires labeling the whole dataset which is generally a tedious task.

Unsupervised Learning- As the name suggests, here, models are trained on unlabeled dataset which refers to self-organized learning to find patterns in data, known as clustering. For example- filtering of spam emails from inbox or movie recommendations or clothing a person might like to watch or purchase. The learning algorithms are generally used in real-time scenarios where data is processed directly. This type of learning doesn't require labelling of the dataset.

Reinforcement Learning- Here, algorithms learn by reacting to a given environment on their own, just like the humans learn while growing-up. For example- the technique of teaching a dog to behave. However, technically, this is realized through a so-called agent guided by a policy map to decide the probability of taking action within a specific state. For example- training navigation control for a robot or an AI opponent for a game.

Discovering the End-to-End ML Process

In order to get a clear picture of the required steps to create a high-quality ML model, we can refer to the following diagram:

Hence, the above figure defines the following distinct steps:

Excavating data and sources.
Preparing and cleaning data.
Defining labels and engineering features
Training models
Deploying models

Although these steps shows a single ML project, but. if you are dealing with a lot of projects and data, then, you will have to adopt some form of automation and operationalization, referred as MLOps.

To read part 1, please click here

To read part 3, please click here

To read part 4, please click here

To read part 5, please click here

Search This Blog

Blogs by Ashwin