Understanding End-to-End Machine Learning Process (Part 2 of 5)
To read part 1, please click here
Classifying ML Algorithms
There are three main types of ML algorithms:
- Supervised Learning- In this one, models are trained with a so-called labeled dataset i.e. we will also know about the required output along with the required input. It's divided into two groups- classification problems works with discrete results having output as a class or group, for example- identifying fraud in money transactions or doing object detection in images; whereas regression problems works with continuous results with output as a certain value, for example- forecasting prices for houses or the stock market or predicting population growth. However, this learning requires labeling the whole dataset which is generally a tedious task.
- Unsupervised Learning- As the name suggests, here, models are trained on unlabeled dataset which refers to self-organized learning to find patterns in data, known as clustering. For example- filtering of spam emails from inbox or movie recommendations or clothing a person might like to watch or purchase. The learning algorithms are generally used in real-time scenarios where data is processed directly. This type of learning doesn't require labelling of the dataset.
- Reinforcement Learning- Here, algorithms learn by reacting to a given environment on their own, just like the humans learn while growing-up. For example- the technique of teaching a dog to behave. However, technically, this is realized through a so-called agent guided by a policy map to decide the probability of taking action within a specific state. For example- training navigation control for a robot or an AI opponent for a game.
Discovering the End-to-End ML Process
In order to get a clear picture of the required steps to create a high-quality ML model, we can refer to the following diagram:
Hence, the above figure defines the following distinct steps:
- Excavating data and sources.
- Preparing and cleaning data.
- Defining labels and engineering features
- Training models
- Deploying models
Although these steps shows a single ML project, but. if you are dealing with a lot of projects and data, then, you will have to adopt some form of automation and operationalization, referred as MLOps.
To read part 1, please click here
Comments
Post a Comment