Variance stabilizing transformations in machine learning

12 min read · Jun 2, 2022


You’ve probably heard that, before training machine learning models, data scientists often transform variables so that their distribution is closer to the normal distribution.

But why do we do this? Which variables should we transform? Which transformations should we use? And do we need to transform variables for every machine learning algorithm?

These are the questions that we will address throughout this article. Let’s get started.

Feature engineering for machine learning

This article is the fourth in a series of articles on feature engineering for machine learning. You can learn more about how data scientists preprocess their data for machine learning at the following links:

Feature engineering for machine learning
Missing data imputation
Categorical variable encoding
Variable transformation (you are here)
Discretization
Feature scaling
Feature creation
Python libraries for feature engineering
Excellent resources for learning about feature engineering

Written by Sole from Train in Data