[Exploratory Data Analysis] How to perform multivariate analysis

Let me clarify from the beginning that this is not an extensive or in-depth guide to multivariate analysis. Fortunately, we can find many online resources on that. It is more like a beginners’ friendly guide. In our adventure, we will use the Palmer Archipelago (Antarctica) penguin data dataset that comes as a kind of alternative

Continue reading [Exploratory Data Analysis] How to perform multivariate analysis

[Exploratory Data Analysis] How to perform univariate analysis

Let’s assume that we are about to start working on a new classification problem. Before we start, it’s always a good idea to perform a univariate analysis on the target variable. This is helpful because it’s going to give us insights into the distribution of the target. What if we discover that the dataset is

Continue reading [Exploratory Data Analysis] How to perform univariate analysis

[Feature Engineering] how to convert a date string to datetime

Let’s say that we have a dataset like rainfall in Australia (where I can find this? in kaggle) In this dataset, we have a date column that looks like this: >>> df[‘Date’] 0 2008-12-01 1 2008-12-02 2 2008-12-03 3 2008-12-04 4 2008-12-05 … 145455 2017-06-21 145456 2017-06-22 145457 2017-06-23 145458 2017-06-24 145459 2017-06-25 Name: Date,

Continue reading [Feature Engineering] how to convert a date string to datetime