← Back to all posts

Reading time - 7 mins

1. ML Picks of the Week

A weekly dose of ML tools, concepts & interview prep. All in under 1 minute.

2. Technical ML Section

Learn 5 Practical Tips for Time Series Analysis from my experience

3. Career ML Section

Learn 3 tips that can help you grow your ML career income

1. ML Picks of the Week

🥇ML Tool

Python Library StatsForecast

StatsForecast is a computationally efficient library for time series forecasting, built for scale and performance.

It offers state-of-the-art implementations of classic statistical models like ARIMA, ETS, and AutoETS — all written in Numba for speed.

StatsForecast is especially powerful for:

✅ Running thousands of forecasts in parallel (yes, really fast)
✅ Getting accurate baselines for production models
✅ Automating time series pipelines with minimal code

If you’re still waiting minutes for ARIMA to run in loops, switch to StatsForecast.

📈 ML Concept

Gini Impurity

The Gini impurity is a metric used in decision trees to decide how good a split is.

It measures how mixed the classes are in a node — the lower the Gini value, the purer the node (i.e., more of the same class).

It’s fast to compute and works especially well for classification tasks.

Gini is the default split criterion in many algorithms like CART and Random Forest.

Gini is especially useful for:

✅ Building fast and interpretable classification trees
✅ Selecting the most informative features
✅ Creating consistent and efficient splits in large datasets

Read a more detailed explanation of Gini Impurity HERE.

🤔 ML Interview Question

What is one-hot encoding?

One-hot encoding is a common way to convert categorical variables into a format that machine learning models can work with.

It creates a new binary column for each unique category, assigning 1 to the matching category and 0 to the rest.

Example:
Color = Red → [Red: 1, Green: 0, Blue: 0]

Why is this important?

→ Most ML models (like linear models or neural nets) can’t work directly with text.
→ One-hot ensures that the model doesn’t assume any order or ranking between categories.
→ It’s simple, reliable, and widely used in preprocessing pipelines.

Read a more detailed intro to one-hot encoding HERE.

2. Technical ML Section

5 Practical Tips for Time Series Analysis

Analyzing time series might seem easy. But in real-world cases, it's not because of many reasons, like:

Nan values
Outliers
Noise
Inconsistent timestamps
Long, hard-to-analyze sequences

In this newsletter, you'll learn 5 practical tips I learned over 7 years in time series analysis. They'll help you to extract more value from data in your projects.

1️⃣ Before computing correlation, remove outliers

This tip is especially important when you are selecting features for your model or trying to understand feature relationships.

Imagine you have 2 time series, both having outliers as in the figure below. It can be 2 features or 1 target and 1 feature.

Time Series with outliers

You see that there are some outliers, but they might not be significant for your analysis, because compared to the entire sequence, their fraction is 1-2%.

If you compute the Pearson correlation between these series, you will get 0.32.

However, if you first remove them, as shown in the figure below, your correlation will reach 0.42. Big difference for practical use cases.

Time series without outliers

2️⃣ Apply resample (groupby) over a bigger time horizon than the original sampling frequency

This usually helps to remove noise and see the general relationships much better, including correlation.

In pandas, you can do this with "resample" method in Pandas.

Another alternative is "groupby".

Using resampling for our dataset significantly improves observability of the relationship as well as the correlation value (from 0.32 to 0.65).

Time series pandas groupby

3️⃣ Don’t use scatter, use regression and contour (kdeplot in seaborn) plots

This tip saved me a lot of time from the troubles.

Pro tip: combine regression and contour plots with resampled data from Tip 2.

Here is the original dataset with scatter, regression, and contour plots.

Scatter and regression plots

Here are the same plots on the grouped data. Now, compare the regression and contour plot below with the scatter plot above.

Way easier to see the relationships.

Regression plot for time series

4️⃣ Use Median filter to see the relationships through noise

Often, when you visualize the raw data, it's really hard to see if the data goes down/up/straight due to high noise.

Median filter is a very quick and easy way to see the time series trends and get an overall view of the time series behavior.

You can see how filtering the noise helps us to see the trends and cyclic behavior of our time series below.

Median Filter for time series data

To apply a median filter, you can use pandas rolling.

5️⃣ Use Plotly to scroll and zoom in through the time series

Plotly library makes it much quicker to go through the data and find insights rather than zooming and scrolling through changing the plotted indices in matplotlilb.

It does not mean that you need to use it for your entire analysis, but for trends and oscillations analysis over a time period, Plotly is very handy.

You can see how easy it is from the figure below.

Plotly for time series visualization

That's it for the Technical Part!

Follow me on LinkedIn for more daily ML breakdowns.

Let's dive into the Career part!

2. ML Career Section

3 tips for your ML income growth

These tips are based on 7+ years in ML and coaching 70+ people.

They work — and they work especially well if you’re in your first one to six years in a Machine Learning career

The 3 tips are:

Learn how to lead ML teams
Specialize in 1–3 ML or business domains
Be present online and network actively

Why do these actions help?

→ Leadership = Higher pay
You don’t need a manager title.

But you do need to take ownership, lead small projects, and drive outcomes.

People who lead, earn more.

→ Domain depth = Higher value
Trying to learn “everything ML” leads nowhere.

Knowing 1–2 domains deeply makes you stand out.

Companies pay more for focused expertise.

→ Visibility = More opportunities
If no one knows what you do, no one can hire you.

Posting online, sharing projects, and connecting with others bring jobs, freelance gigs, and credibility.

This is the rule of 2025: "If you don't exist online, you don't exist".

Hope this helps in your ML Career Journey!

Useful articles

That is it for this week!

If you haven’t yet, follow me on LinkedIn where I share Technical and Career ML content every day!

Whenever you're ready, there are 3 ways I can help you:

Join Maistermind for 1 weekly piece with 2 ML guides:

1. Technical ML tutorial or skill learning guide
2. Tips list to grow ML career, LinkedIn, income

Join here!

#10: 5 Practical Tips for Time Series Analysis | 3 Tips for your ML Income Growth

Reading time - 7 mins