Note: You can find the full script of this tiny work here (GitHub repository). You can also find other projects I’ve been working on lately. ❤
First thing you need to know:
Italian guys love playing cards
and, as I am an Italian guy, I’m not an exception. The second thing you need to know is more bitter. This social distancing is forcing people to stay safe and in some cases… alone. A good way to spend your time playing cards when no one else is around you could be playing solitaires. Solitaires are those kinds of card games that…
Note: The entire code can be found (and loved) here
I’m a musician, and a data scientist.
I spend my days writing codes and studying statistical theorems. Then (when it gets dark outside) I like to write music.
But it is possible to write music while coding?
The answer is yes, and I’m about to show you how.
Note: If you are interested on the entire process, I want to marry you. If this is not the case, you can skip to the 4., 5. and 6. points to see the algorithm and its results.
Note: This article is a part of a bigger study. You can find here and love it till death do you guys part.
The majestic gas ball that is the Sun has crucial effects in our lives. We know and live this life thanks to this star and it is close enough to have magnetic and deep study about it too.
In particular, solar flares are sudden flashes that occur on the Sun. You may think that in your life you have more serious stuff to think about, and you are probably right. …
Note: This report is part of a bigger project about climate change that can be seen (and hopefully loved) in this GitHub repository.
When some time ago I’ve heard from my television about the last Australian bushfire season, it was really terrible to hear. I was on another project at that time, but I’ve put this on my to-do-list, promising to myself that I would have worked on that to get an unbiased and data-driven opinion about climate change.
Let’s get started.
0. The libraries
Here’s the collection of the libraries…
Note: This article is a part of a bigger project that can be seen and hopefully loved in this GitHub repository
Let’s pretend for a second that Machine Learning models are real human beings: none of them is perfect (besides you, of course).
Some models could be too anxious, someone too jealous, someone too arrogant. The real magic happens when you fall in love with someone that is able to see your weak points and helps you improve them, and he/she emphasises your good sides. This is the exact idea of Ensemble Learning.
In fact, it is based on the…
Every data scientist, especially the ones that find themselves to work with Big Data, knows the importance of dimensionality reduction. If you have a dataset that has a large amount of columns and you have a Machine Learning task to complete:
So it is important to know and understand some dimensionality reduction techniques and one of the most famous one is the Principal Component Analysis (P.C.A.).
This algorithm projects your data into another dimension, but with lower dimensionality. Speaking in…
Noise is so difficult to treat, every data scientist knows that.
The fact is that, as one dear friend of mine loves to say,
“The hardest part of getting what you want is figuring out what it is”
Indeed, we can’t specify what noise really is. As a physicist, I find myself in the situation of studying a dataset and trying to understand if my data has a physical sense. When a clear pattern can’t be identified in a part of my data (or my signal) , I tend to classify that part as “noise”. But, this approach could be…
One of the sentences that my professor used to say in high school was that “History repeats itself” . The sense of this sentence is obviously related to the fact that we should learn from history, thus being able not to do the same mistakes that we have done in the past.
Now let’s talk about science. If you have a time series, then you have your data for a (preferably) long time. Let’s assume for a second that history actually does reproduce itself. That will mean that by simply replicating the signal you will extend your data, thus obtaining…
If you are a data scientist, one of your typical task is to analyze a certain signal and find its peaks. This may be your goal for a ton of reasons, but the bigger one is that peaks somehow tend to show a certain property of your signal. It is not surprising that a library that helps you finding peaks does already exist in Python, and it is SciPy (with its find_peaks function). …
When you are a kid, sometimes your overprotective mother makes you feel beautiful, smart and kind. Of course, if you are one of those kids, you are confident that everyone thinks exactly the same thing that your mother thinks about you, but when you grow up and you go to school, sometimes your teacher tells you that you are acting wrong, and you are not so kind or smart or beautiful! In that moment you need to realize that maybe your mother loves you too much and gave you a false impression of yourself: your model is overfitting :)
If…