Statistical Notes

Here are few statistical writings I have done during my academic years:

papers:

Italian:

English:

  • Milan House Prices : fourth place on a private Kaggle competition, 50 participants. Projects for the exam of Data Mining (Master)

presentations:

Italian:

English:

notebooks:

Italian:

  • STMicroelectronics Lab Data Challenge : Solution proposed for the Lab Data Challenge proposed by STM. The aim was to predict the presence of a person in a room from the wifi noise frequencies coming from the router. (Master)

  • Retina Cancer : first place solution on a private Kaggle competition, 8 participants, for the exam of Statistical Learning (Master)

  • POS tagger : Project made for the exam of Statistical Learning (Master). It is not a good way to train a POS tagger. Maybe If You want to train a neural network You should use directly python, and don’t switch to Keras for R.

  • Analisi di concentrazione del settore dei trasporti: exam project for Economia e Dinamica Industriale (Master)

  • Statistica Spaziale in R: R code notes from Spatial Statistics (Master)

English:

  • Kernel Clustering for text : If You try to apply common clustering algorthms to Bag of Words Text representations, You will obtain random clusters. However, when I use Kernel distances with Kmeans for the Italian Constitution articles, I get pretty good results.

  • Pytorch MNIST : a very simple exercise to train a neural network with the most common python library in the most classical and simple task (written digit classification from images).