Exploring StackOverflow Data

Evelina Gabasova | GOTO Chicago 2018


When you’re stuck while programming, who you gonna call? StackOverflow! It’s an invaluable source of daily help for many developers. Interestingly, you can also download the entire StackOverflow data dump and let machine learning loose on it. In this talk, I’ll look at what we can learn from the crowdsourced knowledge of developers worldwide. Along the way, you will also learn about the ideas behind some machine learning algorithms that can give us insights into complex data. I will use a combination of the statistical computing language R and the functional language F# to show how you can easily access and process large-scale data the functional way.

About the speaker

Evelina Gabasova

Machine Learning Expert & Data Scientist at The Alan Turing Institute