From (Big) Data Mess to Data as an Innovation Enabler

Olaf Zschiedrich | GOTO Berlin 2018

Transcript

It's no secret that collecting and processing data is a double-edged sword. On one hand, it is the enabler of AI and ML applications that drive the modern organisation forward. On the other, it takes constant effort to maintain its accuracy and usefulness and extreme diligence to make sure that it doesn't get into the wrong hands. This talk will look at the data journey of one of the world's largest internet companies, OLX Group. From data collection over data democratisation to data products and data innovation in a platform with as many monthly active users as twitter.

We will cover:

  • How to collect and store billions of events and records per day
  • How to aggregate data from multiple platforms
  • How to design a data lake/reservoir architecture in AWS cloud
  • How to give each and everyone access to the data that he or she needs
  • How to distribute data in a secure and compliant manner
  • How to build a scalable, easy to use reporting infrastructure
  • How to drive data innovation and data products with the help of AWS sagemaker, tensorflow and other ML tools

About the speakers