Uncluster Your Data Science Using Vaex

Jovan Veljanoski • Maarten Breddels | GOTO Copenhagen 2021

Share on:
linkedin facebook
Copied!

Transcript

Maarten is an expert at solving problems with expertise ranging from fast numerical computation, API design to 3D visualization.

Would you like to build an snappy dashboard visualising hundreds of millions of data points, or interactively explore hundreds of Gigabytes of data, all of that using a single machine?

Meet Vaex - an out of core DataFrame library in Python that can do all the typical data manipulations, filtering, and aggregations on a billion rows in real time & on a single computer. This approach empowers your team and allows them to focus much more on the business problem, as it removes the large DevOps overhead of configuring and maintaining a cluster.

Vaex fully supports Apache Arrow, which both facilitates the interoperability with other systems and enables storage and manipulation of more complex data structures like lists and dicts.

Jovan and Maarten will show how you can access and stream your data directly from the Cloud — perfect for building cloud services!

About the speakers

Jovan Veljanoski
Jovan Veljanoski

Machine learning specialist at Cloud Technology Solutions and co-founder of vaex.io

Maarten Breddels
Maarten Breddels

Independent developer and consultant, co-founder of vaex.io