Projects in the Data Space

This page is my personal curated list of relevant projects in the data industry today. Think of it as a window into my browser’s bookmarks. If there’s a cool project that you think is missing here, please send it to me at samuelspersonalemail@gmail.com and I’ll check it out!

Apache Arrow
Apache Druid
Apache Flink
Apache Iceberg
Apache Spark
Apache Superset
Dask
GraphQL
Hugo
NumPy
Pandas
Ray
Sludge
Trino