Distributed data analysis in pure Julia
Big data for everything from your laptop to your cluster

Fast

JuliaDB is Julia all the way down, meaning table operations use Julia's just-in-time compiler so that user-defined functions are fast.
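As a rough illustration of what this looks like in practice, the sketch below passes an ordinary Julia function to JuliaDB's `filter` and `map`. The table contents, the column names `id` and `price`, and the helper `discounted` are made up for this example.

```julia
using JuliaDB

# Hypothetical toy table; column names and values are illustrative only.
t = table((id = [1, 2, 3, 4], price = [10.0, 25.0, 7.5, 40.0]))

# A plain Julia function used as a user-defined function (UDF).
# It is compiled by Julia's JIT like any other function.
discounted(p) = p > 20 ? 0.9 * p : p

# Keep only the rows whose discounted price exceeds 9.
kept = filter(p -> discounted(p) > 9, t; select = :price)
prices = select(kept, :price)

# Apply the UDF to every row; each row is passed as a named tuple.
all_discounted = map(row -> discounted(row.price), t)
```

No wrapper or registration step is needed here: the function is passed directly to the table operation.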

Distributed

JuliaDB seamlessly scales to fully utilize any machine or cluster with almost no setup. Use your laptop for prototyping and move to production with JuliaRun.
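A minimal sketch of the laptop workflow, assuming two local worker processes (the count is arbitrary) and a small hypothetical table with made-up columns `k` and `v`. It uses Julia's standard `Distributed` library to start workers and JuliaDB's `distribute` to split the table across them.

```julia
using Distributed

# Start two local worker processes; the number is an arbitrary choice.
addprocs(2)

# Load JuliaDB on the master process and on every worker.
@everywhere using JuliaDB

# Hypothetical in-memory table; names and values are illustrative only.
t = table((k = [1, 1, 2, 2], v = [1.0, 2.0, 3.0, 4.0]))

# Split the table into two chunks spread across the workers.
dt = distribute(t, 2)

# The same reduce call used on an in-memory table also works on a distributed one.
total = reduce(+, dt; select = :v)
```

The table code itself does not change between the in-memory and distributed cases; only the setup lines that add workers differ.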

Batteries Included

JuliaDB can quickly read many CSV files at once, save the data in an efficient binary format, extract statistics in parallel via integration with OnlineStats, perform feature engineering, and more.
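The sketch below walks through that pipeline on two tiny CSV files written to a temporary directory. The file names, the columns `id` and `x`, and the output directory name `bin` are assumptions made for the example; `loadtable`, `load`, and `reduce` with an OnlineStats `Mean` are used as described in the JuliaDB documentation.

```julia
using JuliaDB, OnlineStats

# Write two small, hypothetical CSV files into a temporary directory.
csvdir = mktempdir()
write(joinpath(csvdir, "a.csv"), "id,x\n1,1.0\n2,2.0\n")
write(joinpath(csvdir, "b.csv"), "id,x\n3,3.0\n4,4.0\n")

# Read every CSV file in the directory into one table and also store
# the parsed data in binary form under `bindir`.
bindir = joinpath(mktempdir(), "bin")
t = loadtable(csvdir; output = bindir)

# Later sessions can reload the binary data without re-parsing the CSV files.
reloaded = load(bindir)

# Compute a running mean of column x with an OnlineStats estimator.
m = reduce(Mean(), reloaded; select = :x)
mean_x = value(m)
```

Because OnlineStats estimators can be merged, the same `reduce` call can combine partial results computed on separate chunks of the data.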

Resources

Features

Open Source
Just-in-Time Compilation
Persistence
Distributed/Parallel Computing
Statistics for Big Data
Fast/Compiled UDFs

Comparisons

See comparisons with other similar packages

Documentation

JuliaDB API
JuliaDB is a
Project