Dataframes
Introduction
A DataFrame is a tabular data format. Conceptually, a column in a DataFrame represents a single variable for all observations, and a row represents all variables for a single observation:
var1 | var2 | var3 |
---|---|---|
obs1.var1 | obs1.var2 | obs1.var3 |
obs2.var1 | obs2.var2 | obs2.var3 |
obs3.var1 | obs3.var2 | obs3.var3 |
This way to organize data is extremely widespread, which makes it even more powerful due to the number of tools for visualization and analysis that can handle data in this format.
Overview
DataFrames.jl is a long-standing Julia Package that implements DataFrames. Is it very stable, and has a principled approach to stability and functionality. It is the recommended option for most users.
Packages
DataFrames.jl
The de-facto package for DataFrames in Julia. One of the highest quality packages in the Julia ecosystem.JuliaDB.jl
JuliaDB is unmaintained, and it's README recommends using alternatives.Tidier.jl
From the README:Tidier.jl is a data analysis package inspired by R's tidyverse and crafted specifically for Julia. Tidier.jl is a meta-package in that its functionality comes from a series of smaller packages. Installing and using Tidier.jl brings the combined functionality of each of these packages to your fingertips.
InMemoryDatasets.jl
InMemoryDatasets is a fresh effort on DataFrames in Julia. In it's release annoucement, the main improvement upon DataFrames.jl appears to be speed, but apparently without downside.DTables.jl
DTables features "Distributed table structures and data manipulation operations built on top of Dagger.jl".
This section is not yet well written. If you have used or developed Julia packages in this domain, we would love your help! Please visit the "Contributing" section of the repository that hosts this website for information on contributions.
Last modified: May 03, 2024. Built with Franklin.jl