Proper data format for uploading data to Datashift
Datashift requires your data to be of a standard format and it is possible that your datafiles will need to be cleaned before you can add them to your projects.
- A datafile must consist of one table of data (limit of 1,048,576 rows and 16,384 columns).
- Must contain on 1 row of headers.
- Each row must contain the same number of columns (cells can be empty but must be delimited by commas).
- The data in each column must represent the exact same source of data.
Use descriptive header names in your data tables (such as Temperature, Height, Speed) so your data will be easily identifiable when doing analysis and creating visuals.
Here is an example of a proper datafile:
If your datafile contains a header it must be the very first line in your datafile and each column must represent the exact same source of data.
This next example illustrates a datafile which is poorly formatted which would fail when attempting to upload to Datashift.
Here there are two rows of headers instead of the maximum amount of one. Also, each column doesn't represent the same data. Part of a column is for data from "Station 1" while further down the column the data switches to "Station 2".
Simply your data as much as possible. Don't be afraid to break up your data into multiples files, you'll be able to link them together in your project!