(Optional) Step 1: Data AnalyzerΒΆ
PyTrackDat includes an automatic data analyzer which can read a one or a series of CSV files in (i.e. data files with variable names in the headers), as well as their desired relation names, and generate a PyTrackDat design file, which contains human-readable CSV-formatted instructions for the structure of the database. This design file is not final, and should be checked over and added to/edited by a human. However, it provides a good starting point for generating a database for a particular dataset.
To run the data analyzer on one or more CSV-formatted data files, run the following command:
ptd-analyze design.csv sample_type_1 samples1.csv sample_type_2 samples2.csv [...]
Where design.csv
is the name of the design file to output, and
sample_type_1
and sample_type_2
are singular terms for the types of
entries stored in samples1.csv
and samples2.csv
, respectively. Feel
free to add more sample types (with corresponding data files) as necessary for
your dataset, or leave out sample_type_2
and samples2.csv
if only one
data type is necessary for the database.