Data preparation for NeurEco Regression with the command line interface#

The command line interface expects the data for model construction or evaluation in form of paths to files containing the data.

  • The supported formats are:

    • CSV with “;” or “,” separator;

    • NumPy .npy

    • MATLAB MAT-files .mat

  • Files contain the numerical data, allowed types: int, float, double

  • Any input file should contain a table with:

    • Number of lines equal to a number of samples

    • Number of columns equal to a number of input features

    • CSV files could have one additional line for a header

  • Any output file should contain a table with:

    • Number of lines equal to a number of samples

    • Number of columns equal to a number of output features

    • CSV files could have one additional line for a header

  • input file and the corresponding output file should have the same number of samples

  • The data can be provided in chunks, in multiple input and output files. In this case pay attention to preserving the correspondence between input and output files