3c. Original data preservation

The Data Scientist shall retain unaltered copies of the original data and keep a record of every transformation applied along the data value chain (including ingestion, cleansing, feature extraction, scaling / normalization, feature selection, etc.).
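
One way this principle might look in practice is sketched below in Python. It is only an illustration under stated assumptions, not a prescribed implementation: the names TrackedDataset, fingerprint and apply are hypothetical, the data is assumed to be small enough to hold in memory as a list of records, and a SHA-256 hash of a JSON dump is assumed to be an acceptable way to show that the original has not changed.

```python
import copy
import hashlib
import json
from dataclasses import dataclass, field
from typing import Callable, List

Rows = List[dict]


def fingerprint(rows: Rows) -> str:
    """Return a SHA-256 hash of a canonical JSON dump of the rows."""
    payload = json.dumps(rows, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


@dataclass
class TrackedDataset:
    """Hypothetical helper: keeps the original rows and a transformation log."""

    original: Rows                            # retained copy, never reassigned
    current: Rows = field(init=False)         # working copy that steps act on
    log: List[dict] = field(default_factory=list)

    def __post_init__(self) -> None:
        self.original_hash = fingerprint(self.original)
        self.current = copy.deepcopy(self.original)

    def apply(self, step: str, description: str,
              fn: Callable[[Rows], Rows]) -> None:
        """Run one transformation on a copy and record what was done."""
        before = fingerprint(self.current)
        self.current = fn(copy.deepcopy(self.current))
        self.log.append({
            "step": step,
            "description": description,
            "input_sha256": before,
            "output_sha256": fingerprint(self.current),
        })

    def original_is_intact(self) -> bool:
        """Check that the retained original still matches its first hash."""
        return fingerprint(self.original) == self.original_hash


# Made-up example records, for illustration only.
raw = [{"age": "34", "income": 52000.0}, {"age": "n/a", "income": 61000.0}]
ds = TrackedDataset(original=raw)
ds.apply("cleansing", "drop rows with a non-numeric age",
         lambda rows: [r for r in rows if r["age"].isdigit()])
ds.apply("scaling", "express income in thousands",
         lambda rows: [{**r, "income": r["income"] / 1000} for r in rows])
```

Each log entry ties a named step to the hash of its input and output, so a reviewer could trace the chain from the untouched original to the final features. In a real project the original would more likely live in read-only storage and the log in a versioned file; both are choices this sketch leaves open.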


  Juan Bernabé Moreno says:

    Even data transformations that correct or standardize the input data shall not overwrite the original source. This allows the entire process to be audited and makes it possible to identify where potential biases have been introduced.

