The Data Scientist shall retain unaltered copies of the original data and keep a record of every transformation applied across the entire data value chain (including ingestion, cleansing, feature extraction, scaling / normalization, feature selection, etc.).
Comments
Even data transformations that correct or standardize the input data shall not overwrite the original source. This allows the entire process to be audited and makes it possible to identify where potential bias may have been introduced.
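One way to meet this requirement in practice is sketched below. It is a minimal illustration, not a prescribed implementation: the class `AuditedDataset`, the helper `fingerprint`, the two example transformations, and the sample records are all hypothetical names and data introduced here. The sketch keeps a private copy of the original data, applies each transformation to a copy, and appends a log entry with the step name, its parameters, and SHA-256 digests of the input and output.

```python
import copy
import hashlib
import json
from dataclasses import dataclass, field
from typing import Any, Callable


def fingerprint(data: Any) -> str:
    """Return a SHA-256 digest of the JSON-serialised data (hypothetical helper)."""
    payload = json.dumps(data, sort_keys=True).encode("utf-8")
    return hashlib.sha256(payload).hexdigest()


@dataclass
class AuditedDataset:
    """Hypothetical wrapper: untouched original, working copy, transformation log."""

    original: list
    current: list = field(init=False)
    log: list = field(default_factory=list)

    def __post_init__(self) -> None:
        # Store a private copy so later changes by the caller cannot alter it.
        self.original = copy.deepcopy(self.original)
        self.current = copy.deepcopy(self.original)
        self.original_digest = fingerprint(self.original)

    def apply(self, step: str, func: Callable[..., list], **params: Any) -> None:
        """Run one transformation on a copy of the working data and record it."""
        before = fingerprint(self.current)
        # Pass a copy so a function that mutates its input cannot touch stored state.
        result = func(copy.deepcopy(self.current), **params)
        self.log.append(
            {
                "step": step,
                "params": params,
                "input_sha256": before,
                "output_sha256": fingerprint(result),
            }
        )
        self.current = result

    def original_is_intact(self) -> bool:
        """Check that the stored original still matches its initial digest."""
        return fingerprint(self.original) == self.original_digest


def drop_missing(rows: list) -> list:
    """Cleansing step (illustrative): remove rows with no age value."""
    return [r for r in rows if r["age"] is not None]


def min_max_scale(rows: list, key: str) -> list:
    """Scaling step (illustrative): rescale one numeric column to [0, 1]."""
    values = [r[key] for r in rows]
    lo, hi = min(values), max(values)
    return [{**r, key: (r[key] - lo) / (hi - lo)} for r in rows]


# Made-up sample records, used only to exercise the sketch.
raw = [{"id": 1, "age": 20}, {"id": 2, "age": None}, {"id": 3, "age": 40}]

ds = AuditedDataset(raw)
ds.apply("cleansing: drop rows with missing age", drop_missing)
ds.apply("scaling: min-max on age", min_max_scale, key="age")

for entry in ds.log:
    print(entry["step"], entry["params"])
print("original intact:", ds.original_is_intact())
```

Because each log entry stores the digest of its input and output, an auditor can confirm that the steps form an unbroken chain starting from the original data, and can replay the steps one at a time to locate the stage at which a bias first appears. In a real pipeline the original would normally be kept in read-only storage and the log persisted alongside it; the in-memory copy here only keeps the example short.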