3b(ii) Protocol and documentation

The Data Scientist shall document every step along the data science value chain according to a standard template. This shall include the identification of all candidate data sources, the use of and justification for each relevant source, the procedures used to combine data sources, and all steps in the data transformation pipeline. It shall also include the model selection, any procedures used to tune the hyper-parameters, the procedure employed to test the model together with its results, and finally the strategy to industrialize the model.
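As an illustration only, the items listed above could be captured in a machine-readable template. The following is a minimal sketch, not a prescribed format: the names `DataSource`, `ProjectRecord`, and `missing_sections` are hypothetical, and the field names simply mirror the items named in the principle.

```python
from dataclasses import dataclass, field, asdict
from typing import List


@dataclass
class DataSource:
    """One data source considered for the project (hypothetical structure)."""
    name: str
    used: bool
    justification: str


@dataclass
class ProjectRecord:
    """Hypothetical documentation template; one field per item in the principle."""
    data_sources: List[DataSource] = field(default_factory=list)
    combination_procedures: List[str] = field(default_factory=list)
    transformation_steps: List[str] = field(default_factory=list)
    model_selection: str = ""
    hyperparameter_tuning: str = ""
    test_procedure: str = ""
    test_results: str = ""
    industrialization_strategy: str = ""

    def missing_sections(self) -> List[str]:
        """Return the names of all sections that are still empty."""
        return [name for name, value in asdict(self).items() if not value]
```

A check such as `record.missing_sections()` could be run before a project is signed off, so that undocumented steps are flagged rather than silently skipped.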


  • Juan Bernabé Moreno says:

    Even data transformations that correct or standardize the input data shall not overwrite the original source. Ideally, the entire data transformation process is documented for knowledge-sharing and auditing purposes.
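The point raised in the comment above can be sketched in code. This is a minimal example under stated assumptions: the helper `apply_documented` and the sample rows are hypothetical, and the data is assumed to fit in memory as a list of dictionaries. Each step runs on a copy of the input, and a log entry is written per step for later auditing.

```python
import copy
from typing import Any, Callable, Dict, List, Tuple

Rows = List[Dict[str, Any]]
Step = Tuple[str, Callable[[Rows], Rows]]


def apply_documented(raw: Rows, steps: List[Step]) -> Tuple[Rows, List[Dict[str, Any]]]:
    """Apply each step to a copy of `raw` and record what every step did."""
    working = copy.deepcopy(raw)  # the original source is never modified
    log: List[Dict[str, Any]] = []
    for description, step in steps:
        rows_in = len(working)
        working = step(working)
        log.append({"step": description, "rows_in": rows_in, "rows_out": len(working)})
    return working, log


# Hypothetical input and cleaning steps, for illustration only.
raw = [{"city": " Munich "}, {"city": None}]
steps = [
    ("drop rows without a city", lambda rows: [r for r in rows if r["city"] is not None]),
    ("strip whitespace from city", lambda rows: [{**r, "city": r["city"].strip()} for r in rows]),
]
clean, log = apply_documented(raw, steps)
```

After the call, `raw` still holds the unmodified input, `clean` holds the transformed rows, and `log` lists each step with its row counts, which can be stored alongside the project documentation.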
