Particular attention must be paid to statistical biases in the dataset. These biases originate from the overrepresentation of certain categories of data (e.g. a dataset containing mostly IT profiles, mostly senior profiles, or more males than females).
Unless precautions are taken, a deep learning model tends to exploit these biases to better fit the data.
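As an illustration, overrepresentation can be checked before training by comparing each category's share of the dataset against a baseline. The sketch below is a minimal example under stated assumptions: the record fields (`sector`, `gender`), the helper names, the uniform baseline, and the `tolerance` factor are all hypothetical and are not taken from the scoring engine itself.

```python
from collections import Counter

def category_shares(records, field):
    """Return the fraction of records falling in each value of `field`."""
    counts = Counter(r[field] for r in records)
    total = sum(counts.values())
    return {value: n / total for value, n in counts.items()}

def overrepresented(shares, tolerance=1.5):
    """List categories whose share exceeds `tolerance` times a uniform share.

    The uniform baseline is an assumption; a real audit might compare
    against the distribution of the target population instead.
    """
    if not shares:
        return []
    uniform = 1 / len(shares)
    return [value for value, share in shares.items() if share > tolerance * uniform]

# Hypothetical toy dataset, for illustration only.
profiles = [
    {"sector": "IT", "gender": "M"},
    {"sector": "IT", "gender": "M"},
    {"sector": "IT", "gender": "M"},
    {"sector": "IT", "gender": "F"},
    {"sector": "finance", "gender": "M"},
    {"sector": "health", "gender": "M"},
]

for field in ("sector", "gender"):
    print(field, overrepresented(category_shares(profiles, field)))
```

A check of this kind only reveals imbalance in the fields that are inspected; it does not by itself show that a trained model relies on them.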
To successfully retrain an unbiased scoring engine, the requirements are: