Use relevant bits from anonym2 branch. Maybe mask with proba=0.1. Important: use validation set where no masking. The idea is to monitor the effect of masking on the generalization of unmasked data and compare with no masking on unmaked (the usual setup).