I found that the model parameters and memory usage did not decrease, and the network did not run faster. I'm confused!