Evaluating rate of convergence #1071
-
|
Hi Everyone, I'm researching how continual learning methods can help improve the speed at which we converge to a solution. The idea is that training a model on Task A and using the same model to train Task B could result in Task B converging to a solution quicker by reusing the parameters from Task A. In order to test this, I've trained a model using the Joint Strategy on Task B i.e. training a model from scratch comparing this to a Naive strategy where I'd like the first experience to train the model on Task A and then the second experience on Task B. For the Naive strategy I've set up the following, task B in this case is training the model to recognize classes 2 and 3: The problem is this line: This line bounds the targets to 0,1 for the second experience instead of 2,3 because the model only expects 2 classes, so targets>2 results in an index error. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 1 reply
-
|
You can remove that line since there is no need to change the targets. You can either use an |
Beta Was this translation helpful? Give feedback.
You can remove that line since there is no need to change the targets. You can either use an
IncrementalClassifier, which automatically expands the head with new classes, or use aMultiHeadClassifier.