TY - GEN
T1 - Rotate your Networks: Better Weight Consolidation and Less Catastrophic Forgetting
AU - Liu, Xialei
AU - Masana, Marc
AU - Herranz, Luis
AU - van de Weijer, Joost
AU - Lopez, Antonio M.
AU - Bagdanov, Andrew D.
PY - 2018
Y1 - 2018
N2 - In this paper we propose an approach to avoiding catastrophic forgetting in sequential task learning scenarios. Our technique is based on a network reparameterization that approximately diagonalizes the Fisher Information Matrix of the network parameters. This reparameterization takes the form of a factorized rotation of parameter space which, when used in conjunction with Elastic Weight Consolidation (which assumes a diagonal Fisher Information Matrix), leads to significantly better performance on lifelong learning of sequential tasks. Experimental results on the MNIST, CIFAR-100, CUB-200 and Stanford-40 datasets demonstrate that we significantly improve the results of standard elastic weight consolidation, and that we obtain competitive results when compared to the state-of-the-art in lifelong learning without forgetting.
AB - In this paper we propose an approach to avoiding catastrophic forgetting in sequential task learning scenarios. Our technique is based on a network reparameterization that approximately diagonalizes the Fisher Information Matrix of the network parameters. This reparameterization takes the form of a factorized rotation of parameter space which, when used in conjunction with Elastic Weight Consolidation (which assumes a diagonal Fisher Information Matrix), leads to significantly better performance on lifelong learning of sequential tasks. Experimental results on the MNIST, CIFAR-100, CUB-200 and Stanford-40 datasets demonstrate that we significantly improve the results of standard elastic weight consolidation, and that we obtain competitive results when compared to the state-of-the-art in lifelong learning without forgetting.
UR - http://www.scopus.com/inward/record.url?eid=2-s2.0-85059769964&partnerID=MN8TOARS
U2 - 10.1109/ICPR.2018.8545895
DO - 10.1109/ICPR.2018.8545895
M3 - Conference paper
BT - Proceedings - International Conference on Pattern Recognition
ER -