I presume this paper is relevant to continual learning since they speak about being able to avoid catastrophic forgetting.
https://arxiv.org/abs/1910.01526
I’ll read it and see if I can use the ideas with any of the specialized algorithms I have.
Maybe you know about the paper since the first version was from last year.