Summary

Fine-tuning deep learning models leads to small differences in parameter values in L2 space (1). The foundation model and fine-tuned model often occupy the same basin of the loss landscape. Output features of each layer were examined using centered kernel alignment.

Details

models/layerconv1layer 1layer 2layer 3layer 4
P-T & P0.62250.45920.28960.18770.0453
P-T & P-T0.67100.82300.60520.40890.1628
P-T & RI-T0.00360.00110.00220.00030.0808
RI-T & RI-T0.00160.00880.00040.00040.0424
Ref (1)

See also

1.
Neyshabur B, Sedghi H, Zhang C. What is being transferred in transfer learning? Advances in Neural Information Processing Systems. 2020;33:512–23. Available from: https://proceedings.neurips.cc/paper/2020/hash/0607f4c705595b911a4f3e7a127b44e0-Abstract.html