Back to publications

Smooth Model Compression without Fine-Tunin

Paper Details

Published: 2025/05/31

Paper Links

HTML PDF

Compressing and pruning large machine learning models has become a critical step towards their deployment in real-world applications. Standard pruning and compression techniques are typically designed without taking the structure of the network’s weights into account, limiting their effectiveness. We explore the impact of smooth regularization on neural network training and model compression. By applying nuclear norm, first- and second-order derivative penalties of the weights during training, we encourage structured smoothness while preserving predictive performance on par with non-smooth models. We find that standard pruning methods often perform better when applied to these smooth models. Building on this observation, we apply a Singular-Value-Decomposition-based compression method that exploits the underlying smooth structure and approximates the model’s weight tensors by smaller low-rank tensors. Our approach enables state-of-the-art compression without any fine-tuning– reaching up to 91% accuracy on a smooth ResNet-18 on CIFAR-10 with 70% fewer parameters.

Authors

Ander Biguri

University of Cambridge

Senior Research Associate

Read Bio

Carola-Bibiane Schönlieb

Professor of Applied Mathematics and Head of the Cambridge Image Analysis (CIA) Group

Read Bio

Smooth Model Compression without Fine-Tunin

Paper Details

Paper Links

Authors

Ander Biguri

Carola-Bibiane Schönlieb

Christina Runkel

Natacha Kuete Meli

Jovita Lukasik

Michael Moeller

Smooth Model Compression without Fine-Tunin

Paper Details

Paper Links

Authors

Ander Biguri

Carola-Bibiane Schönlieb

Christina Runkel

Natacha Kuete Meli

Jovita Lukasik

Michael Moeller

Help Accelerate Research

Essential Cookies Always Required

Research Analytics

Essential Cookies
Always Required