Projects per year
Abstract
We introduce model folding, a novel data-free model compression technique that merges structurally similar neurons across layers, significantly reducing the model size without the need for fine-tuning or access to training data. Unlike existing methods, model folding preserves data statistics during compression by leveraging k-means clustering, and using novel data-free techniques to prevent variance collapse or explosion. Our theoretical framework and experiments across standard benchmarks, including ResNet18 and LLaMA-7B, demonstrate that model folding achieves comparable performance to data-driven compression techniques and outperforms recently proposed data-free methods, especially at high sparsity levels. This approach is particularly effective for compressing large-scale models, making it suitable for deployment in resource-constrained environments.
| Original language | English |
|---|---|
| Title of host publication | 13th International Conference on Learning Representations, ICLR 2025 |
| Publisher | International Conference on Learning Representations, ICLR |
| Pages | 90874-90907 |
| Number of pages | 34 |
| ISBN (Electronic) | 9798331320850 |
| Publication status | Published - 2025 |
| Event | 13th International Conference on Learning Representations, ICLR 2025 - Singapore, Singapore Duration: 24 Apr 2025 → 28 Apr 2025 |
Conference
| Conference | 13th International Conference on Learning Representations, ICLR 2025 |
|---|---|
| Country/Territory | Singapore |
| City | Singapore |
| Period | 24/04/25 → 28/04/25 |
ASJC Scopus subject areas
- Language and Linguistics
- Computer Science Applications
- Education
- Linguistics and Language
Fields of Expertise
- Information, Communication & Computing
Fingerprint
Dive into the research topics of 'Forget the data and fine-tuning! Just fold the network to compress'. Together they form a unique fingerprint.Projects
- 1 Finished
-
E-MINDS - Embedded Intelligence for wireless communication services
Wang, D. (Attendee / Assistant), Römer, K. U. (Contact person), Krisper, M. (Consortium manager resp. coordinator with external organisations) & Saukh, O. (Project manager on research unit)
1/09/22 → 31/03/25
Project: Research project
Activities
- 1 Invited talk
-
Forget the Data and Fine-Tuning! Just Fold the Network to Compress
Wang, D. (Speaker)
17 Jul 2025Activity: Talk or presentation › Invited talk › Science to science