2006

Model Compression

C. Bucilua, Rich Caruana, Alexandru Niculescu Mizil

citations

Cite Score

65

AI summary

This paper introduces a model compression technique that trains compact neural nets to mimic ensemble selection by labeling a large unlabeled dataset, or generating synthetic cases using MUNGE, to achieve comparable performance with significantly smaller and faster models.

Main Contributions

  • Introduces a model compression technique to train compact models that mimic larger models.
  • Presents MUNGE, a novel method for generating synthetic data to train the compact models when unlabeled data is scarce.
  • Demonstrates that neural nets can be trained to achieve nearly the same performance as ensemble selection models, while being significantly smaller and faster.
  • Achieves a 1000x reduction in model size and a 1000x increase in speed with negligible loss in performance on several benchmark datasets.
  • Highlights the effectiveness of MUNGE for generating pseudo-data, showing its superiority over RANDOM and NBE methods.

Abstract

Often the best performing supervised learning models are ensembles of hundreds or thousands of base-level classifiers. Unfortunately, the space required to store this many classifiers, and the time required to execute them at run-time, prohibits their use in applications where test sets are large (e.g. Google), where storage space is at a premium (e.g. PDAs), and where computational power is limited (e.g. hearing aids). We present a method for "compressing" large, complex ensembles into smaller, faster models, usually without significant loss in performance.

Citation Graph

Loading graph...

References [19]

Sort:
Filter:

L. Breiman - 1994

4 papers in library cite

A. P. Dempster, N. M. Laird, D. B. Rubin - 1977

6 papers in library cite

L. Breiman - 2001

5 papers in library cite

D. Wolpert - 1992

3 papers in library cite

P. Domingos - 2000

1 paper in library cites

P. Melville, R. Mooney - 2003

1 paper in library cites

I. Witten, E. Frank - 1999

1 paper in library cites

B. Silverman - 1986

1 paper in library cites

Rich Caruana, Alexandru Niculescu Mizil, G. Crew, A. Ksikes - 2004

1 paper in library cites

M. Craven, J. Shavlik - 1996

1 paper in library cites

W. Buntine, Rich Caruana - 1991

1 paper in library cites

P. Domingos - 1997

1 paper in library cites

T. Joachims - 1999

1 paper in library cites

D. Lowd, P. Domingos - 2005

1 paper in library cites

A. Zell, N. Mache, R. Huebner, M. Schmalzl, T. Sommer, T. Korb - 1992

1 paper in library cites

A. Gualtieri, S. Chettri, R. Cromp, L. Johnson - 1999

1 paper in library cites

R. Schapire - 2001

1 paper in library cites

C. Blake, C. Merz - 1998

1 paper in library cites

X. Zeng, T. Martinez - 2000

1 paper in library cites

Cited by

2

papers in your library

Cites

1

papers in your library

Read

on April 27, 2025

Your review

Tags

Paper Aliases

No aliases