
Making Sense of Big Data

Model Compression with TensorFlow Lite: A Look into Reducing Model Size

Common pitfalls you have to know to apply model compression seamlessly

Cawin Chan · Published in TDS Archive · 10 min read · Jan 8, 2021
Photo by John Cameron on Unsplash

Why is Model Compression important?

A significant problem in the arms race to produce ever more accurate models is complexity, which leads to the problem of size. These models are usually huge and resource-intensive: they take up more space in memory and are slower at prediction than smaller models.

The Problem of Model Size

A large model size is a common byproduct of pushing the limits of model accuracy on unseen data in deep learning applications. For example, with more nodes we can detect subtler features in the dataset. However, for project requirements such as using AI in embedded systems that depend on fast predictions, we are limited by the available computational resources. Furthermore, many edge devices lack networking capabilities, so we cannot offload the work to cloud computing. The result is that massive models, which would take too long to produce meaningful predictions, simply cannot be used.
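To give a rough sense of what model compression with TensorFlow Lite looks like in practice, here is a minimal sketch (the model here is a placeholder, not one from this article): a trained Keras model is converted to the TensorFlow Lite format with the default optimization flag, which applies dynamic-range quantization and typically shrinks the file to roughly a quarter of its original size.

```python
import tensorflow as tf

# Placeholder model for illustration; any trained Keras model works here.
model = tf.keras.Sequential([
    tf.keras.layers.Dense(128, activation="relu", input_shape=(784,)),
    tf.keras.layers.Dense(10, activation="softmax"),
])

# Convert to TensorFlow Lite with the default optimization,
# which quantizes weights from 32-bit floats to 8-bit integers.
converter = tf.lite.TFLiteConverter.from_keras_model(model)
converter.optimizations = [tf.lite.Optimize.DEFAULT]
tflite_model = converter.convert()

# Write the compressed model to disk for deployment on an edge device.
with open("model.tflite", "wb") as f:
    f.write(tflite_model)

print(f"Compressed model size: {len(tflite_model)} bytes")
```

The size reduction and any accuracy trade-off depend on the model architecture and the quantization scheme chosen, which is exactly where the pitfalls discussed below come in.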


