The easy to use AIminify python library will reduce the model size and increase the speed of inference on GPUs or CPUs. The library automatically employs techniques such as quantization and pruning.
Currently the following libraries are supported:
- Pytorch
- Tensorflow
- Keras