IPEX-LLM PyTorch API#

Optimize Model#

You can run any PyTorch model with optimize_model through only one-line code change to benefit from IPEX-LLM optimization, regardless of the library or API you are using.

Load Optimized Model#

To avoid high resource consumption during the loading processes of the original model, we provide save/load API to support the saving of model after low-bit optimization and the loading of the saved low-bit model. Saving and loading operations are platform-independent, regardless of their operating systems.