BetterTransformer integration for more models!
BetterTransformer API provides faster inference on CPU & GPU through a simple interface!
Models can benefit from very interesting speedups using a one liner and by making sure to install the latest version of PyTorch. A complete guideline on how to convert a new model has been created on the BetterTransformer documentation!
Here is a list of models that could be potentially supported, pick one of the architecture below and let's discuss about the conversion!
Text models 🖊️ :
Vision models 📷 :
Audio models 🔉 :
Let us also know if you think that some architectures can be supported that we missed. Note that for encoder-decoder based models below, we expect to convert the encoder only.
Support for decoder-based models coming soon!
cc @michaelbenayoun @fxmarty
huggingface/optimum#488
BetterTransformerintegration for more models!BetterTransformerAPI provides faster inference on CPU & GPU through a simple interface!Models can benefit from very interesting speedups using a one liner and by making sure to install the latest version of PyTorch. A complete guideline on how to convert a new model has been created on the BetterTransformer documentation!
Here is a list of models that could be potentially supported, pick one of the architecture below and let's discuss about the conversion!
Text models 🖊️ :
Bettertransformersupport for FSMT optimum#494MobileBERTsupport forBetterTransformeroptimum#506MBartsupport forBetterTransformeroptimum#516 @ravenouseVision models 📷 :
BetterTransformersupport for ViLT architecture optimum#508Audio models 🔉 :
ASTLayersupport forBetterTransformeroptimum#548Let us also know if you think that some architectures can be supported that we missed. Note that for encoder-decoder based models below, we expect to convert the encoder only.
Support for decoder-based models coming soon!
cc @michaelbenayoun @fxmarty
huggingface/optimum#488