Adds ASTLayer support for BetterTransformer#548
Conversation
|
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. |
|
Hi @younesbelkada !
Please let me know what you think about these changes and let me what else I need to do on this topic. Thank you very much! |
|
Hi @ravenouse ! |
|
Hi @younesbelkada ! Happy New Year! I have just rebased this branch with the current optimum main branch, and I re-run the test, Please let me know what else I can do! Thank you so much! |
|
I did rebase some time ago. Could you spare some time to review this? |
|
This PR has been marked as stale because it has been open for 90 days with no activity. This thread will be automatically closed in 30 days if no further activity occurs. |


What does this PR do?
Adds
ASTLayersupport forBetterTransformerFixes Community contribution - BetterTransformer integration for more models! #20372
Questions:
"MIT/ast-finetuned-audioset-10-10-0.4593as the test model to runpytestbut some tests failed. I think one test model, like"hf-internal-testing/tiny-random-MBartModel", is needed to test theASTLayerBetterTransformer.BetterTransformerBaseLayerclass. I notice that we setself. use_geluas false for the default setting but lots of the supported transformer models actually use the gelu activation function, likebert. Could you provide more information about it?Thank you so much for your effort!!
Please let me know what else I need to do!