Compressing Large-Scale Transformer-Based Models: A Case Study on BERT
<div><p>Pre-trained Transformer-based models have achieved state-of-the-art performance for various Natural Language Processing (NLP) tasks. However, these models often have billions of parameters, and thus are too resource- hungry and computation-intensive to suit low- capability device...
Saved in:
| Main Author: | |
|---|---|
| Other Authors: | , , , , , , , |
| Published: |
2021
|
| Subjects: | |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Be the first to leave a comment!