Efficient self-attention with smart pruning for sustainable large language models
<p dir="ltr">Large Language Models (LLMs) have revolutionized artificial intelligence by enabling multitasking across diverse fields. However, their high computational demands result in significant environmental impacts, particularly in terms of energy and water consumption. This pap...
Saved in:
| Main Author: | Samir Brahim Belhaouari (9427347) (author) |
|---|---|
| Other Authors: | Insaf Kraidia (19198012) (author) |
| Published: |
2025
|
| Subjects: | |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Dual-attention Network for View-invariant Action Recognition
by: Gedamu Alemu Kumie (19273711)
Published: (2023) -
Reinforcement learning-based dynamic pruning for distributed inference via explainable AI in healthcare IoT systems
by: Emna Baccour (16896366)
Published: (2024) -
Enhancing Cross-Language Multimodal Emotion Recognition With Dual Attention Transformers
by: Syed Aun Muhammad Zaidi (22225033)
Published: (2024) -
A survey of transformers and large language models for ECG diagnosis: advances, challenges, and future directions
by: Mohammed Yusuf Ansari (16904523)
Published: (2025) -
Diffuse large B‐cell lymphoma presenting with pulmonary artery compression symptoms, case reports
by: Mhd Baraa Habib (11721774)
Published: (2024)