Efficient self-attention with smart pruning for sustainable large language models
Large Language Models (LLMs) have revolutionized artificial intelligence by enabling multitasking across diverse fields. However, their high computational demands result in significant environmental impacts, particularly in terms of energy and water consumption. This pap...
| Main Author: | Samir Brahim Belhaouari (9427347) (author) |
|---|---|
| Other Authors: | Insaf Kraidia (19198012) (author) |
| Published: | 2025 |
Similar Items
- Dual-attention Network for View-invariant Action Recognition
  by: Gedamu Alemu Kumie (19273711)
  Published: (2023)
- Reinforcement learning-based dynamic pruning for distributed inference via explainable AI in healthcare IoT systems
  by: Emna Baccour (16896366)
  Published: (2024)
- Enhancing Cross-Language Multimodal Emotion Recognition With Dual Attention Transformers
  by: Syed Aun Muhammad Zaidi (22225033)
  Published: (2024)
- A survey of transformers and large language models for ECG diagnosis: advances, challenges, and future directions
  by: Mohammed Yusuf Ansari (16904523)
  Published: (2025)
- Diffuse large B‐cell lymphoma presenting with pulmonary artery compression symptoms, case reports
  by: Mhd Baraa Habib (11721774)
  Published: (2024)