Text this: Efficient self-attention with smart pruning for sustainable large language models