Inference

Energy Considerations of Large Language Model Inference and Efficiency Optimizations

The 63rd Annual Meeting of the Association for Computational Linguistics (ACL), 2025.