1

Energy Considerations of Large Language Model Inference and Efficiency Optimizations

The 63rd Annual Meeting of the Association for Computational Linguistics (ACL), 2025.

Efficient Hardware Scaling and Diminishing Returns in Large-Scale Training of Language Models

Transactions on Machine Learning Research, 2025.

Holistically Evaluating the Environmental Impact of Creating Language Models

The Thirtheenth International Conference on Learning Representations (ICLR), 2025.

Gradient Localization Improves Lifelong Pretraining of Language Models

Findings of the Association for Computational Linguistics: EMNLP (EMNLP Findings), 2024.

The Framework Tax: Disparities Between Inference Efficiency in Research and Deployment

Conference on Empirical Methods in Natural Language Processing (EMNLP), 2023.