Publications

(2025). Energy Considerations of Large Language Model Inference and Efficiency Optimizations.

PDF

(2025). Hardware Scaling Trends and Diminishing Returns in Large-Scale Distributed Training.

PDF

(2025). Holistically Evaluating the Environmental Impact of Creating Language Models.

PDF

(2024). Gradient Localization Improves Lifelong Pretraining of Language Models.

PDF

(2023). Adapting to Gradual Distribution Shifts with Continual Weight Averaging.

(2023). The Framework Tax: Disparities Between Inference Efficiency in Research and Deployment.

PDF Code

(2021). CIGLI: Conditional Image Generation from Language and Image.

PDF Code

(2020). Generative Data Augmentation for Commonsense Reasoning.

PDF Code Project

(2019). CODAH: An Adversarially Authored Question-Answer Dataset for Common Sense.

PDF Code Poster