Publications
Publication record
Language-model research, from efficient training to retrieval and long-context evaluation.
Selected papers are marked for fast scanning. The full ledger keeps venue, artifact, and repository links close to each paper.
2025
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
International Conference on Learning Representations (ICLR 2025)
2024
CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity
Findings of the Association for Computational Linguistics: EMNLP 2024
2023
Optimizing Retrieval-augmented Reader Models via Token Elimination
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)
2022
Transformer Language Models without Positional Encodings Still Learn Positional Information
Findings of the Association for Computational Linguistics: EMNLP 2022
2021
How to Train BERT with an Academic Budget
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)
2020
Exploring the Boundaries of Low-Resource BERT Distillation
Proceedings of SustaiNLP: Workshop on Simple and Efficient Natural Language Processing, 2020
2019
2018
2014
The search duel: a response to a strong ranker
Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval (2014)
2013
Leveraging memory mirroring for transparent memory scale-out with zero-downtime failover of remote hosts
2013 IEEE Symposium on Computers and Communications (ISCC)