Publications
2025
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly
Conference
International Conference on Learning Representations (ICLR 2025)
2024
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation
Preprint
Preprint
CoTAR: Chain-of-Thought Attribution Reasoning with Multi-level Granularity
Conference
Findings of the Association for Computational Linguistics: EMNLP 2024
2023
Optimizing Retrieval-augmented Reader Models via Token Elimination
Conference
The 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023)
2022
Transformer Language Models without Positional Encodings Still Learn Positional Information
Conference
Findings of the Association for Computational Linguistics: EMNLP 2022
2021
How to Train BERT with an Academic Budget
Conference
The 2021 Conference on Empirical Methods in Natural Language Processing (EMNLP 2021)
2020
Exploring the Boundaries of Low-Resource BERT Distillation
Workshop
Proceedings of SustaiNLP: Workshop on Simple and Efficient Natural Language Processing, 2020
2019
Training Compact Models for Low Resource Entity Tagging using Pre-trained Language Models
Workshop
Fifth Workshop on Energy Efficient Machine Learning and Cognitive Computing - NeurIPS Edition (EMC2-NIPS), 2019, pp. 44-47
2018
Term Set Expansion based NLP Architect by Intel AI Lab
Conference
The 2018 Conference on Empirical Methods in Natural Language Processing (EMNLP 2018)
2014
The search duel: a response to a strong ranker
Conference
Proceedings of the 37th International ACM SIGIR Conference on Research & Development in Information Retrieval (2014)
2013
Leveraging memory mirroring for transparent memory scale-out with zero-downtime failover of remote hosts
Conference
2013 IEEE Symposium on Computers and Communications (ISCC) (2013)