| Nov 01, 2025 | I will be attending NeurIPS in San Diego — looking forward presenting our latest works! |
| Sep 18, 2025 | DATE-LM was accepted to NeurIPS 2025. We introduce a rigorous, applications-driven benchmark for large-scale evaluation of data attribution methods in LLMs. Check out our paper, code, and leaderboard. |
| Sep 18, 2025 | Our work on Fairshare Data Pricing was accepted to NeurIPS 2025, introducing a data-influence–based framework for fair pricing of LLM training datasets. Check out our paper. |
| Feb 01, 2025 | Our work on ICP for Data Attribution was accepted to NAACL 2025. We showed that simple probing of LLMs may serve as a practical proxy for gradient-based data attribution, enabling efficient identification of influential training samples. Check out our paper and code. |