news

Nov 01, 2025 :round_pushpin: I will be attending NeurIPS in San Diego — looking forward presenting our latest works!
Sep 18, 2025 :tada: DATE-LM was accepted to NeurIPS 2025. We introduce a rigorous, applications-driven benchmark for large-scale evaluation of data attribution methods in LLMs. Check out our :page_facing_up: paper, :computer: code, and :trophy: leaderboard.
Sep 18, 2025 :tada: Our work on Fairshare Data Pricing was accepted to NeurIPS 2025, introducing a data-influence–based framework for fair pricing of LLM training datasets. Check out our :page_facing_up: paper.
Feb 01, 2025 :tada: Our work on ICP for Data Attribution was accepted to NAACL 2025. We showed that simple probing of LLMs may serve as a practical proxy for gradient-based data attribution, enabling efficient identification of influential training samples. Check out our :page_facing_up: paper and :computer: code.