0% found this document useful (0 votes)
8 views129 pages

39021924

The tutorial at ICML 2024 focuses on the challenges of evaluating language models (LMs), discussing fundamental evaluation methods, common pitfalls, and best practices for reliable assessments. It aims to provide attendees with insights into current evaluation practices, the issues faced, and future research directions in LM evaluation. Key topics include measurement methods, reproducibility challenges, and the impact of prompt sensitivity on evaluation outcomes.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd
0% found this document useful (0 votes)
8 views129 pages

39021924

The tutorial at ICML 2024 focuses on the challenges of evaluating language models (LMs), discussing fundamental evaluation methods, common pitfalls, and best practices for reliable assessments. It aims to provide attendees with insights into current evaluation practices, the issues faced, and future research directions in LM evaluation. Key topics include measurement methods, reproducibility challenges, and the impact of prompt sensitivity on evaluation outcomes.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF or read online on Scribd

You might also like