Preference Alignment, with Reference Mismatch, and without Reference Models

Date:

Friday, 31 January, 2025 - 12:00 to 13:00

Speaker:

James Thorne (KAIST)

Venue:

Room SS03 with Hybrid Format. Here is the Zoom link for those that wish to join online: https://cam-ac-uk.zoom.us/j/4751389294?pwd=Z2ZOSDk0eG1wZldVWG1GVVhrTzFIZz09

Abstract: In this talk, I'll cover two recent papers for preference alignment: Odds-Ratio Preference Optimisation (ORPO, EMNLP 2024), discussing the role of the reference model for preference alignment (e.g. DPO, RLHF), and Margin-aware Preference Optimization (under review @ CVPR), thinking about the risks of reference mismatch: where the preference alignment data has features diverging from the reference model.

Bio: James is Assistant Professor at the KAIST Graduate School of AI, South Korea, working on large-scale and knowledge-intensive natural language understanding. James recently completed his PhD at the University of Cambridge where he developed models and methods for automated fact verification and correction.

[1] https://aclanthology.org/2024.emnlp-main.626/
[2] https://arxiv.org/pdf/2406.06424

Seminar series:

NLIP Seminar Series

View on talks.cam

Calendar

Upcoming seminars

20Oct

Federated Learning at H.IAAC: On-going Research and Opportunities

Allan M. de Souza & Luiz Bittencourt, Universidade Estadual de Campinas (UNICAMP), Brazil

Cambridge ML Systems Seminar Series
20Oct

Bloomberg: Observability in Action: Designing Effective Dashboards

Speaker to be confirmed

Technical Talks
20Oct

Talk by Professor Bjarne Stroustrup: 'Concept-based Generic Programming'

Bjarne Stroustrup, Professor of Computer Science at Columbia University

Department of Computer Science and Technology talks and seminars
21Oct

AIReg-Bench: Benchmarking Language Models That Assess AI Regulation Compliance

William Marino (University of Cambridge)

Artificial Intelligence Research Group Talks
21Oct

Scalable and Verifiable Carbon Accounting in Supply Chains: Towards an Integrated Framework

Jonathan Heiss (TU Berlin)

Security Seminar

View all seminars

Upcoming seminars

About the department

Social media

Study at Cambridge

About the University

Research at Cambridge