NLIP Seminar Series

NLIP 2025 Social: Meet New PhD Students

Friday, 10 October, 2025 - 12:00 to 13:00

Introductory Presentations from new PhD students and Research Assistants in the NLIP Group: **New PhD Students** **Bianca-Mihaela Ganescu** (supervised by Prof Paula Buttery) **Filip Trhlik** (supervised by Prof Paula Buttery) **Yizhou Chi** (supervised by Prof Andreas Vlachos) **Przemyslaw Kubiak** (supervised by Dr...

Read more at: Title to be confirmed

Title to be confirmed

Friday, 31 October, 2025 - 12:00 to 13:00

Abstract not available

Making and breaking tokenizers

Friday, 17 October, 2025 - 12:00 to 13:00

Despite massive investments in training large language models, tokenizers remain a critical but often neglected component with weaknesses that can cause wild hallucinations, bypass safety guardrails, and break downstream applications. This talk will cover: Our recent research in automatically detecting problematic 'glitch...

Read more at: Title to be confirmed

Title to be confirmed

Friday, 5 December, 2025 - 12:00 to 13:00

Abstract not available

Read more at: Title to be confirmed

Title to be confirmed

Friday, 21 November, 2025 - 12:00 to 13:00

Abstract not available

Read more at: Title to be confirmed

Title to be confirmed

Friday, 14 November, 2025 - 12:00 to 13:00

Abstract not available

Read more at: Title to be confirmed

Title to be confirmed

Friday, 24 October, 2025 - 12:00 to 13:00

Abstract not available

LLMs, Implicit Bayesian inference and compositional Generalization

Friday, 20 June, 2025 - 12:00 to 13:00

**Abstract** Apparently rational behaviors of autoregressive LLMs, such as in-context learning, have been attributed to implicit Bayesian inference: since training data is best explained as a mixture, the optimal next-token-predictor learns to implicitly infer latent concepts and completes prompts consistently with...

MultiBLiMP: A Multilingual Benchmark of Linguistic Minimal Pairs

Friday, 13 June, 2025 - 12:00 to 13:00

We introduce MultiBLiMP, a massively multilingual benchmark of linguistic minimal pairs, covering 101 languages, 6 linguistic phenomena and containing more than 120.000 minimal pairs. Our minimal pairs are created using a fully automated pipeline, leveraging the large-scale linguistic resources of Universal Dependencies...

Measuring Political Bias in Large Language Models

Friday, 16 May, 2025 - 12:00 to 13:00

Large language models (LLMs) are helping millions of users to learn and write about a diversity of issues. In doing so, LLMs may expose users to new ideas and perspectives, or reinforce existing knowledge and user opinions. This creates concerns about political bias in LLMs, and how these biases might influence LLM users...

NLIP 2025 Social: Meet New PhD Students

Title to be confirmed

Making and breaking tokenizers

Title to be confirmed

Title to be confirmed

Title to be confirmed

Title to be confirmed

LLMs, Implicit Bayesian inference and compositional Generalization

MultiBLiMP: A Multilingual Benchmark of Linguistic Minimal Pairs

Measuring Political Bias in Large Language Models

About the department

Social media

Study at Cambridge

About the University

Research at Cambridge