What's in a speech token?

What's in a speech token?

Instituto Superior Técnico, Anfiteatro PA2Lisboa, Lisboa
Tuesday, Mar 24, 2026 from 1 pm to 2 pm
Overview

Priberam Machine Learning Lunch Seminar

Abstract:

Discrete speech tokenization is a surprisingly effective technique to adapt text language models to speech. But despite the simplicity of the method, the nature of the units it produces is not widely understood. In this talk, I ask some questions (and hopefully answer a few) about how, when, and why discrete speech tokens work.


Bio:

Ben Peters is a postdoctoral researcher at INESC-ID. He works at the intersection of NLP and speech.

www.priberam.com

Priberam Machine Learning Lunch Seminar

Abstract:

Discrete speech tokenization is a surprisingly effective technique to adapt text language models to speech. But despite the simplicity of the method, the nature of the units it produces is not widely understood. In this talk, I ask some questions (and hopefully answer a few) about how, when, and why discrete speech tokens work.


Bio:

Ben Peters is a postdoctoral researcher at INESC-ID. He works at the intersection of NLP and speech.

www.priberam.com

Good to know

Highlights

  • 1 hour
  • In-person

Location

Instituto Superior Técnico, Anfiteatro PA2

1 Avenida Rovisco Pais

1049-001 Lisboa

How would you like to get there?

Map
Organised by
Priberam Labs
Followers--
Events47
Hosting5 years
Report this event