Rethinking Depth in Speech Encoder

Rethinking Depth in Speech Encoder

0 followers50 events5y hosting2.9k total attendees
Overview

Priberam Machine Learning Lunch Seminar

Abstract:

Nowadays, speech encoders are becoming increasingly powerful, but also larger and more complex. However, we observe significant redundancy within these models, which motivates a rethinking of how depth should be designed. In this talk, I will present a parameter-efficient alternative based on shared weights with recursive (looped) application of encoder layers, where a smaller set of parameters is reused across multiple iterations instead of stacking many distinct layers. This approach aims to preserve strong representational capacity while reducing model size.

Bio:

Thomas Rolland is a Postdoctoral Researcher at INESC-ID in Lisbon, focusing on building robust speech systems for low-resource, noisy, and domain-shifted settings. His work centers on parameter-efficient architectures, synthetic data augmentation, and post-training strategies to improve adaptability and fairness across diverse speech scenarios.

www.priberam.com

Priberam Machine Learning Lunch Seminar

Abstract:

Nowadays, speech encoders are becoming increasingly powerful, but also larger and more complex. However, we observe significant redundancy within these models, which motivates a rethinking of how depth should be designed. In this talk, I will present a parameter-efficient alternative based on shared weights with recursive (looped) application of encoder layers, where a smaller set of parameters is reused across multiple iterations instead of stacking many distinct layers. This approach aims to preserve strong representational capacity while reducing model size.

Bio:

Thomas Rolland is a Postdoctoral Researcher at INESC-ID in Lisbon, focusing on building robust speech systems for low-resource, noisy, and domain-shifted settings. His work centers on parameter-efficient architectures, synthetic data augmentation, and post-training strategies to improve adaptability and fairness across diverse speech scenarios.

www.priberam.com

Good to know

Highlights

  • 1 hour
  • In-person

Location

Instituto Superior Técnico, Anfiteatro PA2

1 Avenida Rovisco Pais

1049-001 Lisboa

How would you like to get there?

Map
Organised by
Priberam Labs
Followers--
Events50
Hosting5 years
Report this event