Lydia Nishimwe
Lydia Nishimwe
Home
Experience
Publications
Talks
Blog
CV (English)
CV (French)
Contact
Light
Dark
Automatic
Embeddings
Making Sentence Embeddings Robust to User-Generated Content
[Microsoft Seminar] Extended presentation of the LREC-COLING 2024 paper of the same title.
May 29, 2024 2:30 PM — 3:30 PM
Online
Lydia Nishimwe
Making Sentence Embeddings Robust to User-Generated Content
Lydia Nishimwe
,
Benoît Sagot
,
Rachel Bawden
Your Fairseq-trained model might have more embedding parameters than it should.
How a bug in reading SentencePiece vocabulary files causes some Fairseq-trained models to have up to 3k extra parameters in the embedding layer.
Lydia Nishimwe
Last updated on Jun 4, 2024
Cite
×