Seminar: Matthias Gallé (NAVER Labs), The best atomic unit to represent text

Indlæser Begivenheder

23. januar 2020 @ 12:00 - 13:00

“Abstract: What is the best atomic unit to represent text? This important decision lies at the heart of the intersection between the continuous representation of modern NLP and the discrete world. To understand the effectiveness of BPE, we test the hypothesis that it lies in the compression capacity of that algorithm. We test this by linking it to the broader family of dictionary-based compression algorithms.We then study character-based NMT with Transformer models, showing the consequences of using character as…This is joint work with Rohit Gupta, Laurent Besacier and Marc Dymetman.”

Price: Free

Link: https://www.meetup.com/Natural-Language-Processing-Copenhagen-Meetup/events/267804527/

Detaljer

Dato:
23. januar 2020
Tidspunkt:
12:00 - 13:00
Lokation:
Auditorium 1, Universitetsparken 13, 2100 Copenhagen

« Alle begivenheder