Which Transformer architecture fits my data? A vocabulary bottleneck in self-attention

Noam Wies, Yoav Levine, Daniel Jannai, Amnon Shashua

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution › peer-review
