Language Modelling: RNN vs GRU vs Transformer (Coming soon) My Latest Projects | 2019 | Links: Source Compare the performances of a Vanilla RNN vs GRU vs the attention module of a transformer network on the Penn Treebank dataset (Coming Soon)