Language Modelling: RNN vs GRU vs Transformer (Coming soon)

Compare the performance of a vanilla RNN, a GRU, and the attention module of a Transformer network on the Penn Treebank dataset (coming soon).


© 2020. All rights reserved.