Comparing GPT-3 and RNNs as probabilistic generative models
Jan 1, 2022
An investigation of transformers' and RNNs' ability to model long-term dependencies through the lens of probabilistic modelling. You can find the write-up here, and the code here.