Comparing GPT-3 and RNNs as probabilistic generative models

Jan 1, 2022

An investigation of transformers' and RNNs' ability to model long-term dependencies through the lens of probabilistic modelling. You can find the write-up here, and the code here.
