The Free Transformer
6 months ago
- #Machine Learning
- #Transformer
- #Variational Methods
- Proposes an extension of the decoder Transformer that conditions its generative process on random latent variables.
- Latent variables are learned without supervision using a variational procedure.
- Experimental evaluations show that this conditioning yields substantial improvements on downstream tasks.
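
As a rough sketch of what a variational procedure of this kind typically optimizes, the block below gives the standard evidence lower bound for a latent-variable model. It is not necessarily the exact objective used in the paper. The notation is assumed here for illustration: $x$ is the token sequence, $z$ the random latent variables, $p_\theta(x \mid z)$ the latent-conditioned decoder, $q_\phi(z \mid x)$ an encoder used only during training, and $p(z)$ a fixed prior.

```latex
\log p_\theta(x)
  \;\ge\;
  \mathbb{E}_{q_\phi(z \mid x)}\!\left[ \log p_\theta(x \mid z) \right]
  \;-\;
  D_{\mathrm{KL}}\!\left( q_\phi(z \mid x) \,\|\, p(z) \right)
```

The first term rewards the decoder for predicting the sequence well given the latent. The KL term limits how much information about $x$ the encoder can pass through $z$, which keeps the latent close enough to the prior that $z$ can be sampled from $p(z)$ at generation time, with no supervision on what $z$ should encode.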