In this project I implemented the algorithm from the paper Learning Disentangled Joint Continuous and Discrete Representations. As stated in the abstract, the main objective is to “learn a disentangled and interpretable jointly continuous and discrete representations in an unsupervised manner”. Putting in simpler words, we want to force each dimension of the latent space of a variational auto-encoder (VAE) to change something different from the other ones. If we consider images of digits for example, then hopefully changing one dimension would alter the thickness of the strokes, while another would change the angle of the digit.
