
Don't have access to DALL-E 2 or Imagen, but I do have [1] and [2] locally, and they produced [3] with that prompt.

[1] https://github.com/nerdyrodent/VQGAN-CLIP.git [2] https://github.com/CompVis/latent-diffusion.git [3] https://imgur.com/a/dCPt35K



Nice. The latent-diffusion results have come out very traditional, but the VQGAN+CLIP ones are fairly original.


From my experiments, the LD one doesn't seem to have been trained on as large or as well-tagged a data set - there's a whole bunch of "in the style of X" prompts that the VQGAN knows about but the LD doesn't. That might have something to do with it.



