DALL·E is a 12-billion parameter version of GPT-3 trained to generate images from text descriptions, using a dataset of text–image pairs. We’ve found that it has a diverse set of capabilities, including creating anthropomorphized versions of animals and objects, combining unrelated concepts in plausible ways, rendering text, and applying transformations to existing images.
If you give DALL·E the task to generate images with “an armchair in the shape of an avocado”, you get this:
And to say I was questioning the power of computers earlier today 😅
🤓 Clever name too: Dali + WALL·E = DALL·E