A powerful new form of artificial intelligence has burst onto the scene and captured the public’s imagination in recent months: text-to-image AI.
Text-to-image AI models generate original images based solely on simple written inputs. Users can input any text prompt they like—say, “a cute corgi lives in a house made out of sushi”—and, as if by magic, the AI will produce a corresponding image. (See above for this example; scroll down for some more.)
These models produce images that have never existed in the world nor in anyone’s imagination. They are not simple manipulations of existing images on the Internet; they are novel creations, breathtaking in their originality and sophistication.
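For readers curious what this interaction looks like programmatically, the pattern is simply "prompt in, image out." Below is a minimal sketch using OpenAI's Python library and its image-generation endpoint; the package version, model access, and placeholder API key are assumptions for illustration, not a description of how any particular service must be called.

```python
# Minimal sketch: prompt in, image out.
# Assumes the legacy `openai` Python package with image-generation support
# and a valid API key of your own (the key below is a placeholder).
import openai

openai.api_key = "YOUR_API_KEY"  # placeholder; set your own key

# The same kind of free-form prompt described above.
prompt = "a cute corgi lives in a house made out of sushi"

# Request one 1024x1024 image matching the prompt.
response = openai.Image.create(prompt=prompt, n=1, size="1024x1024")

# The response contains a URL pointing to the generated image.
print(response["data"][0]["url"])
```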
The best-known text-to-image model is OpenAI’s DALL-E. OpenAI debuted the original DALL-E in January 2021. Its successor, DALL-E 2, was announced in April 2022 and has attracted widespread public attention, catapulting text-to-image technology into the mainstream.
In the wake of the excitement around DALL-E 2, it hasn’t taken long for competitors to emerge. Within weeks, a lightweight open-source version dubbed “DALL-E Mini” went viral. Unaffiliated with OpenAI or DALL-E, DALL-E Mini has since been rebranded as Craiyon following pressure from OpenAI.
In May, Google published its own text-to-image model, named Imagen. (All the images included in this article come from Imagen.)
Soon thereafter, a startup named Midjourney emerged with a powerful text-to-image model that it has made available for public use. Midjourney has seen astonishing user growth: launched only two months ago, the service has over 1.8 million users in its Discord server as of this writing. Midjourney has recently been featured on the cover of The Economist and on John Oliver’s late-night TV show.
Continue reading: https://www.forbes.com/sites/robtoews/2022/09/11/4-hot-takes-about-the-wild-new-world-of-generative-ai/?sh=3c727ee613d9