Midjourney, DALL-E… AI Text-to-Art: the ultimate guide to image generators

AI-based “Text-to-Art” art generators create realistic or artistic images from simple user-entered text. DALL-E, MidJourney, Craiyon… find out everything you need to know about these revolutionary tools, as well as a comparison of the best programs available.

Since the dawn of time, art has allowed human beings to express their feelings, emotions and sensations. When words are not enough, painting and drawing make it possible to capture the moment and share it…

Unfortunately, many people do not have the artistic talent to bring their imagination to life on a canvas. Until recently, they had no choice but to remain frustrated and stifle their creative impulses.

This is no longer the case, thanks to a new kind of artificial intelligence: “Text-to-Art” image generators. From a few words entered by the user, these tools can create images of stunning realism or striking artistic beauty.

Craiyon, DALL-E, MidJourney, Stable Diffusion… in just a few months, AI-based “Text-to-Art” image generators have taken the web by storm to become a true viral phenomenon.

This new technology very quickly became extremely popular. Beyond creating images from one’s own ideas, browsing the creations of millions of other Internet users is entertaining in its own right.

Many use it simply for fun or to test the range of possibilities, but many artists, graphic designers, illustrators and architects also use it in their work.

The Text-to-Art craze started in 2018, when a portrait created by an AI sold for $432,500 at auction. Since then, artists and non-artists have been continually generating works for personal or commercial use…

An Indian architect notably used MidJourney to imagine the skyscrapers of the future. In August 2022, a video game creator even won a digital art competition in Colorado by presenting a painting created with MidJourney.

While these artificial intelligences fascinate and impress, they also raise many concerns. Artists fear being replaced, while cybersecurity experts fear misuse to create deepfakes.

Faced with the potential risks represented by these tools, several developers have even chosen not to let the general public access them. This is notably the case of Google with Imagen, or of OpenAI with DALL-E.

In this guide, discover all you need to know about Text-to-Art image generators and how they work, as well as a comparison of the best existing tools.

What is a Text-to-Art AI?

A Text-to-Art AI generator is software that uses artificial intelligence to create works of art from text entered by the user. All it takes is to type a sequence of words, a textual description or even a sentence.

From this “prompt”, the AI is able to understand the words and create an image. Beyond the requested content, the program is even able to generate a work of art in a specific style or to represent the scene from a specific angle.

This technology was born from recent advances in the field of AI and Deep Learning, and offers unprecedented possibilities for artistic creation…

How do Text-to-Art AIs work?

In a post published on its blog, Google explains how AI Text-to-Art generators work, using the example of its own models: Imagen and Parti. These two tools take different approaches to creating images from text.

Both build on Transformers, a family of Machine Learning models developed over the last few years, and both are trained on large datasets composed of images with textual descriptions.

These Transformer models are able to process the words of a sentence by taking into account the relations between them. They are the foundation of the Text-to-Art models.
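To make the idea of "taking relations between words into account" more concrete, here is a minimal sketch of the attention mechanism at the heart of Transformers. It is a toy illustration, not Google's implementation: the two-dimensional word vectors are invented for the example, and the learned projection matrices that a real Transformer applies before comparing words are left out.

```python
import math

def softmax(scores):
    """Turn raw scores into positive weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def self_attention(vectors):
    """Return, for each word vector, a weighted mix of all word vectors.

    The weights come from scaled dot products, so words whose vectors
    point in similar directions contribute more to each other.
    """
    dim = len(vectors[0])
    mixed, all_weights = [], []
    for query in vectors:
        scores = [sum(q * k for q, k in zip(query, key)) / math.sqrt(dim)
                  for key in vectors]
        weights = softmax(scores)
        all_weights.append(weights)
        mixed.append([sum(w * v[i] for w, v in zip(weights, vectors))
                      for i in range(dim)])
    return mixed, all_weights

# Hypothetical 2-D embeddings for the prompt "red car road" (made-up values).
words = ["red", "car", "road"]
embeddings = [[1.0, 0.2], [0.9, 0.4], [0.1, 1.0]]
outputs, weights = self_attention(embeddings)
for word, row in zip(words, weights):
    print(word, [round(w, 2) for w in row])
```

Each printed row shows how much one word "looks at" the others. With these made-up vectors, "red" gives more weight to "car" than to "road", which is the kind of relation a trained model picks up from data.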

In addition, both Google AIs use a new technique to generate an image that more closely matches the textual description. Even though Imagen and Parti use similar technology, each adopts a different and complementary strategy.

Imagen is a Diffusion model: it learns to convert a pattern of random dots into an image. At first, these images are in low definition, and they improve gradually.
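The refinement loop can be sketched in a few lines. This is only a toy illustration of "start from random dots and improve step by step", not Imagen's actual method: the function names are hypothetical, and the stand-in denoiser cheats by knowing the final image, whereas a real Diffusion model uses a neural network trained to predict the clean image from the noisy one and from the text.

```python
import random

def denoise_step(image, predicted_clean, strength):
    """Move every pixel a fraction of the way toward the predicted clean image."""
    return [px + strength * (clean - px)
            for px, clean in zip(image, predicted_clean)]

def generate(target, steps=20, strength=0.3, seed=0):
    """Start from pure noise and refine it step by step."""
    rng = random.Random(seed)
    image = [rng.gauss(0.0, 1.0) for _ in target]  # the "random dots"
    errors = []
    for _ in range(steps):
        # Assumption: a trained network would predict the clean image here.
        # This sketch passes the target directly as a stand-in.
        image = denoise_step(image, target, strength)
        errors.append(sum(abs(a - b) for a, b in zip(image, target)) / len(target))
    return image, errors

# A tiny 8-pixel "image" used as the stand-in prediction.
target = [0.0, 0.5, 1.0, 0.5, 0.0, 0.5, 1.0, 0.5]
image, errors = generate(target)
print("error after first step:", round(errors[0], 4))
print("error after last step: ", round(errors[-1], 6))
```

The average error shrinks at every step, which mirrors the way the image goes from noise to a recognizable picture over many small corrections.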

Diffusion models are used in particular for image and audio tasks such as definition enhancement, colorization of black-and-white photos, retouching of image regions, image uncropping or text-to-speech synthesis.

For its part, Parti’s approach begins by converting a collection of images into sequences of code entries, similar to the pieces of a puzzle. The text entered is then translated into these code entries, and a new image is created.

This approach takes advantage of existing research and infrastructure for large language models like PaLM. It is essential for processing long and complex texts and producing high quality images.
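The "puzzle pieces" idea can be illustrated with a small vector quantization sketch: each patch of an image is replaced by the index of the closest entry in a codebook, and the image is rebuilt from those indices. The codebook and pixel values below are invented for the example. A real system learns its codebook from data, and a Transformer then predicts the sequence of codes from the text.

```python
def nearest_code(patch, codebook):
    """Index of the codebook entry closest to the patch (squared distance)."""
    return min(range(len(codebook)),
               key=lambda i: sum((p - c) ** 2 for p, c in zip(patch, codebook[i])))

def encode(pixels, codebook, patch_size):
    """Cut a flat list of pixels into patches and replace each by a code index."""
    patches = [pixels[i:i + patch_size] for i in range(0, len(pixels), patch_size)]
    return [nearest_code(patch, codebook) for patch in patches]

def decode(tokens, codebook):
    """Rebuild an approximate image by concatenating the codebook entries."""
    return [value for token in tokens for value in codebook[token]]

# Hypothetical 4-entry codebook of 2-pixel patches: dark, bright, rising, falling.
codebook = [[0.0, 0.0], [1.0, 1.0], [0.0, 1.0], [1.0, 0.0]]
pixels = [0.1, 0.0, 0.9, 1.0, 0.2, 0.8, 0.9, 0.1]

tokens = encode(pixels, codebook, patch_size=2)
print(tokens)                    # sequence of code entries
print(decode(tokens, codebook))  # approximate reconstruction
```

Once images are sequences of codes, generating a picture becomes a problem of predicting a sequence, which is exactly what large language models are built for.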

However, according to Google, these models have many limitations. None of them is really able to count objects accurately or to place them correctly according to spatial descriptions.

As the text becomes more complex, the models begin to forget details or to introduce elements that were not requested. These shortcomings stem from a lack of explicit training material, limited data representation, and a lack of 3D awareness.

In the face of risks of misinformation, bias and cybersecurity, Google has taken steps to continue developing this technology responsibly. Easily identifiable watermarks are being added to the images generated by Imagen and Parti, and experiments are being conducted to better understand the biases of these models. Unfortunately, not all other creators of Text-to-Art AI are as careful.

The best AI Text-to-Art generators

There are already many AI Text-to-Art generators. Some are available for free via the web and accessible to everyone, others are paid and require a hard-to-get invitation. Discover the best tools.


DALL-E 2

DALL-E 2 is an AI image generator developed by OpenAI, and it is certainly the most powerful tool available to the general public today. It takes only a few minutes to create highly realistic images.

According to OpenAI, this program can be used to create illustrations, design products or generate new ideas for companies.

The first version of DALL-E, presented in January 2021, was reserved for scientific or professional use. Earlier, OpenAI also created GPT-3, an artificial intelligence capable of writing text. This AI laid the foundation for Text-to-Art.

In reality, the first DALL-E is a 12-billion-parameter version of GPT-3. With 3.5 billion parameters, DALL-E 2 is clearly smaller, but this second version is able to create images in a definition 4 times higher.

Thanks to its very easy-to-use interface, DALL-E 2 allows anyone to create high-quality images using AI. In addition to professional artists, amateurs can also use this tool.

One of the best features of DALL-E 2 is its brush, which adds details such as shadows, reflections and much more to your images. These tools allow you to create complex images with multiple layers, each customized with its own specificities.

By registering, you will receive 50 free credits the first month, then 15 free credits per month. You will have to pay 15 dollars to receive 115 additional credits. Note that DALL-E 2 is still in closed beta, and that you need to be patient to get access to it.
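To put these figures in perspective, the short calculation below works out the price of one paid credit and the number of free credits accumulated over time. It uses only the numbers quoted above; the function and constant names are chosen for this example.

```python
PAID_PACK_PRICE_USD = 15   # price of one paid pack, as quoted above
PAID_PACK_CREDITS = 115    # credits in one paid pack, as quoted above
FREE_FIRST_MONTH = 50      # free credits received the first month
FREE_LATER_MONTHS = 15     # free credits received each following month

def price_per_credit():
    """Price in dollars of a single paid credit."""
    return PAID_PACK_PRICE_USD / PAID_PACK_CREDITS

def free_credits(months):
    """Total free credits received after a given number of months."""
    if months <= 0:
        return 0
    return FREE_FIRST_MONTH + FREE_LATER_MONTHS * (months - 1)

print(f"{price_per_credit():.3f} USD per paid credit")
print(free_credits(12), "free credits over 12 months")
```

A paid credit therefore costs about 13 cents, and a user who never pays receives 215 credits over the first year.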


MidJourney

In just a few weeks, MidJourney has established itself as the best AI art generator. Since the launch of its open beta, this tool has gone viral.

Created by David Holz, co-founder of Leap Motion, who also worked for NASA, this text-to-image AI distinguishes itself by putting the emphasis on the artistic aspect. Its creators have optimized it to identify beauty.

And even if the images are not always successful, many of them are so breathtaking that one would believe they were created by human artists. Moreover, thanks to a feedback system added with the third version, the AI improves by analyzing the reactions of Internet users to each of its creations.

Many artists are impressed by MidJourney, and even use this tool in their work. David Holz predicts that AI will make unimaginable progress over the next two years.

To generate an image with MidJourney, you just have to send a sentence to the bot in the official Discord channel. The images are then posted on the Discord server, where you can browse the artworks continuously.

You can create 25 images for free during the trial period, but then you will have to pay a 10 dollar subscription to generate 200 images per month. As an alternative, a monthly subscription of 30 dollars allows you to create an unlimited number of images.
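A quick calculation helps to choose between the two subscriptions. The sketch below uses only the prices quoted above and compares the cost per image. The break-even figure is a simple per-image comparison made for this example, not an official MidJourney rule.

```python
BASIC_PRICE_USD = 10       # monthly price of the 200-image subscription
BASIC_IMAGES = 200         # images included in that subscription
UNLIMITED_PRICE_USD = 30   # monthly price of the unlimited subscription

def basic_cost_per_image():
    """Cost of one image when the 200-image plan is fully used."""
    return BASIC_PRICE_USD / BASIC_IMAGES

def unlimited_cost_per_image(images_per_month):
    """Cost of one image on the unlimited plan for a given monthly volume."""
    return UNLIMITED_PRICE_USD / images_per_month

def break_even_images():
    """Monthly volume at which both plans cost the same per image."""
    return UNLIMITED_PRICE_USD * BASIC_IMAGES / BASIC_PRICE_USD

print(f"{basic_cost_per_image():.2f} USD per image on the 10 dollar plan")
print(f"{unlimited_cost_per_image(1000):.2f} USD per image at 1000 images on the 30 dollar plan")
print(f"{break_even_images():.0f} images per month to reach the same cost per image")
```

In other words, the 10 dollar plan works out at 5 cents per image, and the unlimited plan only becomes cheaper per image beyond 600 images per month.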

Dall-E Mini aka Craiyon

Initially named Dall-E Mini, this tool had to change its name to avoid confusion with the Dall-E AI from OpenAI. It is now called Craiyon.

For each sentence submitted by the user, this generator creates multiple images. This increases the chances of obtaining a satisfactory result among the nine proposals.

Unfortunately, the definition of the images is rather low. Compared to other tools, this AI seems to pick up images on the internet and mix them to match the user’s text.

The main advantage of Craiyon is that it is completely free and open to everyone. This tool has become popular for its propensity to create memes and hilarious images, often in spite of itself…

There is no need to create an account to use it. Just go to the official website and start entering your text. In response, you will receive 9 images in a 3-by-3 grid.

Nevertheless, Craiyon does not offer any image customization options. This tool also lacks security protocols…

Stable Diffusion

Stable Diffusion is an open-source image generator based on Machine Learning. This tool is able to create images from text, but also to modify existing images or to improve the definition of blurred images.

Unlike cloud-based generators, Stable Diffusion runs locally on your computer or your smartphone. This allows you to create images without any censorship, and some users take advantage of it to create erotic images…

This tool is completely free, and offers more control over content creation. However, it is necessary to have a sufficiently powerful machine to run it.

If you don’t have the required computing power, you can try a demo of Stable Diffusion on the web. There are several websites offering to use this AI online.

Runway ML

Runway ML allows you to generate images by training your own Machine Learning models. This tool lets you create models capable of generating realistic images in a wide variety of styles.

It is even possible to use Runway ML to create animations and 3D models. A video editor is also included to replace background images in your video projects.

Among the tools used by Runway ML is relative motion analysis, which makes it possible to understand what the user is trying to do. The AI also uses object recognition to identify elements in an image or video.

Runway has already taken the next step in Text-to-Image AI: this tool can now create videos from text.

Wombo Dream

Wombo Dream is an AI art generator developed by the Canadian startup WOMBO. It is considered one of the best NFT creation apps.

The Wombo Dream system allows you to create drawings in a wide variety of styles. For example, you can choose between retro art, Salvador Dalí or Ghibli styles.

In addition, you can include a reference image on which the AI can base itself. It is also possible to convert existing photos into cartoons or paintings.

A complex algorithm can turn words and sentences into works of art. You can then convert your creations into NFTs.

It is possible to use Wombo Dream on a phone, tablet or computer. The mobile version offers more features.


StarryAI

StarryAI is an AI art generator that can turn drawings into NFTs. It does not require any input data from the user and processes images with a Machine Learning algorithm.

This tool provides two different AI engines: Orion creates coherent and realistic images, while Altair generates more abstract, dreamlike images.

One of the strong points of StarryAI is its simple and uncluttered interface. This generator lets you upload an initial image for the AI to build on.

In addition, this tool gives you full ownership of the images you create. You can use them for personal or commercial use.

Thus, this program can be used as a free NFT generator. This is in fact its main selling point. It is also possible to have your works printed.

The generated images are decent, but they are not as good as those of the best generators. You will receive a few credits for free when you register, but then you will have to pay to continue using this tool. However, the technology is constantly improving and has already created some fantastic designs.

It should also be noted that it is possible to add credits to increase the runtime of the AI and improve the result. You can earn free credits by watching ads and sharing your creations on social networks. Users can create a maximum of five images per day for free.


NightCafe

The NightCafe tool is one of the big names in the small world of AI art generators. It offers more algorithms and options than most other programs, and is very easy to learn for new users.

Payment is via a credit system, but the free version is relatively generous. There are also many ways to earn credits by participating in the community.

You own the creations you generate with NightCafe, and advanced users will appreciate the many control options. There are also many social features, and a very active community.

The creations can be organized in collections, and images can be downloaded in batches. It is also possible to create videos and to buy a printed version of the works.

NightCafe lets you choose between different artistic styles. After creating a few images for free, you will have to buy credits to continue using it, or to improve the quality and definition of the images. However, it is possible to create five images per day for free.

The images created can leave something to be desired, and are sometimes strange even with the default settings. In addition, the definition is not spectacular.

Google Deep Dream Generator

Deep Dream Generator is a tool built on DeepDream, a technique developed at Google. It is one of the most popular AI art generators on the market, as it allows you to create realistic images with artificial intelligence.

It does not generate images from text, but transforms photos and images into stylized works of art. It is possible to choose different categories such as landscapes or animals to let the AI create a realistic image.

This tool is based on a neural network trained on millions of images. It is very simple to use: just upload an image, and the tool takes inspiration from it to generate a new one.

One of the main use cases of Deep Dream is the creation of artworks. You can choose from the styles of many famous artists, and change the settings to get the desired result.

It is also possible to mix different artistic styles on the same image. This makes it possible to generate images that look like they come from a specific country or period. This tool can be used to create NFTs.

The free version is limited. To take advantage of all the features, it is necessary to subscribe for 19 dollars per month, or 39 dollars per month for the professional package.

Big Sleep

Big Sleep is an AI art generator for creating realistic images from scratch. The tool is very easy to use, and creating images requires only a few steps.

This generator is written in Python, and uses a neural network to create realistic images. Just provide data to the program and let it produce an image.

This process relies on a GAN, a generative adversarial network. The generator model creates an image, and the discriminator model tries to determine if it is a real image or if it is generated by the AI. Over time, the generator improves and produces more realistic images.
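The tug-of-war between the two models can be expressed with two simple scores. The sketch below is a generic illustration of the usual GAN objective, not Big Sleep's actual code: it assumes the discriminator outputs a probability between 0 and 1 that an image is real, and it uses the common "non-saturating" form of the generator loss. The example probabilities are invented.

```python
import math

def discriminator_loss(real_scores, fake_scores):
    """Binary cross-entropy: low when real images score near 1 and fakes near 0."""
    real_term = [-math.log(s) for s in real_scores]
    fake_term = [-math.log(1.0 - s) for s in fake_scores]
    return (sum(real_term) + sum(fake_term)) / (len(real_scores) + len(fake_scores))

def generator_loss(fake_scores):
    """Low when the discriminator is fooled, i.e. when fakes score near 1."""
    return sum(-math.log(s) for s in fake_scores) / len(fake_scores)

# Hypothetical discriminator outputs (probability that an image is real).
real = [0.9, 0.8]
fakes_early = [0.1, 0.2]   # early in training: fakes are easy to spot
fakes_later = [0.6, 0.7]   # later: fakes start to fool the discriminator

print("generator loss, early:", round(generator_loss(fakes_early), 3))
print("generator loss, later:", round(generator_loss(fakes_later), 3))
print("discriminator loss, early:", round(discriminator_loss(real, fakes_early), 3))
print("discriminator loss, later:", round(discriminator_loss(real, fakes_later), 3))
```

As the fakes become more convincing, the generator's loss goes down while the discriminator's goes up. Training alternates between the two models, each trying to lower its own loss.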

The strong points of this tool are an image definition of 1024×1024 and its open-source nature. It is also accessible for beginners.


Artbreeder

Artbreeder is well known in the field of AI art generators. It is an image quality enhancement tool. It makes it possible to produce different variations of an image thanks to Machine Learning.

It is possible to create landscapes, characters, portraits and many other types of illustrations from this platform. The tool also allows you to modify facial features like skin color, hair and eyes. You can even use it to animate your photos!

Another strong point of Artbreeder is that it offers the possibility to manage thousands of illustrations in folders. The result can then be downloaded in JPG or PNG format.

Free users have to make do with 8 free downloads. To take advantage of the increased definition and all the advanced features, you must subscribe for $8.99 per month.


This freeware is a Google Colab notebook based on Python, rather simple to use. Just go to the website, enter the text in the box and press Ctrl+F9 or the Runtime – Run All button.

The system improves the design iteratively, which lets you observe the process step by step. However, the results are not always successful…

Deep AI

Founded in 2016, DeepAI aims to democratize AI through open-source software. It offers different tools that can be used to create realistic images.

This AI generator is highly customizable, and allows you to change many details like colors and textures. It allows you to create as many images as you want, and each one is unique.

Other tools offered by this company rely on models such as StyleGAN and BigGAN for creating realistic images, and CartoonGAN to transform images into cartoons.

Hotpot AI

Again, Hotpot AI is a very basic tool compared to the best AI art generators. However, it allows users to create simplistic drawings very easily.

Unfortunately, this artificial intelligence does not always understand what it is asked to create. The drawings often bear no relation to the text. In addition, the faces are not very successful.


Fotor

Fotor offers to transform AI-generated images into NFTs. Just upload an image and select the art style to apply.

You can also create layers or add personal touches quickly and easily. There is also no need to create an account to use the software or download the designs.

However, the program’s GoArt creation tool offers nothing more than Photoshop-style filters. This program is relatively easy to use, but is of little interest.

A photo editor is also included, but you have to pay to remove the watermark. There is also better freeware on the web to perform this function.


Pixray

Pixray is a Text-to-Art generator using artificial intelligence. It is possible to run it for free with an API, in a web browser or on a computer.

Its clean interface lets you enter a phrase, and then you can choose between several AI engines: Pixel to generate pixel art, VQGAN for GAN images, or Clipdraw.

These different engines are customizable, and the interface is easy to use. You can adjust the parameters with different filters and change your input text until the result matches your expectations.

An option also allows you to create sketches. This tool stands out for its flexible integrations. However, you will have to wait more than five minutes for the AI to generate an image, so it is not an ideal option if you want immediate results. The generated images are not very realistic.

Comparison: which is the best Text-to-Art generator?

The AI art generators are not all the same. Some free tools create blurry, abstract-looking images, while the most powerful tools can create photorealistic images and reconstruct specific artistic styles. So which of the many tools is best?

A designer named Fabian Stelzer, based in Berlin, decided to make a comparison between three of the most popular generators: Midjourney, DALL-E 2 and Stable Diffusion. The results obtained from the same texts are very different from one program to another, and each one seems to interpret the words in its own way.

This is due to the diversity of the algorithms used, and of the data on which these models are trained. Each of these three tools has its strengths and weaknesses, and each can prove better suited to a specific context.

To conduct his experiment, Stelzer used different prompts such as “low poly game asset, Cthulhu monster, video game 2000, isometric view” or “1990’s clip art of a fax machine caught in a laughing fit, Windows 3.1, MS-DOS, old computer clip art”. He then shared the results on Twitter.

We see that the creations of MidJourney are often darker, almost apocalyptic. This tool was used to create “the last selfie before the end of the world”. However, in terms of artistic style, MidJourney often produces the most natural results, especially for texture details.

For its part, DALL-E 2 often leaves artifacts resembling digital glitches. However, this AI seems to be the most suitable for creating photorealistic images, and for representing facial expressions.

Finally, Stable Diffusion generally seems to produce the “cleanest” results. According to Stelzer, this AI can create incredible photos, but you need to be careful not to overload the scene. It is also very good for recreating the style of specific artists.

According to Stelzer, these AI generators are comparable to musical instruments, each with its own range and timbre. He compares MidJourney to a beautiful analog Moog with a limited range, whereas DALL-E 2 offers a huge range, but an explicitly digital result.

In addition to their results, these tools have several differences. DALL-E 2 is distinguished by a feature to edit a part of the image, while MidJourney shines with a large and active community of users offering support and inspiration.

In the future, Stelzer is convinced that AI art generators will be the biggest revolution for creative work since photography: “what the photo was to painting, the image synthesizers are to photography”. He predicts that we will soon be able to create films by typing texts…

Other less sophisticated Text-to-Art generators, like Craiyon, offer less realistic results. However, comparing it with DALL-E 2 and Midjourney, YouTuber 2kliksphilip concludes that Craiyon can produce very creative and more varied images. So it can be a very rich source of inspiration.

If DALL-E 2 is more suitable for professional use, tools such as Craiyon and Artbreeder have the advantage of being free and accessible to everyone. In addition, strange, abstract and surreal images can be inspiring or just plain fun.

You now know all about AI-based Text-to-Art image generators. However, this technology is still in its infancy and will evolve at a lightning pace over the years to come.

Artificial intelligence will soon be able to create images realistic enough to create the perfect illusion, or of an artistic beauty worthy of the greatest artists. Future tools will understand requests with greater precision, and demonstrate greater creativity.

In a short time, it will take only a few words to create real works of art, and even complete movies based on a simple idea. The revolution has begun, and the art world will be inexorably transformed.
