Buckle up girls and gents, now we have a brand new AI picture generator on the town, and it’s surprisingly good.
It is stunning as a result of it comes from Google and since it is not the fundamental, considerably ugly, lazy generator you’re used to seeing in Bard. It’s additionally hidden from most people —however that doesn’t imply you possibly can’t use it.
Its title is ImageFX and it’s Google’s newest enterprise into the realm of AI picture era. It’s obtainable by way of Google’s AI Check Kitchen, an experimental platform that enables customers to work together with Google’s initiatives whereas they’re nonetheless in improvement.
Regardless of being in its early beta part, ImageFX offers superb outcomes by way of accuracy and photorealism. Its availability, nevertheless, is confined to particular areas, particularly the U.S, Kenya, New Zealand, and Australia, and its utilization is restricted to English, demonstrating Google’s cautious strategy and its want for a managed setting for consumer suggestions and system refinements.
These dwelling exterior the allowed areas may bypass geographical restrictions with strategies like VPNs or proxies—at their very own threat.
Powering ImageFX is Imagen 2, a classy AI mannequin developed by Google’s famend AI lab, DeepMind. Imagen 2 is designed to interpret and visualize textual prompts, boasting capabilities to provide numerous photographs and types. Google asserts that Imagen 2 units a brand new commonplace for picture high quality amongst its era of AI fashions.
The introduction of ImageFX is a part of Google’s broader technique to discover numerous aspects of generative synthetic intelligence. It joins a set of specialised instruments, together with MusicFX for music creation and TextFX for stylized textual content era.
Google vs. Dall-e 3 vs. MidJourney
Google’s ImageFX marks a notable entry into the realm of AI-driven picture turbines, instantly competing with established gamers like Dall-E 3 and MidJourney. A definite edge for ImageFX in its early beta part is its cost-free entry, diverging from Dall-E’s integration with ChatGPT at a month-to-month charge of $20, and MidJourney’s annual subscription nearing $100.
Whereas cost-effectiveness is an enormous issue, it is the comparative options and output high quality that units these instruments aside. ImageFX excels in producing hyperrealistic photographs, surpassing Dall-E 3’s considerably cartoonish renditions and MidJourney’s deal with aesthetically interesting visuals.
However simply because ImageFX is free doesn’t imply it’s unhealthy. ImageFX provides distinctive options like seed management, permitting customers to finely tune the artistic course of by adjusting the preliminary noise configuration. This stage of management is unmatched by Dall-E 3 or MidJourney, permitting customers to make refined changes whereas sustaining the core parts of the picture.
Moreover, ImageFX can spotlight key immediate phrases and recommend artistic alternate options—a characteristic not obtainable from its opponents.
ImageFX does have its limitations, nevertheless. The device completely generates sq. photographs, whereas Dall-E 3 and MidJourney present flexibility in side ratios. Furthermore, not like MidJourney, ImageFX doesn’t assist picture modifying options like inpaint and outpaint, limiting its versatility. Lastly, the conversational characteristic of Dall-E 3—which permits rookies to instruct the mannequin in pure language—contrasts with the keyword-based prompting required by ImageFX and MidJourney.
The strategy to prompting differs considerably amongst these fashions, too. ImageFX doesn’t assist unfavourable prompting, which lets customers specify what to exclude from the picture. MidJourney provides this performance, including a layer of precision to the artistic course of. Dall-E 3 additionally lacks direct unfavourable prompting, however its conversational interface permits customers to information the mannequin not directly, providing a unique strategy to refining picture outputs.
A picture is value a thousand phrases
Decrypt bought entry to ImageFX and was in a position to evaluate its generations in opposition to MidJourney and Dall-E 3. We used the identical immediate for all fashions and the outcomes under are at all times offered in the identical order from left to proper: First is ImageFX, second is MidJourney, and third is Dall-E 3.
Photorealism:
Immediate: Picture of a cryptocurrency dealer with fearful expression

Each ImageFX and MirJourney generated fairly sensible outcomes. Nevertheless by way of fashion, ImageFX appears photorealistic whereas MidJourney appears to be like a bit extra hyperrealistic, that means the primary is extra true to life whereas the second is extra inventive, with saturated colours, exaggerated bokeh, and so on.
Dalle-3 fails to generate photographs. As an alternative it created a 3d render focusing extra on the content material. It’s simpler to inform it was a crypto dealer due to the charts within the background, however it was positively not a photograph.
Illustrations:
Immediate: Illustration of a mysterious bear browsing a cybernetic wave

This immediate was a little bit bit extra summary to check how fashions interpret non-standard concepts. ImageFX and MidJourney generated essentially the most aesthetically pleasing photographs, however MidJourney appears to be like extra like a render than an illustration and ImageFX tried to seize the essence of what a cybernetic wave might be. As an alternative, MidJourney related the time period “cybernetic” to the bear. Dall-e 3 captured the essence extra intently. It was clearly an illustration, and it resembles the cybernetic aesthetic, however the bear’s morphology is unsuitable, and the picture lacks in high quality in opposition to its opponents.
Lengthy natural-language:
Immediate: Extremely detailed pictures scifi shut up of a mysterious pc knowledgeable engaged on a laptop computer . Behind him, an FBI agent awaits to seize him huge shot photorealistic intricate

With a view to conduct this comparability, the immediate for MidJourney was modified to “extremely detailed pictures scifi shut up of a mysterious pc knowledgeable engaged on a laptop computer with an FBI agent behind him awaiting to seize him, huge shot, photorealistic, intricate.”
MidJourney refused to generate photographs below the primary immediate.
ImageFX generates a pleasant, detailed {photograph} respecting all the main points. MidJourney didn’t generate a “mysterious” pc knowledgeable. It additionally sticks to its signature fashion with extreme bokeh and attention-grabber gentle trails or rain droplets on the totally different generations. This was the most effective instance, as the remaining appeared to depict an astronaut, a cyberpunk marine, or one thing related. Dall-E generates a picture through which all the weather of the immediate are recognizable—the FBI emblem , the mysterious pc knowledgeable, and so on.—however it’s not a photograph, and the anatomy of the hacker is unsuitable, that includes the standard spaghetti fingers.
Textual content in Picture:
Immediate: A futuristic metropolis with a neon signal saying “EMERGE by Decrypt”

Normally, the most effective textual content generator is Dall-e 3 by far, Nevertheless, on this particular case and below the circumstances set by the comparability’s methodology, it didn’t correctly write the textual content. ImageFX couldn’t generate the entire phrase—its textual content era capabilities are there, however most likely are the least spectacular of the bunch.
That stated, Dall-E and ImageFX had been the most effective at capturing the essence of what a futuristic metropolis is whereas MidJourney generated an aesthetically pleasing metropolis however not one which’s futuristic in any respect.
Conclusion
AI aficionados at the moment are blessed with a cornucopia of AI fashions that serve many wants. With most provided totally free, there’s no want to choose winners—every has a particular use case that makes it stand out.
ImageFX is the most effective of the three if you happen to don’t need to spend cash. Additionally it is the most effective by way of photorealism.
MidJourney just isn’t good at respecting the prompts however is ideal for these searching for aesthetically pleasing photographs.
Dall-E 3 is the most effective for rookies who need to generate renders and don’t need to even take into consideration immediate engineering, key phrases and parameters and as an alternative simply need to speak to its AI as if it was simply one other good friend.
However yeah, in order for you a conclusion, we favored ImageFX—so much.
Edited by Ryan Ozawa.