Buckle up girls and gents, we now have a brand new AI picture generator on the town, and it’s surprisingly good.
It is stunning as a result of it comes from Google and since it isn’t the essential, considerably ugly, lazy generator you’re used to seeing in Bard. It’s additionally hidden from most of the people —however that doesn’t imply you possibly can’t use it.
Its title is ImageFX and it’s Google’s newest enterprise into the realm of AI picture era. It’s out there by way of Google’s AI Take a look at Kitchen, an experimental platform that enables customers to work together with Google’s initiatives whereas they’re nonetheless in improvement.
Regardless of being in its early beta section, ImageFX supplies superb outcomes when it comes to accuracy and photorealism. Its availability, nonetheless, is confined to particular areas, particularly the U.S, Kenya, New Zealand, and Australia, and its utilization is restricted to English, demonstrating Google’s cautious strategy and its want for a managed surroundings for person suggestions and system refinements.
These dwelling outdoors the allowed areas may bypass geographical restrictions with strategies like VPNs or proxies—at their very own threat.
Powering ImageFX is Imagen 2, a classy AI mannequin developed by Google’s famend AI lab, DeepMind. Imagen 2 is designed to interpret and visualize textual prompts, boasting capabilities to provide various photos and kinds. Google asserts that Imagen 2 units a brand new commonplace for picture high quality amongst its era of AI fashions.
The introduction of ImageFX is a part of Google’s broader technique to discover varied aspects of generative synthetic intelligence. It joins a set of specialised instruments, together with MusicFX for music creation and TextFX for stylized textual content era.
Google vs. Dall-e 3 vs. MidJourney
Google’s ImageFX marks a notable entry into the realm of AI-driven picture turbines, immediately competing with established gamers like Dall-E 3 and MidJourney. A definite edge for ImageFX in its early beta section is its cost-free entry, diverging from Dall-E’s integration with ChatGPT at a month-to-month fee of $20, and MidJourney’s annual subscription nearing $100.
Whereas cost-effectiveness is an enormous issue, it is the comparative options and output high quality that units these instruments aside. ImageFX excels in producing hyperrealistic photos, surpassing Dall-E 3’s considerably cartoonish renditions and MidJourney’s concentrate on aesthetically interesting visuals.
However simply because ImageFX is free doesn’t imply it’s unhealthy. ImageFX presents distinctive options like seed management, permitting customers to finely tune the artistic course of by adjusting the preliminary noise configuration. This stage of management is unmatched by Dall-E 3 or MidJourney, permitting customers to make refined changes whereas sustaining the core components of the picture.
Moreover, ImageFX can spotlight key immediate phrases and counsel artistic options—a characteristic not out there from its opponents.
ImageFX does have its limitations, nonetheless. The device completely generates sq. photos, whereas Dall-E 3 and MidJourney present flexibility in facet ratios. Furthermore, not like MidJourney, ImageFX doesn’t assist picture enhancing options like inpaint and outpaint, limiting its versatility. Lastly, the conversational characteristic of Dall-E 3—which permits freshmen to instruct the mannequin in pure language—contrasts with the keyword-based prompting required by ImageFX and MidJourney.
The strategy to prompting differs considerably amongst these fashions, too. ImageFX doesn’t assist unfavorable prompting, which lets customers specify what to exclude from the picture. MidJourney presents this performance, including a layer of precision to the artistic course of. Dall-E 3 additionally lacks direct unfavorable prompting, however its conversational interface permits customers to information the mannequin not directly, providing a distinct strategy to refining picture outputs.
A picture is value a thousand phrases
Decrypt received entry to ImageFX and was in a position to examine its generations in opposition to MidJourney and Dall-E 3. We used the identical immediate for all fashions and the outcomes beneath are at all times introduced in the identical order from left to proper: First is ImageFX, second is MidJourney, and third is Dall-E 3.
Photorealism:
Immediate: Picture of a cryptocurrency dealer with anxious expression

Each ImageFX and MirJourney generated fairly real looking outcomes. Nevertheless when it comes to fashion, ImageFX appears photorealistic whereas MidJourney appears a bit extra hyperrealistic, that means the primary is extra true to life whereas the second is extra creative, with saturated colours, exaggerated bokeh, and many others.
Dalle-3 fails to generate pictures. As an alternative it created a 3d render focusing extra on the content material. It’s simpler to inform it was a crypto dealer due to the charts within the background, nevertheless it was positively not a photograph.
Illustrations:
Immediate: Illustration of a mysterious bear browsing a cybernetic wave

This immediate was just a little bit extra summary to check how fashions interpret non-standard concepts. ImageFX and MidJourney generated probably the most aesthetically pleasing photos, however MidJourney appears extra like a render than an illustration and ImageFX tried to seize the essence of what a cybernetic wave might be. As an alternative, MidJourney related the time period “cybernetic” to the bear. Dall-e 3 captured the essence extra intently. It was clearly an illustration, and it resembles the cybernetic aesthetic, however the bear’s morphology is incorrect, and the picture lacks in high quality in opposition to its opponents.
Lengthy natural-language:
Immediate: Extremely detailed images scifi shut up of a mysterious laptop professional engaged on a laptop computer . Behind him, an FBI agent awaits to seize him huge shot photorealistic intricate

With a purpose to conduct this comparability, the immediate for MidJourney was modified to “extremely detailed images scifi shut up of a mysterious laptop professional engaged on a laptop computer with an FBI agent behind him awaiting to seize him, huge shot, photorealistic, intricate.”
MidJourney refused to generate photos below the primary immediate.
ImageFX generates a pleasant, detailed {photograph} respecting all the main points. MidJourney didn’t generate a “mysterious” laptop professional. It additionally sticks to its signature fashion with extreme bokeh and attention-grabber gentle trails or rain droplets on the totally different generations. This was the very best instance, as the remaining appeared to depict an astronaut, a cyberpunk marine, or one thing comparable. Dall-E generates a picture through which all the weather of the immediate are recognizable—the FBI emblem , the mysterious laptop professional, and many others.—however it’s not a photograph, and the anatomy of the hacker is incorrect, that includes the standard spaghetti fingers.
Textual content in Picture:
Immediate: A futuristic metropolis with a neon signal saying “EMERGE by Decrypt”

Normally, the very best textual content generator is Dall-e 3 by far, Nevertheless, on this particular case and below the circumstances set by the comparability’s methodology, it didn’t correctly write the textual content. ImageFX couldn’t generate the entire phrase—its textual content era capabilities are there, however in all probability are the least spectacular of the bunch.
That mentioned, Dall-E and ImageFX have been the very best at capturing the essence of what a futuristic metropolis is whereas MidJourney generated an aesthetically pleasing metropolis however not one which’s futuristic in any respect.
Conclusion
AI aficionados are actually blessed with a cornucopia of AI fashions that serve many wants. With most supplied free of charge, there’s no want to choose winners—every has a selected use case that makes it stand out.
ImageFX is the very best of the three should you don’t wish to spend cash. It is usually the very best when it comes to photorealism.
MidJourney just isn’t good at respecting the prompts however is ideal for these on the lookout for aesthetically pleasing photos.
Dall-E 3 is the very best for freshmen who wish to generate renders and don’t wish to even take into consideration immediate engineering, key phrases and parameters and as a substitute simply wish to speak to its AI as if it was simply one other pal.
However yeah, if you need a conclusion, we appreciated ImageFX—quite a bit.
Edited by Ryan Ozawa.








