Thoughts on AI image generation

June 02, 2025

Text to Image

Whenever I look at generated images I want to edit them, especially those that are a little 'off'.

For example, these cats were generated using Black Forest Labs' FLUX.1 [schnell] model with the prompt: "Angry cat in Japanese style."

The upper lips are too thin, and the toe count is off by one.

[Image: cat with extra paw]

This cat has an extra human-like hand, and its furless foot looks odd.

With a Reference Image

When I gave the model a specific reference image, the end result still looked off. This is the original:

"The Treachery of Images" by René Magritte, 1929

And here's the generated image with a twist:

[Image: René Magritte's pipe image with the last word being AI]

An image I generated with Gemini 2.5 Flash

Perhaps editing the original would've been better, as I only wanted to change the text.

The original painting's text reads: "This is not a pipe" (in French, "Ceci n'est pas une pipe"). It's a representation of a pipe, not the pipe itself.

I changed it to say: "This is not AI." It's a representation of an AI-generated image.

Prediction

AI tools are affordable now, but their prices will rise once the market consolidates. This is similar to how Uber and Lyft kept fares low to stave off competitors and then raised them.

In AI's case, the big providers (e.g. Google) will drive other providers out of business before raising prices.

How to avoid vendor lock-in?

One way is to use open-source models (they're on Hugging Face), but it's hard for them to compete against better-funded models.
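Whichever model you pick, it may help to keep your own code from depending on one provider's SDK. Here's a minimal sketch of that idea in Python. Everything in it is an assumption for illustration: the `ImageGenerator` class and the two stand-in backends are hypothetical names, not any real library's API, and the stand-ins return placeholder bytes instead of calling a model.

```python
from dataclasses import dataclass
from typing import Callable, Dict


@dataclass
class ImageResult:
    backend: str  # which backend produced the image
    prompt: str   # the prompt that was used
    data: bytes   # raw image bytes (placeholder bytes in this sketch)


# A backend is any function that turns a prompt into image bytes.
Backend = Callable[[str], bytes]


class ImageGenerator:
    """Hypothetical wrapper: application code talks to this, not to a vendor SDK."""

    def __init__(self) -> None:
        self._backends: Dict[str, Backend] = {}

    def register(self, name: str, backend: Backend) -> None:
        self._backends[name] = backend

    def generate(self, prompt: str, backend: str) -> ImageResult:
        if backend not in self._backends:
            raise KeyError(f"unknown backend: {backend}")
        return ImageResult(backend, prompt, self._backends[backend](prompt))


# Stand-in backends (assumptions). Real ones would call a hosted API
# or run a local open-source model.
def fake_hosted(prompt: str) -> bytes:
    return b"hosted:" + prompt.encode("utf-8")


def fake_local(prompt: str) -> bytes:
    return b"local:" + prompt.encode("utf-8")


generator = ImageGenerator()
generator.register("hosted", fake_hosted)
generator.register("local", fake_local)

# Switching providers is a one-word change at the call site.
result = generator.generate("Angry cat in Japanese style", backend="local")
print(result.backend, len(result.data))
```

If a hosted provider raises its prices, only the registered backend needs to change; the rest of the code keeps calling `generate`.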

Another option is to keep developing core skills (e.g. coding, drawing, writing, editing, animation) and avoid becoming dependent on AI tools. One can still use them to augment existing skills, but they should not be a crutch.