How to Make AI Comics with Midjourney (2026 | V7)

After about a year, I returned to Midjourney to test its capabilities for AI comic creation. Unfortunately, I have to say that I’m disappointed by the lack of progress.

I used Version 7 for this project. While I was working on the comic, Midjourney released V8, but I was already about 90% finished with V7, so I decided to stick with it. From what I’ve seen so far, V8 doesn’t appear to offer significant improvements in the areas that matter most for comic creation: character consistency and environment consistency.

If you’d like to check out the comic, you can find it here: The Last Superhero Part 5

How to Create AI Comics with Midjourney

Create Master Prompts

You want your style, characters, and settings to be defined through master prompts. Keep them as short as possible to reduce prompt fading.

Examples:

  • Protagonist: mature man, grey hair, grey beard, stern face, dark coat, layered dark clothing, tall
  • Female lead: woman in her 30s, dark hair, slim build, worn clothing, subtle manipulative expression
  • Style: comic, Scott Snyder style, black and red colors
  • Mountain setting: mountains, canyon road, military convoy driving

Go through your script and create master prompts for all characters and settings involved. The style master prompt should be added to every image prompt.

This approach doesn’t guarantee consistency, but it improves it as much as currently possible in Midjourney.

Create Character Image Sheets

Next, create character sheets for your main characters. For my comic, I created sheets for the protagonist, the female lead, and the NPC soldiers. At a minimum, I recommend creating the following:

Examples:

  • Character Main Sheet: [style master prompt + character master prompt], full-body shot
  • Character Face Sheet: [style master prompt + character master prompt], multiple panels, different facial expressions (or angles), plus specific emotions if needed (e.g., surprised)

The main sheet serves as your primary character reference. You can upload these images to Midjourney and use them as character references whenever that character appears in a scene.

The face sheet can be used as an overlay. You can place it on top of larger scene images whenever a character is speaking. I’ve used this technique many times.

Use Character Sheets in Two Ways

Once you’ve established a consistent appearance for your characters, crop the image in two different ways:

  1. Full-body shot: Remove as much of the background and non-character details as possible.
  2. Face shot: Crop tightly around the character’s face.

You can then use both images as character references, which helps improve consistency.

Tips from 2025 Still Apply

If you haven’t read my guide from last year, nearly all of the advice still applies: How to Make Comics with AI (Midjourney | 2025)

Problems with Creating AI Comics with Midjourney

My previous attempt using ChatGPT produced significantly better results. The main reason is that Midjourney still hasn’t solved the two most important issues:

  1. Consistency
  2. Generating multiple characters in the same image

Character consistency remains a major problem, even with the option to upload reference images. Style is difficult to control and can easily fall apart, especially when prompts become longer. There is also still no reliable way to maintain environmental consistency across scenes.

Even worse, there has been virtually no improvement when generating images with multiple characters. It seems that using several character references causes features to blend together. The same issue occurs with character actions. As a result, it’s extremely difficult to create scenes where two characters interact naturally while maintaining consistent appearances from panel to panel.

Other Problems

Language Filter
You still can’t use many words associated with violence. Since most action scenes involve some form of violence, this limitation makes it difficult to tell traditional comic-book stories.

Prompt Fading
The longer your prompts become, the more tokens tend to “fade.” In practice, this means Midjourney starts ignoring certain parts of your prompt.

Style Drift
Maintaining a consistent visual style was even more difficult than a year ago. If you compare the first pages with the final pages of TLS 5, you’ll notice a significant style drift that I wasn’t able to control.

Improvements Compared to Earlier Midjourney Versions

Faster Image Generation

Midjourney has become even faster. You can generate roughly 500 images per hour, giving you plenty of options to choose from. This remains a significant advantage over ChatGPT’s image generation.

High-Quality Individual Images

When focusing on individual images, the quality is excellent. With enough patience, it’s possible to create impressive standalone comic pages.

Conclusion

Overall, there hasn’t been much progress—especially in the areas that matter most. Consistency remains a major challenge, and the language filter makes action scenes unnecessarily difficult to create.

In many respects, ChatGPT’s V5 image generation already produced better results for comic creation. My next step will be testing ChatGPT’s newest image model, which I’ve heard very positive things about. Perhaps that’s the model that finally brings us a significant step closer to creating truly convincing comics with AI.

How to Make Comics With ChatGPT (2026 – Image Model V5)

I got back into creating AI comics this year, starting with tests using ChatGPT’s latest image model, V5. You can read both generated comic stories here:

  1. The Last Superhero 3
  2. The Last Superhero 4

Part 3 was mainly a first test to see what the image model is capable of. In Part 4, I applied what I learned and aimed for better results. I’d say Part 3 delivered about 20% of what I had in mind, while Part 4 got closer to 60%.


What I Did to Improve the Results

Time Limits

There is a daily limit on how many images you can generate, even with a paid subscription. The free version gives you fewer than a handful of generations a day, making it unusable for a project like this.

To work around this, I focused on one or two pages per day and generated only the images needed for those pages. This way, I never hit the limit before finishing my daily work.


Generate More Images Per Prompt

I recommend adding token like “comic page layout” or “4 images in one with different perspectives” to every prompt. This gives you multiple images in a single generation.

Yes, the resolution drops, but if that matters to you, upscale the images afterwards.


Use Master Prompts

I created general “master prompts” for style and main characters, and even experimented with settings.

For example, “modern noir comic style” helps keep the visual style consistent and reduces style drift. The same applies to characters: “man in his 30s, short brown hair, grey hoodie”

Using these consistently improves overall coherence—although perfect consistency is still not possible.


Avoid Prompt Fading

If you add too many tokens to a prompt, the AI starts ignoring some of them—what I call “prompt fading.”

To avoid this, keep prompts short and focused. Use your master prompts, but limit scene descriptions to about 4–5 key elements and around 10 token max.


Let ChatGPT Refine Your Prompts

Describe your scene roughly and include your master prompts, then ask ChatGPT to generate a clean, concise version.

Most of the time, you can copy and paste that result directly for better outcomes than writing prompts manually.


Stay in the Same Chat

ChatGPT seems to retain context within a conversation, which can help with consistency if you generate all images in the same chat.

The downside is that long chats become slow and can glitch. This is something OpenAI should improve.


Accept Imperfection

Tools like Midjourney offer more control through adjustable parameters. ChatGPT doesn’t yet provide that level of precision.

Perfection isn’t achievable right now—so aim for “good enough” currently.


Work Around Slow Generation

Image generation is noticeably slower than with Midjourney.

A simple solution is to multitask: edit existing images in Photoshop (or another tool) while new ones are being generated.


Create Reference Sheets

Before starting, I generated text-based sheets for characters, settings, style, and actions.

Whenever ChatGPT lost consistency, I re-uploaded the relevant sheet to get it back on track (especially for characters).


Use a Color Token

Adding a consistent color theme helps unify the look.

For example, I used “orange-teal palette” in every prompt, which made the entire comic feel visually cohesive.


Generate Facial Close-Ups

Start by generating close-up facial expressions for each character. For example:
“man in his 30s, short brown hair, grey hoodie, angry, 4 images from different perspectives in one”

Do this for different emotions and characters. These images are useful for transitions between action scenes and as overlays.


Avoid Violence

Violence gets flagged very easily. Prompts involving fighting, injury, or killing often won’t render.

This is a major limitation—at the moment, you’re restricted to stories with minimal physical conflict.


Specify Aspect Ratio

Aspect ratio instructions are sometimes ignored, especially when using multi-image prompts.

Still, using terms like “horizontal shot” or “vertical shot” can help guide composition.


Handle Text Carefully

You can generate text within images, but results are inconsistent.

Sometimes the image looks good but the text is unusable—and sometimes the opposite. Because of this, I added dialogue later in Photoshop myself for better control.

Interestingly, simple text elements within scenes (like “police” on a car or a number like “0417” on clothing) worked surprisingly well and reliable.


Prompt Example

Here’s an example of a prompt I used:

  • Style prompt: modern noir comic style, cinematic lighting, orange-teal palette, sharp ink lines, graphic novel page
  • Multi-image prompt: panels layout with 4 perspective variations
  • Character prompt: 30s man, short dark hair, light stubble, grey hoodie

That is what I always added when the main character was part of a scene. Then I added the specific action for the scene. For example here is the final comic page:

  • Scene prompt: standing, watching suburban street, police car in distance, flashing lights, man standing still in foreground, police car driving away in background, distant perspective, quiet, tense aftermath

Full prompt:
modern noir comic style, cinematic lighting, orange-teal palette, sharp ink lines, graphic novel page, panels layout with 4 perspective variations, 30s man, short dark hair, light stubble, grey hoodie, standing, watching suburban street, police car in distance, flashing lights, man standing still in foreground, police car driving away in background, distant perspective, quiet, tense aftermath


Conclusion

The results were clearly better than my first attempt—and in some ways even better than what Midjourney produced for me last year.

However, it’s still far from offering the creative freedom needed for storytelling.

The biggest issue is content restriction: you can include tension, but not real action. No fights, no violence. Since these elements are essential to many comic narratives, this severely limits storytelling potential.

I reached about 50–60% satisfaction only by designing a story with reduced action. As soon as action becomes central, satisfaction drops drastically.

The Problems of Making Comics With ChatGPT (V5 – 2026)

My first attempt with ChatGPT (Version 5) is finished — The Last Superhero Part 3.

Right now, I’d say the result is maybe 20% of what I would like it to be. I plan to create Part 4 in the next couple of weeks and try to improve the results, as there is still some room for experimentation.

For now, here are the biggest problems I encountered.

Language Filter

Just like Midjourney, ChatGPT has a very strict language filter for image prompts. For text generation, you can tell ChatGPT that you’re working in a fictional setting. That allows you to describe certain acts of violence or crime to some degree.

With image generation, however, this isn’t possible at all. Even hinting at violence in a comic-book context can trigger the filter.

For example, I had problems generating an image where a character gets water splashed onto his face. That alone triggered the system.

The same happens with facial expressions. Pain alone might work, but pain combined with bruises often gets flagged — even without describing the action that caused them.

Time Limits for Image Generation

Don’t even try using the free version.

You might only get two or three images every couple of hours. For my 31-page comic, I generated more than 120 images.

Even the paid version has timeouts. After roughly every 20 images, ChatGPT asked me to wait a couple of hours before I could continue generating more.

Midjourney handles this much better — especially considering that the prices are somewhat comparable.

Style Drift

You can clearly see how the comic switches between different art styles. I tried to anchor the prompts around a specific comic-book artist, but every few images the style drifted again.

Prompt “Fading”

I’ve seen this with Midjourney as well. When prompts become too long, parts of them seem to fade away and become irrelevant. The AI then simply ignores those sections.

Character Consistency

Clothing and the general appearance are mostly fine, but the face of my protagonist drifted quite a lot.

Character consistency remains one of the biggest issues, especially if you attempt to create something larger like a 160-page comic.

Facial Details

Facial details are very difficult to control. My character’s beard looks slightly different in almost every image, and the hairstyle of the female doctor changes frequently as well.

Environment Consistency

This is similar to the character consistency problem. The more detailed the environment, the harder it becomes to keep it consistent across multiple images.

Chats Become Clunky and Glitchy

It helps to generate all images within the same chat, but once the conversation reaches around 20 prompts, things start to slow down. The chat becomes sluggish and sometimes even glitches.

User Experience

Overall, Midjourney still offers a better user experience. It’s easier to fine-tune prompts, results arrive faster, and the whole process feels more controlled.

Conclusion

There are quite a few issues. I think some of them can be improved with better prompting and a couple of workarounds.

For now, I would still recommend Midjourney for AI comics. That said, with a few adjustments I might be able to get better results with ChatGPT when creating Part 4 of The Last Superhero.