The Complete AI Image Prompt Guide Built from 300 Design Evaluations

The Complete AI Image Prompt Guide Built from 300 Design Evaluations

A systematic YAML 7-Part Structure for AI image prompts, derived from analyzing 300+ infographic design evaluations in Banana X. Includes high-scoring patterns and domain-specific templates.

Introduction: Stop Settling for “Just Pretty” Images

Ever typed “modern clean blog hero image” into an AI image generator? The result was probably a bland, generic image you’ve seen a thousand times. When your prompt is vague, the AI produces vague images.

To solve this, we analyzed 300+ infographic designs evaluated across 5 criteria in the Banana X project. The result? High-scoring designs share clear, consistent patterns — and we distilled those patterns into a systematic prompt framework called the YAML 7-Part Structure.

The 5 Evaluation Criteria (50 Points Total)

CriterionPointsDescription
Legibility10Can the information be clearly read?
Hierarchy10Is the visual priority of information distinguishable?
Consistency10Do all design elements follow a unified grammar?
Atmosphere10Does the style reinforce the content’s meaning?
Theme Fit10Does the visual match the post’s topic?

By extracting the common traits of designs scoring 45+ out of 50, we created a prompt writing framework anyone can follow.


1. YAML 7-Part Structure: The Skeleton of a Great Prompt

Every image prompt must include these 7 elements. Let’s walk through each part with BAD vs GOOD examples.

Part 1: Tone — 5 Mood Keywords

Tone: "keyword1, keyword2, keyword3, keyword4, keyword5"

Define the entire world of your image with 5 adjectives or nouns. The more specific, the better.

❌ BAD:

Tone: "modern, clean, professional"

→ Applies to literally every design. Zero directional guidance for the AI.

✅ GOOD:

Tone: "intellectual, planned, precise, engineering, blueprint"

→ “An intellectual, precisely engineered blueprint” — a clear visual world emerges.

High-Scoring (45+) Tone Patterns:

StyleTone Keywords
Minimal/Line Artsimple, refined, silence, mode, mature
Blueprint/Technicalintellectual, planned, precise, engineering, blueprint
Paper Craftwarm, handcrafted, storybook, dimensional
Neumorphismfuturistic, clean, soft, UI, minimal
Cyberpunk/Circuitstark, grid, network, future, intelligence
Newspaper/Classicjournalism, gravity, classic, authority, impact

Part 2: Visual Identity — Color Palette

Visual Identity:
  Background: "#HEX (Color Name) — role description"
  Text Color: "#HEX (Color Name)"
  Accent Colors:
    - "#HEX (Name) — purpose"
    - "#HEX (Name) — purpose"

When specifying colors, always include both the HEX code and the color name. This ensures the AI interprets colors accurately.

❌ BAD:

Visual Identity:
  Background: "blue"
  Accent Colors:
    - "yellow"

→ “Blue” could mean anything from navy to sky blue. The AI has to guess.

✅ GOOD:

Visual Identity:
  Background: "#0047AB (Cobalt Blue) — blueprint background"
  Text Color: "#FFFFFF (White)"
  Accent Colors:
    - "#FFD700 (Gold) — key highlights"
    - "#ADD8E6 (Light Blue) — auxiliary lines"

→ Precise color codes + roles keep the AI’s color scheme consistent.

High-Scoring Color Combinations:

StyleBackgroundTextAccent
Blueprint#0047AB (Blue)#FFFFFF (White)#FFD700 (Yellow)
Minimal#FFFFFF (White)#000000 (Black)#D3D3D3 (Light Gray)
Art Deco#050505 (Rich Black)#D4AF37 (Gold)#C0C0C0 (Silver)
Cyberpunk#000033 (Dark Blue)#00FFFF (Cyan)#1E90FF (Dodger Blue)
Paper CraftPastel tonesCut-out styleComplementary pastels

Part 3: Image Style — Core Visual Approach

This part determines what the image actually looks like. It consists of 5–6 sub-fields.

Image Style:
  Features: "Core visual approach in one sentence"
  Shapes: "Shapes and motifs to use"
  Texture: "Surface texture"
  Composition: "Layout and framing"
  Effects: "Visual effects" (optional)
  Imagery: "Specific image elements" (optional)

BAD vs GOOD for each field:

Features — The most important single sentence:

  • "Clean modern design" → Matches every design ever
  • "Layout composed of PCB circuit board patterns" → Unique to this design

Shapes — Recurring visual elements:

  • "Various shapes" → Zero information
  • "Straight lines and 45-degree traces, nodes, connectors" → Concrete visual vocabulary

Texture — Tactile expression:

  • "Smooth" → Insufficient
  • "Washi paper fibers, woodblock grain, ink gradations" → Physical presence

Composition — Visual hierarchy strategy:

  • "Centered layout" → Too simple
  • "A few thin lines in the center of overwhelming white space" → Includes spatial strategy

Part 4: Typography — Font Style

Typography:
  Heading: "Heading font style"
  Body: "Body font style" (optional)
  Style: "How the font is applied"

❌ BAD:

Typography:
  Heading: "Sans-serif"

✅ GOOD:

Typography:
  Heading: "Drafting stencil font"
  Style: "Hand-drawn block letters, placed like dimension lines and annotation labels"

High-Scoring Typography Patterns:

StyleHeadingStyle
BlueprintDrafting stencil fontHand-drawn block letters
MinimalThin sans-serif (Light/Thin)Thin enough to feel spacious even at large sizes
NeumorphismRounded sans-serifEmbossed/debossed like button labels
Paper CraftCut-out letters, rounded fontsLetters that look cut from paper
NewspaperBlackletter (masthead style)Tightly set type

Part 5: Content Connection — Linking to Post Content

Content Connection:
  Core Concept: "The key concept this post covers"
  Visual Metaphor: "What metaphor visualizes the concept"
  Key Elements: "2-3 core visual elements extracted from the post"

This is the most important differentiator. Without this section, you’ll get a generic image that could belong to any post.

❌ BAD (No Content Connection): → A “React” post’s hero image could just as easily be used for a “Vue” post

✅ GOOD:

Content Connection:
  Core Concept: "Component separation between server and client to reduce bundle size"
  Visual Metaphor: "A building (app) splitting into server floors and client floors as an architectural blueprint"
  Key Elements: "Server zone (blue), client zone (green), data flow arrows"

→ An image direction that only makes sense for a “React Server Components” post

Part 6: Constraints — Non-Negotiables

Constraints: "No text overlay. No watermarks. 2:1 aspect ratio. No photorealistic human faces."

4 mandatory constraints you must always include:

ConstraintReason
No text overlayAI-generated text is almost always garbled
No watermarksPrevents watermark artifacts
2:1 aspect ratioBlog hero image ratio
No photorealistic human facesAvoids uncanny valley faces

Part 7: Self-Check — 3-Point Validation

After writing your prompt, verify these 3 things:

  1. Uniqueness Test: “Could this prompt be used for a completely different post?” → If yes, Content Connection is lacking
  2. Visual Specificity Test: “Would two people reading this prompt draw similar images?” → If no, Shapes/Texture/Composition need work
  3. Consistency Test: “Do the 5 Tone keywords conflict with the Color Palette + Image Style?” → If there’s a conflict, revise

2. The Common Laws of High-Scoring Designs

Here are the shared traits of designs scoring 45+ out of 50 across 300 evaluations.

The Secret to Consistency 10/10: Unified “Design Grammar”

Designs that achieved perfect consistency scores apply the same design grammar to every element:

  • Line weight is uniform throughout
  • Icon abstraction levels are consistent
  • Color usage follows rules (e.g., red = emphasis, blue = supporting)
  • Texture is synchronized across all areas

How to achieve this in your prompt:

# ❌ BAD: No texture description → AI uses different textures for each part
Image Style:
  Features: "Modern design with icons"

# ✅ GOOD: Explicit texture rules for the entire image
Image Style:
  Features: "Composed entirely of uniform ultra-thin lines"
  Texture: "All lines at identical weight (0.5pt equivalent)"
  Composition: "Unified abstraction level for all symbols"

The Secret to Atmosphere 10/10: Style as “Container for Information”

Perfect atmosphere scores go to designs where the style isn’t mere decoration — it reinforces the content’s meaning. The design should shift the reader’s emotional state.

ScorePatternWhy It Scores High
50/50Minimal/Line ArtMasterful understanding of information through subtraction
49/50Blueprint/TechnicalPerfect metaphor of “thought as architectural plan”
49/50Paper CutoutPaper depth physically represents information layers
49/50NeumorphismEmboss/deboss physically represents priority levels

The Secret to Theme Fit 10/10: Repurposing a Style’s Essence

The key is understanding a style’s core essence and applying it to information expression:

  • Blueprint → Uses the “design/planning” metaphor to express the information construction process
  • Newspaper → Uses the “journalism” format to emphasize information gravity
  • Ukiyo-e → Uses the aesthetic of “refined elegance” to elevate information dignity

Analyzing Perfect Score (50/50) Designs

What all 50/50 designs have in common:

  1. 100% commitment to one style — No style mixing
  2. Specific Content Connection — 1:1 correspondence with the post content
  3. Strict constraint compliance — No text overlay, exact ratio
  4. Tone ↔ Style ↔ Color triangular consistency — All three point to the same visual world

3. Domain-Specific Prompt Templates

Here are prompt templates optimized for 5 technical domains. Each defines a visual language suited to its domain — just fill in the Content Connection for your specific post.

3.1 Web Development / Frontend

Tone: "modern, constructive, layered, component, clean"
Visual Identity:
  Background: "#FFFFFF (White)"
  Text Color: "#1A1A1A (Near Black)"
  Accent Colors:
    - "#61DAFB (React Blue) — framework signature color"
    - "#F7DF1E (JavaScript Yellow) — supporting"
Image Style:
  Features: "Architectural composition of stacking component blocks"
  Shapes: "Rounded rectangles, connection lines, nested structures"
  Texture: "Flat, clean digital surface"
  Composition: "Left-to-right data flow, or component tree"
Typography:
  Heading: "Modern sans-serif (Inter, SF Pro)"
  Style: "Monospace accents like a code editor"
Constraints: "No text overlay. No watermarks. 2:1 aspect ratio."

Key idea: Stacking component blocks in an architectural composition to express frontend’s “assembly” nature.

3.2 AI / Machine Learning

Tone: "intelligence, network, future, glow, neural"
Visual Identity:
  Background: "#0A0A2E (Deep Space Blue)"
  Text Color: "#E0E0FF (Light Lavender)"
  Accent Colors:
    - "#00FFCC (Cyan Glow) — neural connections"
    - "#FF6B6B (Coral) — data points"
Image Style:
  Features: "Abstract representation of neural network nodes and connections"
  Shapes: "Glowing spherical nodes, curved connection lines, data streams"
  Texture: "Luminous particles floating in dark space"
  Composition: "Central hub with radially expanding network"
  Effects: "Glow effects, subtle bokeh"
Typography:
  Heading: "Thin futuristic sans-serif (Exo, Audiowide)"
  Style: "Glowing text"
Constraints: "No text overlay. No watermarks. 2:1 aspect ratio."

Key idea: Glowing node networks against deep space blue — visualizing AI/ML’s essence of “connection and learning.”

3.3 DevOps / Infrastructure

Tone: "design, precision, planning, intellectual, industrial"
Visual Identity:
  Background: "#0047AB (Blueprint Blue)"
  Text Color: "#FFFFFF (White)"
  Accent Colors:
    - "#FFD700 (Yellow) — highlights"
    - "#ADD8E6 (Light Blue) — auxiliary lines"
Image Style:
  Features: "Architectural blueprint aesthetic"
  Shapes: "Grid squares, dimension lines, cross-sections, arrows"
  Texture: "Photosensitive paper, drafting paper"
  Composition: "Strict grid-based, organized information density"
Typography:
  Heading: "Drafting lettering (Architect's Daughter style)"
  Style: "All caps, disciplined placement"
Constraints: "No text overlay. No watermarks. 2:1 aspect ratio."

Key idea: Blueprint aesthetic — expressing DevOps’ “infrastructure design” essence through architectural drawings.

3.4 Performance / Optimization

Tone: "speed, efficiency, lightweight, refined, optimization"
Visual Identity:
  Background: "#FFFFFF (White)"
  Text Color: "#000000 (Black)"
  Accent Colors:
    - "#00C853 (Success Green) — post-optimization"
    - "#FF5252 (Error Red) — pre-optimization"
Image Style:
  Features: "Minimalism stripped to the absolute essentials"
  Shapes: "Ultra-thin lines, geometric shapes, graph curves"
  Texture: "Matte and smooth"
  Composition: "Overwhelming whitespace, before/after contrast"
Typography:
  Heading: "Thin sans-serif (Light / Thin)"
  Style: "Numbers large, everything else restrained"
Constraints: "No text overlay. No watermarks. 2:1 aspect ratio."

Key idea: Extreme minimalism — expressing “optimization = removing the unnecessary” through whitespace.

3.5 Security

Tone: "robust, trust, layers, protection, encryption"
Visual Identity:
  Background: "#0F0F0F (Almost Black)"
  Text Color: "#00FF00 (Terminal Green)"
  Accent Colors:
    - "#FF0000 (Alert Red) — threats"
    - "#00BFFF (Cyan) — protection layers"
Image Style:
  Features: "PCB circuit board and encryption metaphor"
  Shapes: "Shields, keys, locks, multi-layered concentric circles"
  Texture: "Dark metallic surface, faint circuit patterns"
  Composition: "Central protected asset surrounded by multiple layers"
  Effects: "Faint scanline sweep"
Typography:
  Heading: "Monospace (Fira Code)"
  Style: "Terminal output display"
Constraints: "No text overlay. No watermarks. 2:1 aspect ratio."

Key idea: Dark background + terminal green — expressing security’s “defense in depth” essence with concentric layers.


4. Converting YAML → English Prompts

To feed your YAML-structured prompt into an AI image generation API, you need to convert it into an English natural language prompt.

Conversion Rules

YAML PartPosition in English PromptConversion Method
ToneOpening phrase"A [tone1], [tone2] illustration..."
Visual IdentityColor specification"...in [color1] (#HEX) and [color2] (#HEX)..."
Image StyleCore descriptionCompress Features + Shapes + Texture into 1–2 sentences
CompositionLayout direction"...with [layout description]..."
Content ConnectionMetaphor descriptionVisual Metaphor as the key sentence
ConstraintsEnd of prompt"No text overlay. No watermarks. 2:1 aspect ratio."

Conversion Template

A [Tone keywords] illustration of [Features description].
[Shapes description] arranged in [Composition description].
[Texture description] with [Effects if any].
Color palette: [Background] background, [Accent colors] for key elements.
[Typography style if relevant to the visual].
[Content Connection: Visual Metaphor].
No text overlay. No watermarks. Suitable for 2:1 aspect ratio blog hero image.

Full Example: Next.js App Router Migration

YAML Design:

Tone: "constructive, modern, migration, evolution, architecture"

Visual Identity:
  Background: "#FFFFFF (White) — clean architectural drawing"
  Text Color: "#000000 (Black)"
  Accent Colors:
    - "#0070F3 (Next.js Blue) — new App Router side"
    - "#999999 (Gray) — old Pages Router side"
    - "#00C853 (Green) — success/improvement arrows"

Image Style:
  Features: "Like an architectural renovation, showing migration from old to new structure via blueprints"
  Shapes: "Left: old floor plan (Pages Router), Right: new floor plan (App Router), Center: transformation arrow"
  Texture: "Grid paper, fine drafting lines"
  Composition: "Left-right split. Left = old (gray tones), Right = new (blue tones), large arrow in center"
  Effects: "Right side with subtle glow"

Content Connection:
  Core Concept: "Incremental migration from Pages Router to App Router"
  Visual Metaphor: "An architect renovating an old building with a new floor plan"
  Key Elements: "Gray blueprint of old structure, blue blueprint of new structure, green migration arrows"

Constraints: "No text overlay. No watermarks. 2:1 aspect ratio. Blueprint aesthetic."

Converted English Prompt:

An architectural blueprint illustration showing migration from old to new
structure. Left side in gray shows a legacy floor plan (Pages Router), right
side in Next.js blue (#0070F3) shows a modern floor plan (App Router). A large
green arrow connects them through the center. Grid paper background with fine
drafting lines. Left half is muted gray tones, right half is bright blue tones
with subtle glow. Technical annotation style labels. Clean, architectural,
precise. No text overlay. No watermarks. 2:1 aspect ratio.

The key is compressing everything from your YAML structure into natural language without losing any important details.


Conclusion: Results and Final Checklist

Real-World Blog Application Results

After applying this guideline to actual blog hero image generation:

  • Before: Prompt “modern tech blog hero” → generic gradient image seen everywhere
  • After: YAML 7-Part Structure applied → a unique image perfectly aligned with the post content

The Content Connection part alone boosted Theme Fit scores from an average of 6 to 9 out of 10.

Final Checklist

After completing your prompt, run through this checklist:

  • Tone: Are the 5 keywords specific and mutually consistent?
  • Visual Identity: Are HEX codes + color names + purposes all specified?
  • Image Style: Are Features/Shapes/Texture/Composition each specific enough?
  • Typography: Is a style-appropriate font specified?
  • Content Connection: Is the visual metaphor meaningful only for this specific post?
  • Constraints: Are the 4 mandatory constraints (text, watermarks, ratio, faces) included?
  • Self-Check 3 Points: Did it pass the uniqueness / visual specificity / consistency tests?

Follow this structure, and AI will generate high-quality hero images that perfectly match your blog posts. Stop settling for “just pretty.” Use data-proven high-scoring patterns to create precise, meaningful images.

Read in Other Languages

Was this helpful?

Your support helps me create better content. Buy me a coffee! ☕

About the Author

JK

Kim Jangwook

Full-Stack Developer specializing in AI/LLM

Building AI agent systems, LLM applications, and automation solutions with 10+ years of web development experience. Sharing practical insights on Claude Code, MCP, and RAG systems.