antigravity-claudekit/skills/ck-ai-artist/SKILL.md at 2d31c0a137999850e91759b1306cfa84c3fe789e

Files

nvtien 2d31c0a137 Initial commit: antigravity-claudekit

2026-02-16 14:02:42 +09:00

3.4 KiB

Raw Blame History

name, description

name	description
ck-ai-artist	Generates, edits, and refines images using AI image generation APIs. Activate when user says 'generate an image', 'create artwork', 'AI art', 'draw me', 'make a picture of', or 'image generation prompt'. Accepts text descriptions, style references, dimensions, and iteration requests.

Overview

Crafts optimized prompts and orchestrates AI image generation workflows. Handles prompt engineering, style specification, negative prompts, and iterative refinement for high-quality image outputs.

When to Use

User requests visual content creation from text descriptions
Generating concept art, illustrations, or mockup visuals
Iterating on existing images with style or composition changes
Batch generating image variants for A/B testing or design exploration
Creating consistent image sets with shared style parameters

Don't Use When

User needs photo editing of an existing uploaded image (use ck-ai-multimodal)
Task requires vector graphics or SVG creation (use a design tool)
User needs video generation (use ck-remotion or a video AI service)
The output will be used in a commercial context without verifying model license

Steps / Instructions

1. Clarify Image Requirements

Gather before generating:

Subject / main content
Style (photorealistic, illustration, watercolor, pixel art, etc.)
Mood / lighting / color palette
Dimensions / aspect ratio (e.g., 16:9, square, portrait)
Reference styles or artists (check license implications)

2. Craft the Prompt

Structure:

[subject], [environment/setting], [style], [lighting], [camera/perspective], [quality modifiers]

Example:

A futuristic city skyline at dusk, neon reflections on wet streets,
cyberpunk illustration style, cinematic lighting, wide angle,
high detail, vibrant colors

Negative prompt (things to avoid):

blurry, low quality, text, watermark, distorted faces, extra limbs

3. Select Model / API Parameters

Common parameters across providers:

{
  "prompt": "<optimized prompt>",
  "negative_prompt": "<what to avoid>",
  "width": 1024,
  "height": 1024,
  "steps": 30,
  "guidance_scale": 7.5,
  "seed": 42
}

Provider options (do not hardcode API keys — use env vars):

OPENAI_API_KEY → DALL-E 3
STABILITY_API_KEY → Stable Diffusion / SDXL
REPLICATE_API_TOKEN → various open models

4. Iterate and Refine

After first generation:

Review output against intent
Adjust prompt — add specifics for weak areas
Tweak guidance scale (higher = more prompt-adherent, less creative)
Try seed variations for composition changes
Use img2img if base composition is close but needs refinement

5. Post-Processing Suggestions

Upscale with Real-ESRGAN for print-quality output
Use inpainting to fix specific regions
Apply style transfer for consistent series

Prompt Engineering Tips

Be specific about count: "three red apples" not "some apples"
Lighting has major impact: "golden hour", "studio softbox", "moonlight"
Add medium: "oil painting", "digital illustration", "pencil sketch"
Quality boosters: "highly detailed", "sharp focus", "8k", "award-winning"
Avoid abstract adjectives alone: "beautiful" is weak; describe what makes it beautiful

Notes

Never embed API keys in scripts; always read from environment variables
Log prompt + seed + parameters for reproducibility
Respect content policies of each image generation provider

3.4 KiB Raw Blame History