Media Generation - JIGGAI Docs

Architecture Overview

Driver Registry System

ClawRecipes uses a driver registry that maps providers to skill scripts:

Provider Name → MediaDriver → Skill Script → Generated Media

Key Components:

MediaDriver — TypeScript adapter with provider-specific logic
Skill — Folder containing scripts, dependencies, and documentation
Registry — Maps provider names to drivers for runtime lookup
Worker — Executes media nodes using appropriate drivers

Execution Flow

Workflow node specifies provider (e.g., "provider": "nano-banana-pro")
Worker looks up driver in registry by slug
Driver locates skill directory and script
Script executes with prompt and config
Output file saved to workflow run media directory

Setup Instructions

Image Generation

Nano Banana Pro (Gemini)

Requirements:

GEMINI_API_KEY environment variable
ClawHub skill: nano-banana-pro

Setup:

# Install skill
clawhub install nano-banana-pro

# Set API key in OpenClaw config
openclaw gateway config update env.vars.GEMINI_API_KEY "your-gemini-api-key"

Configuration:

{
  "kind": "media-image",
  "config": {
    "provider": "nano-banana-pro",
    "size": "2048x2048",
    "promptTemplate": "{{content_draft.title}}: professional product photo"
  }
}

Supported sizes:

1024x1024 → 1K resolution
1792x1792 → 2K resolution
3840x3840 → 4K resolution

DALL-E (OpenAI)

Requirements:

OPENAI_API_KEY environment variable
ClawHub skill: openai-dalle

Setup:

clawhub install openai-dalle
openclaw gateway config update env.vars.OPENAI_API_KEY "sk-your-openai-key"

Configuration:

{
  "config": {
    "provider": "openai-dalle",
    "size": "1024x1024",
    "quality": "hd",
    "style": "natural"
  }
}

Supported options:

size: 1024x1024, 1024x1792, 1792x1024
quality: standard, hd
style: natural, vivid

Video Generation

Kling AI

Requirements:

Kling AI credentials file (NOT environment variables)
ClawHub skill: klingai

Setup:

# Install skill  
clawhub install klingai --force

# Configure credentials
mkdir -p ~/.config/kling
cat > ~/.config/kling/.credentials << EOF
{
  "access_key_id": "your-access-key",
  "secret_access_key": "your-secret-key"
}
EOF

Configuration:

{
  "kind": "media-video",
  "config": {
    "provider": "klingai",
    "duration": "10",
    "aspect_ratio": "16:9",
    "promptTemplate": "{{content_brief.style}}: {{content_brief.video_description}}"
  }
}

Constraints:

Duration: 3-15 seconds
Aspect ratios: 16:9, 9:16, 1:1
Mode: pro (fixed)

Runway

Requirements:

RUNWAYML_API_SECRET environment variable
ClawHub skill: runway-video

Setup:

clawhub install runway-video
openclaw gateway config update env.vars.RUNWAYML_API_SECRET "your-runway-secret"

Configuration:

{
  "config": {
    "provider": "runway-video", 
    "duration": "8",
    "size": "1280x768",
    "addRefinement": "true"
  }
}

Luma AI

Requirements:

LUMAAI_API_KEY environment variable
ClawHub skill: luma-video

Setup:

clawhub install luma-video
openclaw gateway config update env.vars.LUMAAI_API_KEY "luma-your-api-key"

Configuration:

{
  "config": {
    "provider": "luma-video",
    "duration": "5",
    "aspect_ratio": "16:9"
  }
}

Configuration Fields

Core Fields

promptTemplate — Template string with variable substitution

"promptTemplate": "Create {{media_type}} for: {{content_draft.title}}\nStyle: {{brand_guide.style}}"

provider — Driver slug (matches skill folder name)

"provider": "nano-banana-pro"

outputPath — Custom output file path (optional)

"outputPath": "media/{{run.id}}/hero-{{content_draft.slug}}.png"

Size Configuration

For Images:

"size": "1024x1024"
"size": "1792x1792"  
"size": "3840x3840"

For Videos:

"aspect_ratio": "16:9"
"aspect_ratio": "9:16"
"aspect_ratio": "1:1"
"size": "1280x768"

Duration Configuration (Videos)

"duration": "5"      // seconds as string
"duration": "10"     // clamp to provider limits

Provider Constraints:

Kling AI: 3-15 seconds
Runway: 1-10 seconds
Luma AI: 2-10 seconds

addRefinement — Enable LLM prompt enhancement (opt-in)

"addRefinement": "true"

When enabled:

Input prompt processed by LLM for enhancement
Enhanced prompt sent to media provider
Results in more detailed, production-ready prompts

Default: false (upstream LLM nodes should produce ready prompts)

Environment Variables

Loading Hierarchy

ClawRecipes loads environment variables from:

Process environment — process.env (highest priority)
OpenClaw config — ~/.openclaw/openclaw.json → env.vars

OpenClaw Config Format

Modern format: (recommended)

{
  "env": {
    "vars": {
      "GEMINI_API_KEY": "your-key",
      "OPENAI_API_KEY": "your-key",
      "RUNWAYML_API_SECRET": "your-secret"
    }
  }
}

Legacy format: (still supported)

{
  "env": {
    "GEMINI_API_KEY": "your-key"
  }
}

Setting Environment Variables

# Via OpenClaw CLI
openclaw gateway config update env.vars.GEMINI_API_KEY "your-key"

# Via direct edit
$EDITOR ~/.openclaw/openclaw.json

# Via shell environment
export GEMINI_API_KEY="your-key"
openclaw gateway restart

Skill Installation

ClawHub Installation

# Global installation (recommended)
clawhub install nano-banana-pro
clawhub install klingai --force

# Agent-specific installation
openclaw recipes install-skill nano-banana-pro --agent-id marketing-lead

# Team-specific installation  
openclaw recipes install-skill nano-banana-pro --team-id marketing-team

Installation Roots

Skills are discovered from these directories:

~/.openclaw/skills/ — Global shared skills
~/.openclaw/workspace/skills/ — Workspace-local skills
~/.openclaw/workspace/ — ClawHub sometimes installs here

Verification

# List available drivers
openclaw recipes workflows media-drivers

# Expected output:
[
  {
    "slug": "nano-banana-pro",
    "displayName": "Nano Banana Pro (Gemini Image Generation)",
    "mediaType": "image", 
    "requiredEnvVars": ["GEMINI_API_KEY"],
    "available": true,
    "missingEnvVars": []
  }
]

Template Variables

Media nodes support full template variable substitution in:

promptTemplate — AI generation prompt
outputPath — Custom file paths

Available Variables

Global variables:

{{date}}             — Current timestamp
{{run.id}}           — Workflow run ID
{{workflow.name}}    — Workflow display name
{{node.id}}          — Current node ID

Upstream node outputs:

{{content_draft.text}}        — Text from LLM node
{{brand_guide.style}}         — Extracted JSON field
{{research_brief.video_concept}} — Nested field extraction

Example Templates

Product marketing image:

{
  "promptTemplate": "Professional product photo: {{product_brief.name}}\n\nStyle: {{brand_guide.visual_style}}\nMood: {{campaign_goals.target_emotion}}\nComposition: {{art_direction.composition_notes}}"
}

Social video:

{
  "promptTemplate": "Short social media video: {{content_calendar.post_topic}}\n\nHook: {{copywriting.video_hook}}\nVisual style: {{brand_assets.video_style}}\nDuration: energetic {{duration}}s clip"
}

Troubleshooting

Driver Not Found

Error:

No media driver found for provider "nano-banana-pro"

Diagnosis:

# Check if skill is installed
ls ~/.openclaw/skills/nano-banana-pro
ls ~/.openclaw/workspace/skills/nano-banana-pro

# Verify driver registry
openclaw recipes workflows media-drivers | grep nano-banana-pro

Solutions:

Install missing skill: clawhub install nano-banana-pro
Check skill directory permissions
Restart gateway if skill was just installed

Missing Environment Variables

Error:

{
  "slug": "nano-banana-pro",
  "available": false,
  "missingEnvVars": ["GEMINI_API_KEY"]
}

Diagnosis:

# Check current environment
echo $GEMINI_API_KEY

# Check OpenClaw config
cat ~/.openclaw/openclaw.json | jq '.env.vars'

Solutions:

Set via config: openclaw gateway config update env.vars.GEMINI_API_KEY "your-key"
Export in shell before starting gateway
Restart gateway after config changes

Script Execution Failures

Error:

Script execution failed: nano-banana-pro generate_image.py
--- stderr ---
ModuleNotFoundError: No module named 'google.generativeai'

Diagnosis:

# Check skill venv
ls ~/.openclaw/skills/nano-banana-pro/.venv/

# Test script manually
cd ~/.openclaw/skills/nano-banana-pro
./.venv/bin/python scripts/generate_image.py --help

Solutions:

Reinstall skill: clawhub install nano-banana-pro --force
Manually setup venv: cd skill && python -m venv .venv && .venv/bin/pip install -r requirements.txt
Check skill documentation for dependencies

Prompt Too Long

Error:

400 Bad Request: prompt exceeds maximum length (4000 characters)

Solutions:

Enable refinement: "addRefinement": "true" — let LLM condense the prompt
Shorten templates: Remove verbose instructions from prompt template
Upstream editing: Have prior LLM nodes produce concise briefs

Output Path Issues

Error:

fs.write path must be within the team workspace

Solutions:

Use relative paths: "outputPath": "media/{{node.id}}.png"
Don’t include ../ in paths
Paths are resolved relative to workflow run directory

Permission Errors

Error:

EACCES: permission denied, open '/home/control/.openclaw/skills/nano-banana-pro'

Solutions:

Fix ownership: sudo chown -R control:control ~/.openclaw/
Check directory permissions: chmod 755 ~/.openclaw/skills/
Reinstall skill if corrupted

Timeout Issues

Error:

Script execution timeout (300000ms)

Solutions:

Increase timeout: Add "timeoutMs": 600000 to node config
Check API status: Verify provider service availability
Reduce complexity: Simplify prompts for faster generation

Advanced Configuration

Custom Output Directories

{
  "config": {
    "outputPath": "assets/{{workflow.name}}/{{run.id}}/hero.png"
  }
}

Creates: workspace-team/assets/Content Pipeline/2025-04-04T03-53-00-123Z/hero.png

Model Selection (Provider-Specific)

Some drivers support model selection:

{
  "config": {
    "provider": "openai-dalle",
    "model": "dall-e-3",
    "quality": "hd"
  }
}

Provider Fallbacks

Use LLM nodes to implement fallback logic:

{
  "action": {
    "promptTemplate": "Generate an image using primary provider: {{image_config.primary_provider}}\n\nIf unavailable, fallback to: {{image_config.fallback_provider}}"
  }
}

MEDIA_DRIVERS.md — Developer guide for adding new providers
TEMPLATE_VARIABLES.md — Template variable syntax and examples
WORKFLOW_NODES.md — Complete node configuration reference
INSTALLATION.md — Installing ClawRecipes and dependencies

Implementation Details

Core Code Locations:

src/lib/workflows/media-drivers/ — Driver implementations
src/lib/workflows/workflow-worker.ts — Media node execution
src/handlers/media-drivers.ts — CLI media-drivers command

Driver Interface:

MediaDriver — TypeScript interface for all providers
MediaDriverInvokeOpts — Standardized invocation parameters
MediaDriverResult — Standardized return format

Registry System:

Known drivers registered in registry.ts
Generic driver auto-discovery for unlisted skills
Runtime environment variable availability checking

Documentation Index

​Architecture Overview

​Driver Registry System

​Execution Flow

​Setup Instructions

​Image Generation

​Nano Banana Pro (Gemini)

​DALL-E (OpenAI)

​Video Generation

​Kling AI

​Runway

​Luma AI

​Configuration Fields

​Core Fields

​Size Configuration

​Duration Configuration (Videos)

​Prompt Refinement

​Environment Variables

​Loading Hierarchy

​OpenClaw Config Format

​Setting Environment Variables

​Skill Installation

​ClawHub Installation

​Installation Roots

​Verification

​Template Variables

​Available Variables

​Example Templates

​Troubleshooting

​Driver Not Found

​Missing Environment Variables

​Script Execution Failures

​Prompt Too Long

​Output Path Issues

​Permission Errors

​Timeout Issues

​Advanced Configuration

​Custom Output Directories

​Model Selection (Provider-Specific)

​Provider Fallbacks

​Related Documentation

​Implementation Details

Architecture Overview

Driver Registry System

Execution Flow

Setup Instructions

Image Generation

Nano Banana Pro (Gemini)

DALL-E (OpenAI)

Video Generation

Kling AI

Runway

Luma AI

Configuration Fields

Core Fields

Size Configuration

Duration Configuration (Videos)

Prompt Refinement

Environment Variables

Loading Hierarchy

OpenClaw Config Format

Setting Environment Variables

Skill Installation

ClawHub Installation

Installation Roots

Verification

Template Variables

Available Variables

Example Templates

Troubleshooting

Driver Not Found

Missing Environment Variables

Script Execution Failures

Prompt Too Long

Output Path Issues

Permission Errors

Timeout Issues

Advanced Configuration

Custom Output Directories

Model Selection (Provider-Specific)

Provider Fallbacks

Related Documentation

Implementation Details