muapi-nano-banana
Reasoning-driven image generation using structured creative briefs (Gemini 3 style) — generates high-fidelity images via muapi.ai with logic-based prompting
Packaged view
This page reorganizes the original catalog entry around fit, installability, and workflow context first. The original raw source lives below.
Install command
npx @skill-hub/cli install samuraigpt-generative-media-skills-nano-banana
Repository
Skill path: library/visual/nano-banana
Reasoning-driven image generation using structured creative briefs (Gemini 3 style) — generates high-fidelity images via muapi.ai with logic-based prompting
Open repositoryBest for
Primary workflow: Analyze Data & AI.
Technical facets: Full Stack, Data / AI.
Target audience: everyone.
License: Unknown.
Original source
Catalog source: SkillHub Club.
Repository owner: SamurAIGPT.
This is still a mirrored public skill entry. Review the repository before installing into production workflows.
What it helps with
- Install muapi-nano-banana into Claude Code, Codex CLI, Gemini CLI, or OpenCode workflows
- Review https://github.com/SamurAIGPT/Generative-Media-Skills before adding muapi-nano-banana to shared team environments
- Use muapi-nano-banana for development workflows
Works across
Favorites: 0.
Sub-skills: 0.
Aggregator: No.
Original source / Raw SKILL.md
---
name: muapi-nano-banana
version: 0.1.0
description: Reasoning-driven image generation using structured creative briefs (Gemini 3 style) — generates high-fidelity images via muapi.ai with logic-based prompting
---
# 🍌 Nano-Banana Expert Skill (Gemini 3 Style)
**A specialized skill for AI Agents to leverage "Reasoning-Driven" image generation.**
Based on the advanced prompting architecture of Google's Gemini 3 (Nano Banana Pro), this skill moves beyond keyword stuffing to structured, logic-based creative briefs.
## Core Competencies
1. **Reasoning-Driven Prompting**: Using natural language logic to define physics, lighting, and spatial relationships.
2. **Structured Creative Briefs**: Implementing the "Perfect Prompt" formula: `Subject + Action + Context + Composition + Lighting`.
3. **Text Rendering Precision**: Explicitly defining typography and signifiers for legible text integration.
4. **Contextual Grounding**: Using "Search Grounding" logic (simulated) to anchor generations in real-world accuracy.
---
## 🏗️ Technical Specification
### 1. The "Perfect Prompt" Formula
| Component | Description | Example |
| :--- | :--- | :--- |
| **Subject** | Detailed entity description | "A stoic robot barista with exposed copper wiring" |
| **Action** | Dynamic interaction | "Pouring a latte art leaf with mechanical precision" |
| **Context** | Environment & Atmosphere | "Inside a neon-lit cyberpunk cafe at midnight" |
| **Composition** | Camera & Lens choice | "Close-up, 85mm lens, f/1.8 aperture" |
| **Lighting** | Mood & Direction | "Volumetric blue rim light, warm cafe glow" |
| **Style** | Aesthetic anchor | "Cinematic, photorealistic, 4K production value" |
### 2. Advanced Features
- **Negative Constraint Logic**: Instead of "no blurry," use "Ensure sharp focus on the subject's eyes."
- **Identity Consistency**: (Simulated) "Maintain consistent facial structure across variations."
- **Text Integration**: Use double quotes for specific text: `The sign reads "OPEN 24/7"`.
---
## 🧠 Prompt Optimization Protocol (Agent Instruction)
**Before calling the script, the Agent MUST rewrite the user's prompt into a logic-driven Reasoning Brief:**
1. **NO KEYWORD SOUP**: Remove "8k, masterpiece, ultra-detailed." Use full, descriptive sentences.
2. **PHYSICAL CONSISTENCY**: Describe how elements interact (e.g., "The light from the crystal shards casts caustic patterns across the obsidian floor").
3. **TEXT PRECISION**: If the user wants text, define it precisely: `featuring a sign that says "STORE NAME" in a weathered serif font`.
4. **OPTICAL DIRECTIVES**: Specify lens behavior: *Shallow Depth of Field (f/1.8)*, *Macro Lens*, *Anamorphic Flare*.
---
## 🚀 Protocol: Using Nano-Banana
### Step 1: Define the Creative Logic
Provide the agent with a subject and a specific scenario.
### Step 2: Invoke the Script
The `generate-nano-art.sh` script translates the logic into a structured Gemini 3-style prompt.
```bash
# Generating a reasoning-driven image
bash scripts/generate-nano-art.sh \
--subject "a glass chess piece" \
--action "shattering into liquid shards" \
--context "on a obsidian table" \
--style "macro photography"
```
---
## ⚠️ Constraints & Guardrails
- **No Keyword Soup**: **MANDATORY** - Do not use "trending on artstation, masterpiece, 8k". Use natural language descriptions.
- **Physics Logic**: Ensure the prompt describes *physically possible* lighting and reflection interactions.
- **Full Sentences**: The model parses relationships; use "light reflecting off the water" instead of "water, reflection".
---
## ⚙️ Implementation Details
This skill applies a "Logic Wrapper" around the `core/media/generate-image.sh` primitive, converting fragmented inputs into a coherent, reasoning-ready narrative prompt.
---
## Referenced Files
> The following files are referenced in this skill and included for context.
### scripts/generate-nano-art.sh
```bash
#!/bin/bash
# Expert Skill: Nano-Banana (Gemini 3 Style)
# Implements the "Perfect Prompt" reasoning-driven formula.
SUBJECT=""
ACTION=""
CONTEXT=""
STYLE="cinematic"
LIGHTING="natural"
RESOLUTION="1k"
TEXT=""
VIEW_FLAG=""
while [[ $# -gt 0 ]]; do
case $1 in
--subject) SUBJECT="$2"; shift 2 ;;
--action) ACTION="$2"; shift 2 ;;
--context) CONTEXT="$2"; shift 2 ;;
--style) STYLE="$2"; shift 2 ;;
--lighting) LIGHTING="$2"; shift 2 ;;
--resolution) RESOLUTION="$2"; shift 2 ;;
--text) TEXT="$2"; shift 2 ;;
--view) VIEW_FLAG="--view"; shift ;;
*) shift ;;
esac
done
if [ -z "$SUBJECT" ]; then
echo "Usage: bash generate-nano-art.sh --subject 'robot' [--action 'pouring coffee'] [--context 'cafe'] [--style 'photorealistic'] [--resolution 1k|2k|4k] [--text 'CAFE'] [--view]"
exit 1
fi
# Text Logic
TEXT_PROMPT=""
if [ -n "$TEXT" ]; then
TEXT_PROMPT="featuring a sign that clearly reads \"$TEXT\" in bold typography"
fi
# Nano-Banana Reasoning-Driven Prompt
# Formula: Subject + Action + Context + Lighting + Style + Text
EXPERT_PROMPT="[REASONING_BRIEF]
SUBJECT: A highly detailed $SUBJECT.
ACTION: The subject is $ACTION.
ENVIRONMENT: Situated in $CONTEXT.
LIGHTING: Illuminated by $LIGHTING lighting.
STYLE: Use a $STYLE aesthetic. High fidelity, physically accurate reflections and textures.
EXTRA: $TEXT_PROMPT.
[EXECUTE] Generate a photorealistic image based on this logic. Ensure precise composition and lighting interactions."
# Call Core Primitive
SCRIPT_DIR="$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )"
CORE_SCRIPT="$SCRIPT_DIR/../../../../core/media/generate-image.sh"
bash "$CORE_SCRIPT" --prompt "$EXPERT_PROMPT" --model "nano-banana-pro" --resolution "$RESOLUTION" $VIEW_FLAG --json
```