nano-banana

Generate, edit, and compose images using Google's Gemini 3 Pro Image model (Nano Banana Pro). Use this skill when the user asks to create images, generate visuals, edit photos, compose multiple images, create logos, thumbnails, infographics, product shots, or any image generation task. Supports text-to-image, image editing, multi-image composition (up to 14 images), iterative refinement, aspect ratio control, and Google Search-grounded image generation for real-time data visualization.

0étoiles

0forks

Mis à jour 1/21/2026

Obtenir Code Source

SKILL.md

readonlyread-only

name

nano-banana

description

Nano Banana Pro

Image generation skill powered by Google's Gemini 3 Pro Image model. Enables text-to-image generation, image editing, multi-image composition, and real-time data visualization.

Requirements

GEMINI_API_KEY environment variable set
Python packages: google-genai, Pillow

Install dependencies:

pip install -r requirements.txt

Quick Start

Generate an Image

python scripts/generate_image.py "A cat wearing a wizard hat" cat.png

Edit an Existing Image

python scripts/edit_image.py photo.png "Add a sunset to the background" edited.png

Compose Multiple Images

python scripts/compose_images.py "Create a group photo in an office" team.png person1.png person2.png

Available Scripts

Script	Purpose
`generate_image.py`	Text-to-image generation
`edit_image.py`	Edit/modify existing images
`compose_images.py`	Combine up to 14 reference images
`chat_image.py`	Interactive multi-turn refinement
`search_grounded_image.py`	Generate images with real-time search data

Generation Options

Aspect Ratios

1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9

Resolutions

1K (1024px), 2K, 4K

Usage with Options

python scripts/generate_image.py "Futuristic motorcycle on Mars" mars.png --aspect 16:9 --size 4K

Task Workflows

Logo Generation

python scripts/generate_image.py "Clean black-and-white logo with text 'Daily Grind', sans-serif font, coffee bean icon, minimalist style" logo.png --aspect 1:1

Product Mockup

python scripts/generate_image.py "Studio-lit product photo on polished concrete, 3-point softbox, 45-degree angle, professional e-commerce style" product.png --aspect 4:3 --size 4K

Photorealistic Portrait

python scripts/generate_image.py "A photorealistic close-up portrait, shot on 85mm lens, golden hour lighting, shallow depth of field, cinematic" portrait.png --size 4K

Stylized Art (Anime/Sticker)

python scripts/generate_image.py "A kawaii red panda sticker, bold outlines, cel-shading, white background, cute expression" sticker.png

Iterative Design Refinement

Use the chat script for back-and-forth refinement:

python scripts/chat_image.py

Then interact:

> Create a logo for 'Acme Corp'
[Image generated]
> Make the text bolder and add a blue gradient
[Refined image]
> save acme_logo.png

Real-Time Data Visualization

Generate infographics with current data:

python scripts/search_grounded_image.py "Visualize today's weather in Tokyo as an infographic" tokyo_weather.png --aspect 9:16

Use cases:

Live stock-market infographics
Breaking-news visuals
Weather dashboards
Current event visualizations

Multi-Image Composition

Combine reference images:

python scripts/compose_images.py "Create a product comparison shot with these items side by side, professional lighting" comparison.png item1.png item2.png item3.png --aspect 16:9

Use cases:

Product comparison shots
Character sheets
Team photos
Style-consistent image series

Inline Python Usage

For integration in larger scripts:

import os
from google import genai
from google.genai import types

client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])

response = client.models.generate_content(
    model="gemini-3-pro-image-preview",
    contents=["A serene mountain landscape at dawn"],
    config=types.GenerateContentConfig(
        response_modalities=['TEXT', 'IMAGE'],
        image_config=types.ImageConfig(
            aspect_ratio="16:9",
            image_size="2K"
        )
    )
)

for part in response.parts:
    if part.inline_data:
        image = part.as_image()
        image.save("landscape.png")

Editing with Inline Code

from PIL import Image
from google import genai
from google.genai import types

client = genai.Client(api_key=os.environ["GEMINI_API_KEY"])
img = Image.open("input.png")

response = client.models.generate_content(
    model="gemini-3-pro-image-preview",
    contents=["Add dramatic clouds to the sky", img],
    config=types.GenerateContentConfig(
        response_modalities=['TEXT', 'IMAGE']
    )
)

Prompting Tips

Photorealistic: Include camera settings, lighting, lens details

"Shot on 85mm lens, golden hour lighting, shallow depth of field"

Logos: Specify style, colors, typography

"Clean minimalist logo, sans-serif font, monochrome, vector style"

Product shots: Describe studio setup

"Studio-lit, 3-point softbox, polished surface, 45-degree angle"

Stylized art: Name the style explicitly

"Anime style, cel-shading, bold outlines, vibrant colors"

Related Skills

songsee

88Kdesign

Generate spectrograms and feature-panel visualizations from audio with the songsee CLI.

moltbot

Obtenir

slack-gif-creator

48Kdesign

Knowledge and utilities for creating animated GIFs optimized for Slack. Provides constraints, validation tools, and animation concepts. Use when users request animated GIFs for Slack like "make me a GIF of X doing Y for Slack."

anthropics

Obtenir

algorithmic-art

48Kdesign

Creating algorithmic art using p5.js with seeded randomness and interactive parameter exploration. Use this when users request creating art using code, generative art, algorithmic art, flow fields, or particle systems. Create original algorithmic art rather than copying existing artists' work to avoid copyright violations.

anthropics

Obtenir

brand-guidelines

47Kdesign

Applies Anthropic's official brand colors and typography to any sort of artifact that may benefit from having Anthropic's look-and-feel. Use it when brand colors or style guidelines, visual formatting, or company design standards apply.

anthropics

Obtenir

theme-factory

47Kdesign

Toolkit for styling artifacts with a theme. These artifacts can be slides, docs, reportings, HTML landing pages, etc. There are 10 pre-set themes with colors/fonts that you can apply to any artifact that has been creating, or can generate a new theme on-the-fly.

anthropics

Obtenir

canvas-design

47Kdesign

Create beautiful visual art in .png and .pdf documents using design philosophy. You should use this skill when the user asks to create a poster, piece of art, design, or other static piece. Create original visual designs, never copying existing artists' work to avoid copyright violations.

anthropics

Obtenir

nano-banana

Nano Banana Pro

Requirements

Quick Start

Generate an Image

Edit an Existing Image

Compose Multiple Images

Available Scripts

Generation Options

Aspect Ratios

Resolutions

Usage with Options

Task Workflows

Logo Generation

Product Mockup

Photorealistic Portrait

Stylized Art (Anime/Sticker)

Iterative Design Refinement

Real-Time Data Visualization

Multi-Image Composition

Inline Python Usage

Editing with Inline Code

Prompting Tips

You Might Also Like

Related Skills

songsee

slack-gif-creator

algorithmic-art

brand-guidelines

theme-factory

canvas-design