[Image Generation] Explore Nano Banana Pro capabilities for improved visual output #31

@madjin

Summary

Research into Nano Banana Pro (Gemini 3 Pro Image) turned up several features that could benefit our poster generation pipeline. This issue tracks exploring them.

Resources

Capabilities Worth Exploring

Feature                 | Potential Use Case
Google Search grounding | Accurate brand/project logos, current events imagery
Thinking mode           | Complex multi-element compositions
Multi-turn editing      | Iterative style refinement
Text rendering          | Infographic-style posters, data visualizations
Transparent backgrounds | Cleaner icon/asset generation

Prompting Patterns to Test

The awesome-nanobanana-pro collection reports strong results with these techniques:

  1. Narrative paragraph prompts vs keyword lists
  2. Camera/lens specifications for consistent rendering
  3. Lighting descriptions for photorealistic outputs
  4. Era-specific aesthetic triggers
  5. Explicit composition guidance
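
To make pattern 1 testable, it may help to render the same poster details both ways so that any difference in output comes from prompt style alone. The sketch below is only an illustration: `PosterSpec`, its field names, and the example values are hypothetical and not taken from our scripts, and it builds prompt strings without calling any image API.

```python
from dataclasses import dataclass


@dataclass
class PosterSpec:
    """Hypothetical description of one poster; field names are illustrative."""
    subject: str
    camera: str       # pattern 2: camera/lens specification
    lighting: str     # pattern 3: lighting description
    era: str          # pattern 4: era-specific aesthetic trigger
    composition: str  # pattern 5: explicit composition guidance


def keyword_prompt(spec: PosterSpec) -> str:
    """Baseline: the same details joined into a keyword list."""
    parts = [spec.subject, spec.camera, spec.lighting, spec.era, spec.composition]
    return ", ".join(parts)


def narrative_prompt(spec: PosterSpec) -> str:
    """Pattern 1: the same details written as a short narrative paragraph."""
    return (
        f"A poster showing {spec.subject}. "
        f"It is shot with {spec.camera} and lit by {spec.lighting}. "
        f"The overall look follows a {spec.era} aesthetic. "
        f"Composition: {spec.composition}."
    )


# Example values are made up for illustration only.
spec = PosterSpec(
    subject="a dashboard summarizing weekly ElizaOS dev updates",
    camera="a 35mm lens at f/2.8",
    lighting="soft window light from the left",
    era="1970s print-magazine",
    composition="title in the top third with the subject centered",
)

print(keyword_prompt(spec))
print(narrative_prompt(spec))
```

Feeding both strings to the same model with the same settings would give a like-for-like comparison of narrative vs keyword prompting.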

Exploration Tasks

  • Test Google Search grounding for brand logo accuracy
  • Compare narrative vs structured prompts for our use cases
  • Experiment with thinking mode for complex scenes
  • Test multi-turn editing for style iteration
  • Document what works best for our content (ElizaOS dev updates)
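
One possible way to keep these tasks comparable is to enumerate the test conditions up front and record a rating for each run. The sketch below is an assumption about how that log could look, not an existing part of the pipeline: the condition lists, column names, and the `build_runs` / `to_csv` helpers are all hypothetical, and no image generation call is made.

```python
import csv
import io
from itertools import product

# Hypothetical condition lists; adjust to whatever we actually decide to test.
PROMPT_STYLES = ["narrative", "structured"]
SEARCH_GROUNDING = [False, True]
SCENES = ["single-logo icon", "multi-element poster"]


def build_runs():
    """Return one record per combination of conditions, with empty result fields."""
    runs = []
    combos = product(PROMPT_STYLES, SEARCH_GROUNDING, SCENES)
    for run_id, (style, grounded, scene) in enumerate(combos, start=1):
        runs.append({
            "run_id": run_id,
            "prompt_style": style,
            "search_grounding": grounded,
            "scene": scene,
            "rating": "",  # filled in by hand after reviewing the image
            "notes": "",
        })
    return runs


def to_csv(runs):
    """Serialize the run records to CSV text for sharing in the issue or repo."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(runs[0].keys()))
    writer.writeheader()
    writer.writerows(runs)
    return buf.getvalue()


runs = build_runs()
print(to_csv(runs))
```

Filling in `rating` and `notes` per row would also cover the last task, documenting what works best for our content.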

Related Files

  • scripts/posters/generate-icons.py - Icon generation
  • scripts/posters/generate-ai-image.py - Daily poster generation
  • scripts/posters/config/style-presets.json - Existing style templates

Priority: Low - exploratory research for future improvements
