google/nano-banana-pro
Google's state of the art image generation and editing model 🍌🍌
Pricing
google/
nano-banana-pro
Pricing for Synexa AI models works differently from other providers. Instead of being billed by time, you are billed by input and output, making pricing more predictable.
For example, generating 100 images should cost around $10.00.
Check out our docs for more information about how per-request pricing works on Synexa.
| Provider | Price ($) | Saving (%) |
|---|---|---|
| Synexa | $0.1000 | - |
| replicate | $0.1500 | 33.3% |
| fal | $0.1500 | 33.3% |
Readme
Gemini 3 Pro Image, consumer-facing as Nano Banana Pro, is Google DeepMind's state-of-the-art image generation and editing model built on Gemini 3 Pro. It creates high-fidelity visuals with legible text in multiple languages, connects to real-time information through Google Search grounding, and provides studio-quality control over every aspect of your images.
What You Can Do
Create Images with Accurate, Legible Text
Gemini 3 Pro Image excels at rendering clear, stylized text directly in images. Generate posters, mockups, infographics, menus, and diagrams with typography in multiple languages. The model handles depth and nuance, creating text with varied textures, fonts, and calligraphy styles integrated naturally into compositions.
Generate Context-Rich Visuals from Real-World Knowledge
The model leverages Gemini 3 Pro's advanced reasoning capabilities to create accurate educational content, infographics, and data visualizations. With Google Search grounding enabled, it can access real-time information like weather data, stock charts, sports scores, or recent events, then visualize that information with factual accuracy.
Blend Multiple Images with Consistent Results
Combine up to 14 images in a single composition while maintaining visual consistency and accurate representation of up to 5 people. This makes it effective for turning sketches into product prototypes, creating lifestyle scenes, or building surreal compositions from multiple visual elements.
Exercise Professional Creative Control
Gemini 3 Pro Image offers advanced editing capabilities including adjusting camera angles, changing scene lighting (day to night transformations), applying color grading, modifying depth of field, and editing specific regions while preserving the rest of the image. Generate images in various aspect ratios (1:1, 2:3, 3:2, 3:4, 4:3, 4:5, 5:4, 9:16, 16:9, 21:9) at resolutions up to 4K (1K, 2K, or 4K).
Example Use Cases
Typography and Branding
Create logos where letters convey meaning visually, generate posters with retro screen-printed textures, or build city scenes where buildings form letters that spell words.
Multilingual Content
Generate text in one language, then translate it to another while maintaining all other visual elements. Localize marketing materials, posters, or product packaging efficiently.
Educational Content
Transform handwritten notes into diagrams, create step-by-step infographics for recipes or tutorials, or generate detailed educational explainers about scientific concepts, plants, animals, or historical events.
Product Mockups and Prototypes
Blend sketches with product photos, create photorealistic renderings from blueprints, or generate lifestyle product shots with consistent branding across different settings and environments.
Creative Transformations
Change image aspect ratios while keeping subjects properly positioned, apply dramatic lighting effects with chiaroscuro techniques, shift focus to specific elements through depth of field adjustments, or transform mood by adjusting time of day and atmospheric conditions.
How It Works
Gemini 3 Pro Image uses Gemini 3 Pro's state-of-the-art reasoning and real-world knowledge to understand creative intent. When you provide a prompt, the model employs a "thinking" process to reason through complex instructions, considering context, spatial relationships, composition, and style. It may generate interim "thought images" internally to refine the composition before producing the final high-quality output.
The model's multilingual capabilities come from Gemini 3 Pro's enhanced language understanding, enabling accurate text rendering across different writing systems. When Google Search grounding is enabled, the model can verify facts and access current information for data-driven visualizations and infographics.
The model is available through multiple Google products including the Gemini app (using "Thinking" mode for image generation), Google AI Studio, Vertex AI for enterprise users, and Google Ads.
Important Considerations
Accuracy and Limitations
While highly capable, Gemini 3 Pro Image may occasionally struggle with small faces, precise spelling in complex layouts, and fine details. When generating infographics or data visualizations with real-world information, always verify the factual accuracy of outputs, as the model's knowledge, while extensive, is not infallible.
Multilingual Performance
Text generation is robust across many languages, but you may occasionally encounter issues with grammar, spelling, cultural nuances, or idiomatic phrases in specific languages.
Advanced Features
Complex operations like masked editing, major lighting transformations, or blending many images can sometimes produce visual artifacts, unnatural results, or disjointed scenes. Character consistency across multiple images is generally reliable but not perfect in every instance.
Content Provenance
All images generated or edited by Gemini 3 Pro Image include imperceptible SynthID digital watermarks, Google's technology for identifying AI-generated content. These watermarks are embedded directly into the image pixels and cannot be removed through cropping or screenshots, helping combat misinformation and enabling content verification.