- Gemini app / Google AI Studio (easiest, GUI): Open the Gemini app or Google AI Studio → choose image generation or image editing → upload a photo (optional) → type a natural-language instruction and hit generate/edit. Gemini 2.5 Flash Image (nicknamed "Nano Banana") is built for this kind of prompt-driven generation and editing.
- Third-party web UIs that host the model (for quick experiments): Krea, PicLumen, Fal.ai and others expose Nano-Banana-style editing tools. You upload an image, type a prompt, and they run the model for you.
- Programmatic / API use (for batch generation or app integration): Google's developer tools and some third-party inference platforms expose the model through endpoints or wrappers. Check the official Google docs or your provider's docs for current details, and use this route for automation or for integrating into a web app.
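For the programmatic route, here is a minimal sketch of what a request could look like. It only builds the URL and JSON body and sends nothing, so it runs without an API key. The endpoint base, the model id, and the helper name `build_edit_request` are assumptions for illustration, not details from the text above; check the official Gemini API docs for the current values before using them.

```python
import base64
import json

# Assumptions (not from the original text): the endpoint base and model id
# follow Google's public Gemini API conventions and may have changed.
API_BASE = "https://generativelanguage.googleapis.com/v1beta/models"
MODEL_ID = "gemini-2.5-flash-image-preview"  # assumed id; verify in the docs


def build_edit_request(prompt, image_bytes=None, mime_type="image/png"):
    """Return (url, json_body) for a generateContent-style call.

    With image_bytes the request describes an edit; without it, a
    from-scratch generation. Nothing is sent over the network here.
    """
    parts = [{"text": prompt}]
    if image_bytes is not None:
        # Images travel inline as base64 text next to the prompt.
        parts.append({
            "inline_data": {
                "mime_type": mime_type,
                "data": base64.b64encode(image_bytes).decode("ascii"),
            }
        })
    url = f"{API_BASE}/{MODEL_ID}:generateContent"
    body = json.dumps({"contents": [{"parts": parts}]})
    return url, body


url, body = build_edit_request("make lighting softer", b"abc")
```

To actually run it, you would POST `body` to `url` with your API key in a request header, then decode the base64 image data from the response. Third-party providers such as Fal.ai use their own request formats, so treat this shape as specific to the first-party API.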
Step-by-step (Gemini App / Google AI Studio) — fast workflow
- Open the Gemini app or Google AI Studio and choose Image → Create or Edit.
- If editing, upload your base photo. If creating from scratch, skip the upload.
- Write a clear prompt: start with subject + style + action + scene (examples below).
- Use reference images if you want a consistent likeness or want to fuse elements from several pictures. Gemini's multi-image fusion handles that.
- Iterate: tweak the prompt ("make lighting softer", "more cinematic", "closer crop") until you like the result.
- Export / download. Note: Google may embed an invisible SynthID watermark to mark the image as AI-generated.
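If you find yourself retyping the same prompt structure, the "subject + style + action + scene" pattern from the workflow above can be sketched as a tiny helper. The function names `build_prompt` and `refine` are hypothetical, and the model does not require any particular order; this just keeps your prompts consistent while you iterate.

```python
def build_prompt(subject, style="", action="", scene="", details=()):
    """Join the prompt parts in the order subject, style, action, scene,
    then any extra details (lighting, lens, and so on).

    Empty or whitespace-only parts are skipped.
    """
    parts = [subject, style, action, scene, *details]
    return ", ".join(p.strip() for p in parts if p and p.strip())


def refine(prompt, tweak):
    """Append one iteration tweak, e.g. 'make lighting softer'."""
    return f"{prompt}, {tweak.strip()}"


prompt = build_prompt(
    "a golden retriever",
    style="photorealistic",
    action="wearing sunglasses",
    scene="on a beach towel",
    details=["50mm portrait"],
)
softer = refine(prompt, "make lighting softer")
```

Keeping the base prompt and each tweak separate also makes it easy to back out a change that made the image worse.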
Prompt recipes (copy-paste, then tweak)
- Toy-figurine / Nano-Banana viral style:
  Create a glossy 3D collectible figurine of [subject], standing in a small clear plastic display case with collectible-style packaging, studio lighting, ultra-detailed, photorealistic, shallow depth of field
- Photo edit (outfit swap + background):
  Replace the background with a neon cyberpunk street at night, change the subject's jacket to a reflective silver bomber, keep original facial features and lighting, photorealistic
- Product mockup:
  Place this uploaded product on a minimalist marble pedestal against a white background for e-commerce, soft window light from the left, shallow depth of field, high-res
Tips: be explicit about lighting, camera lens (e.g., "85mm portrait lens, f/1.8"), and small details (shadows, reflections) to get consistent results. Describe everything in plain language; Midjourney-style flags such as "--style" are not part of how this model is prompted.
Advanced tricks & consistency tips
- Use multiple reference images to keep a consistent look across different outputs (Gemini supports multi-image fusion).
- Preserve likeness: say "preserve facial features / identity" when editing people. Gemini applies safeguards here, so don't try to impersonate public figures.
- Batch / templating: for many images, write a template prompt and swap only the variable (name / clothing / color). Use the API or automation on platforms like Fal.ai to run batches.
- Post-process: upscale in an external tool if you need print resolution, and use a local editor for fine masks if the model misses tiny details.
- Safety & attribution: Google applies an invisible SynthID watermark and visible labels to AI images, which is worth knowing before you share or publish them.
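The batch / templating tip above can be sketched with Python's standard `string.Template`. The template text and the variable names (`subject`, `color`, `clothing`) are made up for illustration; each resulting prompt would then go to whichever API or provider you use.

```python
from string import Template

# Hypothetical template; only the $variables change between images.
FIGURINE_TEMPLATE = Template(
    "Create a glossy 3D collectible figurine of $subject wearing a $color "
    "$clothing, standing in a small clear plastic display case, studio lighting"
)


def expand_batch(template, rows):
    """Return one prompt per row.

    Template.substitute raises KeyError if a row is missing a variable,
    which catches incomplete rows before any image is generated.
    """
    return [template.substitute(row) for row in rows]


rows = [
    {"subject": "a chef", "color": "red", "clothing": "apron"},
    {"subject": "a pilot", "color": "navy", "clothing": "jacket"},
]
prompts = expand_batch(FIGURINE_TEMPLATE, rows)
```

Failing loudly on a missing variable is a deliberate choice here: a prompt with a leftover placeholder would still generate an image, just the wrong one.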
If you want to integrate into an app (mini checklist)
- Choose a platform: Google AI Studio / Gemini API for first-party integration, or Fal.ai and other inference providers for convenient hosted endpoints.
- Add an upload + prompt UI, show a preview, and let users iterate.
- Add usage limits, watermarking, and a consent UI for photos of people.
- Save generated images and their metadata (prompts) if you want reproducibility.
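Two of the checklist items above, saving prompts alongside images and adding usage limits, can be sketched with the standard library alone. Everything named here (`save_generation`, `DailyLimiter`, the JSON field names) is hypothetical, and the in-memory limiter is only a stand-in for whatever storage a real app would use.

```python
import hashlib
import json
import tempfile
import time
from pathlib import Path


def save_generation(out_dir, image_bytes, prompt, model="unknown", ext="png"):
    """Write the image plus a JSON sidecar holding the prompt and metadata.

    File names come from a SHA-256 of the image bytes, so saving the same
    image twice overwrites the files instead of duplicating them.
    """
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    digest = hashlib.sha256(image_bytes).hexdigest()[:16]
    image_path = out / f"{digest}.{ext}"
    meta_path = out / f"{digest}.json"
    image_path.write_bytes(image_bytes)
    meta = {
        "prompt": prompt,
        "model": model,
        "created_unix": int(time.time()),
        "image_file": image_path.name,
    }
    meta_path.write_text(json.dumps(meta, indent=2), encoding="utf-8")
    return image_path, meta_path


class DailyLimiter:
    """In-memory per-user cap. Counts are keyed by (user, day),
    so each new day starts from zero for every user."""

    def __init__(self, max_per_day):
        self.max_per_day = max_per_day
        self._counts = {}

    def allow(self, user_id, day):
        key = (user_id, day)
        used = self._counts.get(key, 0)
        if used >= self.max_per_day:
            return False
        self._counts[key] = used + 1
        return True


# Usage: placeholder bytes stand in for a real generated image.
with tempfile.TemporaryDirectory() as tmp:
    img, meta = save_generation(tmp, b"fake-image-bytes", "a cat on a sofa")
    saved = json.loads(meta.read_text(encoding="utf-8"))

limiter = DailyLimiter(max_per_day=2)
decisions = [limiter.allow("user-1", "2025-01-01") for _ in range(3)]
```

Storing the prompt next to the image lets you rerun or adjust a generation later, though the same prompt will not necessarily reproduce the same image.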
Example beginner prompt bank (use these and edit)
- A cheerful golden retriever wearing sunglasses on a beach towel, cinematic sunlight, photorealistic, 50mm portrait
- Turn this portrait into a collectible vinyl figurine in a transparent box, studio lighting, glossy finish
- Edit: remove background, place subject on plain white backdrop, keep shadows natural, high resolution