This page is taken from the examples/live/ directory and refers to the code there.
Live Illustrator 🏞️
This agent creates illustrations in real time for the audio narration captured by your laptop’s microphone. Whether you're in a meeting, playing DnD, or reading a story to your kids, it generates illustrations automatically based on the conversation. It uses Gemini flash for the listener and Nano Banana (regular or pro) for generating visuals. To improve consistency, when a new character or key object / location are mentioned the agent first generates a "concept art", which is then used as a reference when generating illustrations.
This demo makes a good use of async functions / tools which have been added to Genai Processors v2.0. This way image generation does not block the listener model and the agent can keep up with the narration, working on multiple illustrations in parallel, if needed.
Note: The agent generates a high volume of images, which may lead to
throttling by Nano Banana or high token consumption. You can control the
interval between image generations in the UI; we recommend an initial setting of
30 seconds. Please note that image generation will be slower at this rate. Check
the logs for the error HTTP/1.1 429 Too Many Requests. If you see this while
on the free tier, your requests are being blocked. While the program will
continue to retry, illustrations may arrive significantly slower than usual—or
not at all if the system is saturated.
🚀 How to run it
This example comes with a web UI based on AI Studio Applets. To run it:
-
Install the dependencies with:
-
Define a
GOOGLE_API_KEYenvironment variable with your API key (we need it to access to Google GenAI models). -
go to the directory
live_illustratorand launch the illustrator agent: -
Access the applet at https://aistudio.google.com/app/apps/github/google-gemini/genai-processors/tree/main/examples/live_illustrator/ais_app.
Tip: You can set the style in the System Instruction input field (e.g. "watercolor style" or “create images with the New Yorker cartoonish style”).
Use the "Share" button to copy the generated illustrations. They can be pasted into a Google doc for sharing.
☝️ Regarding Quality
This is not a replacement for a professional artist. Upon closer look you may find many instances where illustrations are not consistent with the story or each other. Drawing proper illustrations takes time. Instead we focused on a use case where previously illustrations were infeasible, almost impossible.