Gemini CLI offers a generous free tier that covers the use cases of many individual developers. For enterprise or professional usage, or if you need higher limits, several options are available depending on the type of account you use to authenticate.
See privacy and terms for details on the Privacy Policy and Terms of Service.
Note: published prices are list prices; additional negotiated commercial discounts may apply.
This article outlines the specific quotas and pricing applicable to the Gemini CLI when using different authentication methods.
Generally, there are three categories to choose from:
Your journey begins with a generous free tier, perfect for experimentation and light use.
Your free usage limits depend on your authorization type.
This tier applies to users who authenticate with their Google account to access Gemini Code Assist for individuals. This includes:
Learn more at Gemini Code Assist for Individuals Limits.
If you are using a Gemini API key, you can also benefit from a free tier. This includes:
Learn more at Gemini API Rate Limits.
Vertex AI offers an Express Mode without the need to enable billing. This includes:
Learn more at Vertex AI Express Mode Limits.
If you use up your initial number of requests, you can continue to benefit from Gemini CLI by upgrading to one of the following subscriptions:
Subscribe to Google AI Pro or AI Ultra by signing up at Set up Gemini Code Assist. This option is recommended for individual developers. Quotas and pricing are based on a fixed-price subscription.
For predictable costs, you can log in with Google.
Learn more at Gemini Code Assist Quotas and Limits.
Purchase a Gemini Code Assist subscription through Google Cloud by signing up in the Google Cloud console. Learn more at [Set up Gemini Code Assist](https://cloud.google.com/gemini/docs/discover/set-up-gemini). Quotas and pricing are based on a fixed-price subscription with assigned license seats. For predictable costs, you can sign in with Google.
This includes:
Learn more about Gemini Code Assist Standard and Enterprise license limits.
If you hit your daily request limits or exhaust your Gemini Pro quota even after upgrading, the most flexible solution is to switch to a pay-as-you-go model, where you pay for the specific amount of processing you use. This is the recommended path for uninterrupted access.
To do this, log in using a Gemini API key or Vertex AI.
Learn more at Vertex AI Dynamic Shared Quota and Vertex AI Pricing.
Learn more at Gemini API Rate Limits and Gemini API Pricing.
Note that when using an API key, you pay per token or per call. This can be more expensive for many small calls with few tokens, but it is the only way to ensure your workflow isn't interrupted by quota limits.
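To make the per-token billing model concrete, the following minimal Python sketch estimates the cost of a batch of requests from token counts. The `TokenPrice` class, the `estimate_cost` function, and the prices in `EXAMPLE_PRICE` are all hypothetical placeholders introduced for illustration; they are not real Gemini API prices, so substitute the current figures from the published pricing page before relying on the result.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TokenPrice:
    """Hypothetical list prices in USD per one million tokens."""
    input_per_million: float
    output_per_million: float


def estimate_cost(input_tokens: int, output_tokens: int, price: TokenPrice) -> float:
    """Return the estimated USD cost for the given token usage."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    return (
        input_tokens * price.input_per_million
        + output_tokens * price.output_per_million
    ) / 1_000_000


# ASSUMPTION: placeholder prices for illustration only, not real Gemini API prices.
EXAMPLE_PRICE = TokenPrice(input_per_million=1.00, output_per_million=4.00)

if __name__ == "__main__":
    # Example workload: 200 requests of 1,500 input and 500 output tokens each.
    requests = 200
    total = estimate_cost(requests * 1_500, requests * 500, EXAMPLE_PRICE)
    print(f"Estimated cost: ${total:.2f}")
```

Because the estimate scales linearly with token counts, running it against a typical day's usage gives a rough daily budget figure to compare with a fixed-price subscription.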
These plans currently apply only to Gemini web-based products offered through Google experiences (for example, the Gemini web app or the Flow video editor). They do not apply to the API usage that powers Gemini CLI. Support for these plans is under active consideration.
When using a pay-as-you-go API key, monitor your usage to avoid unexpected costs.
A summary of model usage is available through the /stats command and is presented on exit at the end of a session.