Google AI Studio Behavior
This page covers the toggles and options that control how IntenseRP interacts with Google AI Studio (aistudio.google.com).
Modes (model IDs)
In IntenseRP Next v2, the model you select in SillyTavern is still mostly a behavior preset, not a true AI Studio model picker.
For Google AI Studio, these model IDs map to the following behavior:
| Model ID | Behavior |
|---|---|
aistudio-auto |
Uses your IntenseRP settings |
aistudio-chat |
Suppresses <think> output and lowers Thinking Level on supported AI Studio models |
aistudio-reasoner |
Uses your configured Thinking Level and Send Thinking setting |
Why chat mode works differently here
Google AI Studio's supported thinking models do not expose a true "thinking off" switch the way some other providers do.
So IntenseRP approximates a chat-like mode by:
- lowering Thinking Level to the minimum available (when the model supports it), and
- suppressing
<think>output to the client
Real AI Studio model selection (web UI)
Google AI Studio has a real model picker in the web UI:
Settings -> Provider Behavior -> Google AI Studio -> Model
Current model entries:
Gemini 3.5 FlashGemini 3.1 ProGemini 3.1 Flash LiteGemini 3 FlashGemini 2.5 Pro(paid in AI Studio; requires the paid-access override)Gemini 2.5 Flash(paid in AI Studio; requires the paid-access override)Gemini 2.5 Flash Lite(paid in AI Studio; requires the paid-access override)Gemma 4 26B-A4BGemma 4 31B
This is separate from the API aistudio-* behavior presets.
Gemini 2.5 models need paid account access
Gemini 2.5 Pro, Flash, and Flash Lite have become paid in Google AI Studio. IntenseRP keeps their model entries and old thinking-budget mappings so existing settings don't disappear, but runtime requests that resolve to a Gemini 2.5 model stop unless Assume Paid Model Access is enabled for the active account.
Gemini 2.5 models have become paid in Google AI Studio, and IRP can't serve them.
This is an account-access override, not a way to buy or bypass access. If a paid AI Studio API key is available, the actual AI Studio API is usually the cleaner path.
Authentication
Google AI Studio uses a Google login flow. IntenseRP supports both:
- Manual login
- Auto Login (best-effort Google form autofill)
Persistent Sessions strongly recommended
Google can still ask for manual confirmation, CAPTCHAs, or extra account checks even when Auto Login is enabled.
In practice, Persistent Sessions are the main quality-of-life feature here. If your Google session stays alive, AI Studio becomes much smoother to use.
First-use legal acknowledgement
On accounts that have never used AI Studio before, Google may show a legal / terms acknowledgement modal after login.
IntenseRP now detects that modal, notifies you, and waits for you to accept it manually before continuing with setup.
Auto Login Redirect Timeout
If Auto Login fills your credentials but Google does not return to AI Studio quickly enough, IntenseRP falls back to manual completion.
Settings -> Provider Behavior -> Google AI Studio -> Auto Login Redirect Timeout (s)
Default is 15 seconds.
Assume Paid Model Access
If your Google AI Studio account can use the paid Gemini 2.5 models in the web UI, you can allow IntenseRP to use those model entries again:
Settings -> Provider Behavior -> Google AI Studio -> Assume Paid Model Access
When enabled, Paid Model Access Emails appears below it. Leave that list empty to assume every AI Studio account has access. Add one or more emails to restrict Gemini 2.5 requests to only those accounts.
Be picky here
Only enable this when you're sure the account really has Gemini 2.5 access in AI Studio. With Providers in Parallel, IntenseRP uses the email list to route Gemini 2.5 requests to matching AI Studio lanes when it can. Accounts not on the list are treated as not having access.
Thinking
Google AI Studio exposes Thinking Level instead of a simple on/off reasoning toggle.
Enable Thinking
When enabled, IntenseRP uses your configured Thinking Level on supported AI Studio models.
When disabled, IntenseRP falls back to the lowest available level instead.
Settings -> Provider Behavior -> Google AI Studio -> Enable Thinking
Thinking Level
Controls the level IntenseRP selects on supported AI Studio models:
MinimalLowMediumHigh
Settings -> Provider Behavior -> Google AI Studio -> Thinking Level
Gemini 2.5 Flash, Flash Lite, and Pro
Gemini 2.5 Flash, Flash Lite, and Pro use a manual thinking budget instead of the newer Thinking Level dropdown.
For Gemini 2.5 Flash and Gemini 2.5 Flash Lite, Minimal turns Thinking Mode off. Low, Medium, and High turn Thinking Mode on and set the manual budget to 8192, 16384, and 24576. Flash Lite's manual budget range starts at 512.
For Gemini 2.5 Pro, Thinking Mode is always on, so Minimal, Low, Medium, and High map to 128, 8192, 16384, and 32768.
These mappings are kept for compatibility. Gemini 2.5 requests are allowed only when Assume Paid Model Access says the active account can use them.
Gemma 4 models
Gemma 4 models currently expose Minimal and High. When your configured level lands between those two, IntenseRP picks the nearest available level.
Per-request reasoning effort
For API clients, the recommended per-request way to control AI Studio thinking is reasoning_effort:
This works when Settings -> API Server -> Request Controls -> Accept API Reasoning Effort is enabled and Google AI Studio is selected under Reasoning Effort Providers.
AI Studio maps effort values to Thinking Level:
| API effort | AI Studio Thinking Level |
|---|---|
Not sent, auto |
Chat/off mode |
minimum, minimal |
Minimal |
low |
Low |
medium |
Medium |
high, max, xhigh |
High |
This is usually nicer than changing the global IRP setting or adding macros to prompts, especially when the client already has a reasoning-effort control.
Model-specific rounding
Some AI Studio models expose fewer levels, so IntenseRP picks the closest supported one. The old Gemini 2.5 budget mappings are still kept for compatibility, but those models are rejected at runtime because they are paid in AI Studio.
Send Thinking
When enabled, AI Studio thinking summaries are included in the API response, wrapped in <think> tags.
Settings -> Provider Behavior -> Google AI Studio -> Send Thinking
Search and URL Context
Google AI Studio exposes two browsing-style tools in the web UI:
Enable Search
Toggles Grounding with Google Search.
Settings -> Provider Behavior -> Google AI Studio -> Enable Search
Enable URL Context
Toggles URL Context browsing.
Settings -> Provider Behavior -> Google AI Studio -> Enable URL Context
Gemma 4 models
Gemma 4 models force Google Search grounding on and do not expose URL Context in AI Studio, so IntenseRP treats Search as enabled and skips the URL Context toggle for them.
Grounding payloads
AI Studio can emit extra grounding/search payloads into the same response stream. IntenseRP strips those provider-specific payloads and forwards only the assistant text.
System Prompt Field
AI Studio has its own System Instructions box, and IntenseRP can optionally use it for the leading system messages in your chat.
Settings -> Provider Behavior -> Google AI Studio -> Use System Prompt Field
When this is enabled, IntenseRP:
- pulls consecutive
systemmessages from the start of the request - pastes them into AI Studio's native System Instructions UI
- removes those leading
systemmessages from the normal chat prompt before sending - also moves your configured prompt injection there when Injection Position is set to
Before
Anything after the first non-system message stays in the normal prompt on purpose. Mid-chat system messages are left alone, because AI Studio's separate system field is global to the whole chat and mixing both behaviors would get weird fast.
Startup cleanup
AI Studio stores system instructions in browser local storage. When this option is enabled, IntenseRP clears that local cache once on page load and refreshes the tab so old instructions do not pile up.
File Upload Mode
Google AI Studio can now upload the prompt as a text file through its native media picker flow.
Settings -> Provider Behavior -> Google AI Studio -> Send As Text File
This is a best-effort implementation because AI Studio does not expose a clean direct file input - it opens a native picker from the media menu instead.
First upload on a fresh AI Studio account
On accounts that have not uploaded media before, AI Studio may show a copyright acknowledgement dialog after you pick the file.
IntenseRP now waits briefly for that dialog and clicks Agree to the copyright acknowledgement automatically when it appears.
Text File Message
Optional text to send alongside the uploaded prompt file.
Leave it empty to try file-only requests.
Settings -> Provider Behavior -> Google AI Studio -> Text File Message
File Upload Timeout
Controls how long IntenseRP waits (in seconds) for the send button to become available after the file is selected.
Settings -> Provider Behavior -> Google AI Studio -> File Upload Timeout
Sampling and output controls
Google AI Studio is currently the only provider in IntenseRP that applies these OpenAI-style request controls in the web UI when the selected model exposes the matching fields:
temperaturetop_pmax_tokens
You can set defaults in Settings, and request-level API values still win when provided.
Some models don't expose every sampling field. For example, Gemini 3.5 Flash and Gemini 3.1 Flash Lite currently have no Temperature or Top P inputs, so IntenseRP skips those controls for those models and still applies Max Output Tokens.
Temperature
Settings -> Provider Behavior -> Google AI Studio -> Temperature
Default is 1.0.
Top P
Settings -> Provider Behavior -> Google AI Studio -> Top P
Default is 0.95.
Max Output Tokens
Settings -> Provider Behavior -> Google AI Studio -> Max Output Tokens
Default is 65536.
Gemma 4 models are capped at 32768, so IntenseRP clamps larger request or settings values before applying the web UI control.
Safety Filters
On first AI Studio startup, IntenseRP automatically moves AI Studio's safety sliders to their lowest position once for that browser session.
What it does
This does not guarantee uncensored output. It only lowers the safety sliders that AI Studio exposes in the UI.
Anti-Censorship
Google AI Studio can hard-block a reply on the backend. When that happens, you usually get either:
- no usable assistant text in the stream, or
- a turn in the UI that shows Content blocked in the Safety Ratings button area
Settings -> Provider Behavior -> Google AI Studio -> Anti-Censorship
When this is enabled, IntenseRP does an invasive recovery flow:
- It watches the intercepted AI Studio response and the latest assistant turn for a hard-block signal
- If the turn is blocked, IntenseRP edits that assistant message in-place
- It replaces the blocked turn with your configured Replacement Message
- It sends your configured Continue Nudge as a normal text message
- It retries this up to 3 follow-up nudges
Blocked attempts stay hidden from the API stream. As soon as a recovery attempt reaches real assistant answer text again, that retry streams normally.
Not the same as DeepSeek
DeepSeek anti-censorship is basically a frontend workaround. AI Studio's version is more of a backend recovery trick, because the censorship is enforced deeper in the request/response flow.
Reuse Matching Chat gets disabled for that chat
If IntenseRP detects hard censorship in the current AI Studio chat, it clears that chat out of the clean-regeneration cache.
In plain English: the next identical request will start fresh instead of trying to regenerate inside that now-cursed chat.
CAARS (Cupcake's AIStudio AntiCensorship Ratelimit Saver)
"We've been trying to reach you about your caar's extended warranty."
CAARS is an optional prelude for AI Studio Anti-Censorship. It only appears when Anti-Censorship is enabled.
When enabled, IntenseRP sends the original prompt to a secondary Savior Model first, with that model's thinking level pushed as low as AI Studio allows. The default is Gemini 3.1 Flash Lite, since it's cheap and close enough to latest for this little warm-up act. IntenseRP waits until either the savior request finishes on its own or the assistant turn has produced 5 meaningful visible text updates. If the 5-update threshold wins while the savior is still running, IntenseRP clicks AI Studio's Stop button. Then it edits that savior assistant turn and replaces it with your configured Replacement Message, even if the turn was not blocked.
After that, IntenseRP switches back to your normal AI Studio Model and sends your configured Continue Nudge. The main model's answer is the one that streams back to the API. If the main model gets hard-censored too, the normal Anti-Censorship edit + continue retry flow still runs.
Reuse Matching Chat
CAARS starts from a fresh AI Studio chat for the prelude and skips Reuse Matching Chat for that request. It needs the savior turn and the main continuation to live in the same fresh chat.
Savior Model
The AI Studio model used for the CAARS prelude.
Default is Gemini 3.1 Flash Lite.
Settings -> Provider Behavior -> Google AI Studio -> Savior Model
Replacement Message
Text used to replace the blocked assistant turn before IntenseRP sends the continue nudge.
Default is ..
Settings -> Provider Behavior -> Google AI Studio -> Replacement Message
Continue Nudge
Text IntenseRP sends as the next user message after a blocked assistant turn is replaced.
Default is Continue..
Settings -> Provider Behavior -> Google AI Studio -> Continue Nudge
Sent as plain text
The continue nudge is always typed into the composer normally.
Even if Send As Text File is enabled for the original prompt, IntenseRP does not upload the nudge as a file.
Reuse Matching Chat
When enabled, IntenseRP tries to click AI Studio's regenerate action if:
- The new prompt matches the cached last prompt
- Effective AI Studio settings also match
Otherwise it opens a fresh chat.
Settings -> Provider Behavior -> Google AI Studio -> Reuse Matching Chat
No Search Older Matching Chats here (yet)
Google AI Studio does not support Search Older Matching Chats right now.
So for AI Studio, Reuse Matching Chat only checks the currently remembered latest chat instead of trying older cached conversations.
Blocked chats are skipped
If AI Studio hard-censors a turn and Anti-Censorship kicks in, IntenseRP clears that chat from the clean-regeneration cache instead of reusing it later.
Mutually exclusive with Preflight Next Chat
Reuse Matching Chat needs to keep the completed chat around so it can press Regenerate later.
Preflight Next Chat immediately moves on to a fresh blank chat after a successful response. Since those two ideas want opposite things from the browser tab, IntenseRP only lets you keep one enabled.
:material-flight-takeoff: Preflight Next Chat
When enabled, IntenseRP starts preparing the next Google AI Studio chat as soon as a successful response finishes.
It opens a fresh temporary chat, reapplies the previous request's AI Studio controls, and leaves the composer empty. The next request can then skip some setup and go straight to filling in the prompt.
Settings -> Provider Behavior -> Google AI Studio -> Preflight Next Chat
If the next request changes model, thinking, search, URL Context, sampling, or system prompt settings, IntenseRP adjusts the preflighted blank chat before sending the prompt. So the feature is best-effort speed, not a promise to ignore request-level overrides.
CAARS preflights the savior turn
When CAARS is enabled, Preflight Next Chat prepares the blank chat for the Savior Model prelude, because that is the first AI Studio request CAARS sends.
After the savior turn is replaced, IntenseRP still switches back to your main model in that same chat and sends the configured continue nudge.
UI language requirement
The Google AI Studio driver currently expects the AI Studio UI language to be English (en / en-US).
If you see a warning about the language:
- Switch your Google account language to English
- Reload the AI Studio tab
- Retry / restart the browser from IntenseRP if needed
Assume English UI
If AI Studio reports the wrong language value (and complains that the UI must be in English) while the visible page is actually English, you can bypass that detection:
Settings -> Provider Behavior -> Google AI Studio -> Assume English UI
Last resort
Leave this off unless you're absolutely sure the AI Studio UI is English. It does not translate the page or make non-English buttons searchable; it only tells IntenseRP to skip the language safety check.
Per-message macros
You can add [[...]] macros to the latest user message to override behavior for that request only.
All macros are stripped before sending.
| Macro | Effect |
|---|---|
[[think]] |
Use the configured Thinking Level |
[[nothink]], [[r0]] |
Force the lowest available Thinking Level and suppress <think> output |
[[r1]], [[r2]], [[r3]], [[r4]] |
Pick a Thinking Level tier (mapped per model) |
[[search]] |
Force Search on |
[[nosearch]], [[no_search]] |
Force Search off |
[[url]], [[urlcontext]] |
Force URL Context on |
[[nourl]], [[no_url]] |
Force URL Context off |
[[nocaars]], [[nocars]] |
Disable CAARS for this request |
[[nocaars]] / [[nocars]] only matters when AI Studio Anti-Censorship and CAARS are both enabled. Otherwise it has nothing to turn off.
Model-name suffixes
reasoning_effort is the recommended API-side method now, but you can still append a Thinking Level suffix directly to the API model string, for example:
aistudio-auto-highaistudio-auto-lowaistudio-auto-r4
IntenseRP strips that suffix, applies the Thinking Level override, then processes the remaining aistudio-* behavior preset as normal.
Quick Reference
| Setting | What It Does | Default |
|---|---|---|
| Model | Selects AI Studio's real model picker | Gemini 3.1 Pro |
| Enable Thinking | Uses a higher Thinking Level on supported AI Studio models | Off |
| Thinking Level | Picks the Thinking Level when Thinking is enabled | Medium |
| Send Thinking | Includes AI Studio thinking summaries in response | Off |
| Enable Search | Toggles Google Search grounding | Off |
| Enable URL Context | Toggles URL Context browsing | Off |
| Use System Prompt Field | Moves leading system messages into AI Studio's System Instructions UI | Off |
| Send As Text File | Uploads the prompt through AI Studio's media picker | Off |
| Text File Message | Optional text sent alongside the uploaded file | (empty) |
| File Upload Timeout | Seconds to wait for the send button after file selection | 20 |
| Anti-Censorship | Detects blocked AI Studio turns and runs the edit + continue workaround | Off |
| CAARS | Runs a savior model prelude before the main AI Studio model | Off |
| Savior Model | Model used for the CAARS prelude | Gemini 3.1 Flash Lite |
| Replacement Message | Text used to replace a blocked assistant turn before retrying | . |
| Continue Nudge | Follow-up user message sent after a blocked turn | Continue. |
| Temperature | Default temperature | 1.0 |
| Top P | Default top-p value | 0.95 |
| Max Output Tokens | Default output token budget | 65536 |
| Auto Login Redirect Timeout (s) | Wait before falling back to manual Google completion | 15 |
| Assume English UI | Skips AI Studio's <html lang> safety check when the visible UI is definitely English |
Off |
| Assume Paid Model Access | Allows Gemini 2.5 requests for accounts that have paid AI Studio access | Off |
| Paid Model Access Emails | Optional allowlist for paid-access Google accounts | (empty) |
| Reuse Matching Chat | Regenerates on duplicate prompts | Off |
| Preflight Next Chat | Prepares a blank chat after successful responses | Off |