Frequently Asked Questions

Common questions and answers gathered from GitHub Issues and Discussions.

Setup & API Key

Where do I get an API key?

Get a free API key from Google AI Studio. Paste it into the plugin settings under Settings → Gemini Scribe → API Key.

Why do Pro models fail with my free API key?

Google requires billing to be enabled on your API key to use Pro models (like Gemini 2.5 Pro). Flash models work on free keys. To use Pro models, enable billing in Google AI Studio. (#76)

No. The plugin requires an API key from Google AI Studio — it cannot use a consumer Gemini subscription (gemini.google.com) or the Gemini CLI's OAuth/Code Assist credentials. Even if you have a paid Gemini plan and the CLI works, these are separate systems:

Terms of Service: Google's ToS prohibit third-party tools from reusing Code Assist OAuth credentials. Google has contacted other integrators who attempted this approach.
Missing features: The Code Assist endpoint doesn't support server-side tools the plugin relies on, including image generation and the file search API used for semantic vault search.
Cost tip: Flash models (e.g., Gemini 2.5 Flash) are excellent for nearly all plugin use cases and are significantly cheaper than Pro models. Flash Lite is a great option for summaries and predictive typing.

(#390, #304)

The plugin won't load and I can't access settings to enter my API key

This was fixed in v4.3.1. Update to the latest version — the plugin now loads partially when unconfigured so you can access settings. (#316)

Rate Limits & API Errors

I'm getting "Rate limit exceeded" errors

This is the most common issue. A Gemini Pro consumer subscription does not increase API rate limits — those are separate. On a free API key, you will hit low rate limits quickly.

To fix:

Enable billing on your API key in Google AI Studio
Or switch to a model with more free quota (e.g., Gemini 2.5 Flash instead of 3.0)
Check your usage at the rate limits dashboard

(#296)

I'm getting "Failed to send message" on every query

This is usually caused by an invalid/expired API key or an unavailable model.

Steps to debug:

Verify your API key is valid in Google AI Studio
Enable Debug mode in plugin settings
Open the developer console (Ctrl/Cmd + Shift + I) for detailed error messages
Try switching to a different model

(#262, #268)

Requests fail after several retry attempts

The plugin has built-in retry logic with exponential backoff (3 retries by default, so up to 4 attempts total). If requests keep failing, it's usually a rate limit or transient API issue. Check your API key validity and rate limit dashboard. You can adjust retry settings under Advanced Settings. (#131)

Models

A model I selected shows "model not found"

Google regularly retires preview model versions. The plugin automatically fetches the latest model list from GitHub on startup (cached 24 hours). Click Refresh model list in Settings → General to pick up the current list immediately, without waiting for the cache to expire. (#223)

Where are the Temperature and Top-P settings?

These are available under Advanced Settings in the plugin settings. Click "Show advanced settings" to reveal them. Temperature ranges are automatically adjusted based on the selected model's capabilities. (#105)

Other Models & Providers

Will you add support for Gemma models?

If the Gemma model you want is served through the Gemini API (ai.google.dev) it can be added to the curated model list — open an issue naming the specific model ID. If you're asking about Gemma running locally via Ollama or similar, see the local LLM question below. (#587)

Can I use non-Gemini providers like OpenAI, Anthropic, or Mistral?

No. Gemini Scribe is intentionally a Gemini-only integration — it's built tightly around the @google/genai SDK, Gemini's tool-calling surface, URL Context, inline attachments, Google Search grounding, and the File Search API used for semantic vault search. Abstracting these to a generic provider interface would effectively be a rewrite, and there are other Obsidian plugins focused on multi-provider chat if that's what you need. (#588)

Can I point the plugin at Vertex AI for privacy or compliance reasons?

No. While the @google/genai SDK itself supports Vertex AI, two features this plugin depends on are Gemini Developer API features that are not available on Vertex: the Deep Research tool and the semantic vault search (File Search) pipeline. Shipping a Vertex code path would mean silently breaking those features, plus maintaining a second auth story (service account / OAuth / ADC) that is a poor fit for an Obsidian settings pane, plus a test matrix that would never actually get exercised.

If privacy is the concern, the paid AI Studio tier's no-training terms are the intended answer. For stricter compliance needs (HIPAA, SOC2, strict data residency), a different tool is likely the right call. (#588)

Can I use a local LLM via Ollama or llama.cpp?

Ollama is supported. Switch the provider to Ollama in Settings → Gemini Scribe → Provider, point the base URL at your Ollama instance, and enter the model name you have pulled. See the Ollama Setup guide for a step-by-step walkthrough.

A few Gemini-specific features are unavailable on Ollama — these all depend on Google's cloud APIs and have no local equivalent:

Deep Research (requires Google Search grounding)
Semantic vault search / RAG (requires the Gemini File Search API)
Google Search grounding in agent mode
Google Maps grounding in agent mode
URL Context web-fetch tool
Image generation (Imagen API)

Agent mode, tool calling, scheduled tasks, lifecycle hooks, custom prompts, completions, summarization, and rewriting all work with Ollama. (#576)

Language & Localization

How does the plugin decide what language to respond in?

Gemini Scribe does not have its own language setting. Instead, it reads Obsidian's own UI language preference and tells the model to reply in that language.

Technically, the plugin calls Obsidian's native getLanguage() API, which returns the interface language you picked in Obsidian's Settings → About → Language dropdown. If no value is set, the plugin falls back to "en". The detected code is then injected into every prompt (chat, summaries, completions, rewrites, vault analysis, etc.) via prompts/languageInstruction.hbs, which tells the model:

"My user interface is set to the language code: {code}. Respond in that language unless I write to you in a different one. Keep file paths, tool parameters, and [[WikiLinks]] in their original language regardless of response language."

So the plugin does not call navigator.language, process.env.LANG, or any system locale API — it only reads Obsidian's own language preference.

I want responses in a language different from my Obsidian UI — how?

The answer depends on which feature you're trying to influence. Each feature has a different set of hooks:

For agent chat, any of these work:

Just write to the agent in the target language. The language instruction explicitly tells the model to switch if you write in a different language, so asking a question in Korean will get a Korean answer even if your Obsidian UI is set to English.
Add a language rule to AGENTS.md. The [state-folder]/AGENTS.md memory file is prepended to the system prompt for agent chat. Adding a line like "Always respond in Korean." there persists the preference across sessions.
Configure a custom prompt on the session. Custom prompts are applied per chat session via the session settings gear icon.

For selection rewrite and full-file rewrite, only AGENTS.md works — these features build their own plugin-controlled prompt but still inject AGENTS.md via the shared system prompt path.

For file summarization, inline completions, and image generation, none of the above apply. These features construct fixed plugin-controlled prompts and do not read AGENTS.md, session custom prompts, or per-invocation overrides. The only currently supported way to change their output language is to change Obsidian's interface language (Settings → About → Language), which updates the auto-detected variable that gets injected into every prompt template. A dedicated override for these paths is being tracked in #613.

(#611)

Is there a dedicated language setting in the plugin?

Not today. The auto-detection above covers most users well, and the "respond in whatever language you're written to" fallback handles the mixed-language chat case. A dedicated override may be added in the future if there's demand from users who need non-chat features (summaries, rewrites, completions) in a language that differs from their Obsidian UI.

Is the plugin's own interface translated?

Yes. Static UI text — settings tabs, modals, the agent panel, command palette entries, buttons, and notices — follows your Obsidian interface language. Translations ship in 20 languages: cs, da, de, es, fr, id, it, ja, ko, nl, no, pl, pt, pt-BR, ru, tr, uk, vi, zh, and zh-TW. If your Obsidian language isn't in that list, the plugin UI falls back to English (AI responses are unaffected — they always follow your language per the question above).

These translations are AI-generated (bootstrapped with Gemini) rather than hand-written, so phrasing may occasionally be awkward. Native speakers can refine them by editing the corresponding file in src/i18n/ and opening a pull request — hand-refined strings are preserved across regenerations, since a string is only re-translated when its English source changes. When the UI is displayed in a non-English language, a small notice on the Agent panel's start screen discloses that the translation is AI-generated. (#754)

Cost & Billing

How much does this plugin cost to use?

Gemini Scribe itself is free and open source. The cost comes from the Gemini API calls it makes on your behalf. Google offers a generous free tier for most models, and Flash/Flash Lite models are very inexpensive even on paid tiers — for typical plugin usage (chat, summaries, completions) most users spend pennies per day or stay on the free tier entirely.

How can I track my spending?

Google provides authoritative dashboards in AI Studio:

Usage dashboard — token counts, request counts, and model breakdown
Billing page — invoices, payment methods, account tier
Spend page — current and historical spending

The plugin also shows live token usage for the current agent session in the chat UI, so you can see how much context the current conversation is consuming at a glance.

Can I set a spending cap?

Yes. Google provides two types of spending controls:

Project-level monthly cap (experimental) — Set a monthly limit for your specific Google Cloud project via aistudio.google.com/spend → Monthly spend cap → Edit spend cap. Billing is evaluated with up to a ~10-minute delay, so small overages are possible.
Account tier caps — Each billing account tier has a built-in monthly ceiling (Tier 1: $250, Tier 2: $2,000, Tier 3: $20,000+). Tier caps became enforced on April 1, 2026.

The free tier has no cap but is subject to rate limits. Full details: Gemini API billing docs.

How do I keep costs low?

Use Flash models for chat and agent interactions (Gemini 2.5 Flash is excellent and significantly cheaper than Pro)
Use Flash Lite for summaries and inline completions
Set a project-level spend cap in AI Studio for peace of mind
Watch the session token counter in the agent UI to spot runaway conversations
Reset long sessions periodically — agent sessions accumulate context, and longer sessions cost more per turn

Agent mode

What happened to the separate "Classic Chat" and "Agent mode"?

In v4.0, the plugin was unified into a single agent-first interface. There is now only one chat mode with full agent capabilities. The old classic chat mode was removed. (#123)

How do I use the fetch_url tool?

fetch_url is a built-in agent tool — you don't invoke it directly. Simply ask the agent to visit or summarize a URL in your message (e.g., "Summarize the content at https://example.com") and the agent will automatically use the tool. (#292)

The agent is hallucinating file contents instead of reading actual files

Make sure you're on v4.0 or later — earlier versions had a bug where context files were not properly read from the vault. If you still see issues, verify the file is added with @ and appears in the file shelf above the input area. (#159, #180)

"Summarize active file" isn't working

This command requires a markdown file actively open in the editor. If no file is open, you'll see "No active file to summarize. Please open a markdown file first." (#134)

Semantic Vault Search

What does the Vault Index feature do? Is my data private?

The vault index uses Google's File Search API to enable semantic (meaning-based) search of your vault. Files are stored in an index private to your GCP project, tied to your API key. Your data is not shared or used for model training. The feature is configured under the Vault search index section in the plugin settings. (#297)

Plugin Conflicts

RAG indexing creates runaway "Untitled" notes in the plugin state folder

This is caused by a conflict with the Folder Notes plugin, not Gemini Scribe itself. Folder Notes automatically creates notes when it detects new folders or file activity, and the rapid file operations during RAG indexing can trigger it repeatedly.

To fix: Disable the Folder Notes plugin, or configure it to ignore the Gemini Scribe state folder (default: gemini-scribe/). (#463)

Miscellaneous

What happened to the "Context Depth" setting?

The depth traversal setting was removed in v4.0 when Agent mode became the default. The agent now automatically searches your vault for relevant documents using tools instead of following a fixed link-depth hierarchy. Use @ mentions to explicitly add context files. (#267)

My custom prompt template isn't being applied

If the agent is ignoring your custom prompt, check that your model's rate limits haven't been exceeded — rate limit errors can appear as generic "failed" messages. Also verify: (1) the prompt file exists in [Plugin state folder]/Prompts/, and (2) the frontmatter reference uses correct wikilink syntax [[Prompt Name]]. (#330)

How do I reuse prompts in Agent mode?

Custom prompts are applied per-session, not executed as commands. To reuse a prompt:

Open the agent panel and start or load a session
Click the gear icon (session settings) in the session header
Select your prompt from the Prompt template dropdown
The prompt is now active for that session — all messages will use it

To apply the same prompt to different files, add the files as context (drag them in or use @ to mention them) while the prompt is active.

If you need a repeatable multi-step procedure rather than a behavioral style, consider creating a skill instead. Skills define step-by-step workflows the agent follows on demand.

Custom prompts and skills both work on mobile (Android and iOS). (#449)

Still have questions? Check the GitHub Discussions or open an issue.

Frequently Asked Questions ​

Setup & API Key ​

Where do I get an API key? ​

Why do Pro models fail with my free API key? ​

Can I use my Gemini Pro/Advanced subscription or Gemini CLI login instead of an API key? ​

The plugin won't load and I can't access settings to enter my API key ​

Rate Limits & API Errors ​

I'm getting "Rate limit exceeded" errors ​

I'm getting "Failed to send message" on every query ​

Requests fail after several retry attempts ​

Models ​

A model I selected shows "model not found" ​

Where are the Temperature and Top-P settings? ​

Other Models & Providers ​

Will you add support for Gemma models? ​

Can I use non-Gemini providers like OpenAI, Anthropic, or Mistral? ​

Can I point the plugin at Vertex AI for privacy or compliance reasons? ​

Can I use a local LLM via Ollama or llama.cpp? ​

Language & Localization ​

How does the plugin decide what language to respond in? ​

I want responses in a language different from my Obsidian UI — how? ​

Is there a dedicated language setting in the plugin? ​

Is the plugin's own interface translated? ​

Cost & Billing ​

How much does this plugin cost to use? ​

How can I track my spending? ​

Can I set a spending cap? ​

How do I keep costs low? ​

Agent mode ​

What happened to the separate "Classic Chat" and "Agent mode"? ​

How do I use the fetch_url tool? ​

The agent is hallucinating file contents instead of reading actual files ​

"Summarize active file" isn't working ​

Semantic Vault Search ​

What does the Vault Index feature do? Is my data private? ​

Plugin Conflicts ​

RAG indexing creates runaway "Untitled" notes in the plugin state folder ​

Miscellaneous ​

What happened to the "Context Depth" setting? ​

My custom prompt template isn't being applied ​

How do I reuse prompts in Agent mode? ​