
Gemini provider

The NucleusIQ Gemini provider (nucleusiq-gemini) integrates Google's Gemini models using the official google-genai SDK (GA).

Installation

pip install nucleusiq nucleusiq-gemini

Set your API key:

export GEMINI_API_KEY="your-gemini-api-key"

Or in a .env file at your project root:

GEMINI_API_KEY=your-gemini-api-key
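
Before constructing the provider, it can help to fail fast when the key is missing. This is a generic stdlib snippet, not a NucleusIQ API; `require_api_key` is a hypothetical helper name:

```python
import os

def require_api_key(name: str = "GEMINI_API_KEY") -> str:
    """Fail fast with a clear message when the key is missing."""
    key = os.environ.get(name)
    if not key:
        raise RuntimeError(f"{name} is not set; export it or add it to your .env file")
    return key
```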

Quick start

import asyncio
from nucleusiq.agents import Agent
from nucleusiq.agents.config import AgentConfig, ExecutionMode
from nucleusiq.prompts.zero_shot import ZeroShotPrompt
from nucleusiq_gemini import BaseGemini, GeminiLLMParams

async def main():
    llm = BaseGemini(model_name="gemini-2.5-flash")
    agent = Agent(
        name="assistant",
        prompt=ZeroShotPrompt().configure(
            system="You are a helpful assistant.",
        ),
        llm=llm,
        config=AgentConfig(
            execution_mode=ExecutionMode.STANDARD,
            llm_params=GeminiLLMParams(temperature=0.7, max_output_tokens=1024),
        ),
    )
    result = await agent.execute({"id": "gemini-guide-1", "objective": "What is the capital of France?"})
    print(result.output)

asyncio.run(main())

Supported models

Model             Context window   Max output   Thinking support
gemini-2.5-pro    1M tokens        65,536       Yes
gemini-2.5-flash  1M tokens        65,536       Yes
gemini-2.0-flash  1M tokens        8,192        No
gemini-1.5-pro    2M tokens        8,192        No
gemini-1.5-flash  1M tokens        8,192        No
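
For programmatic model selection, the table can be captured in a small lookup. This is an illustrative helper, not part of the provider API; the `MODELS` dict and `supports_thinking` name are assumptions:

```python
# Model metadata mirroring the table above.
MODELS = {
    "gemini-2.5-pro":   {"context": 1_000_000, "max_output": 65_536, "thinking": True},
    "gemini-2.5-flash": {"context": 1_000_000, "max_output": 65_536, "thinking": True},
    "gemini-2.0-flash": {"context": 1_000_000, "max_output": 8_192,  "thinking": False},
    "gemini-1.5-pro":   {"context": 2_000_000, "max_output": 8_192,  "thinking": False},
    "gemini-1.5-flash": {"context": 1_000_000, "max_output": 8_192,  "thinking": False},
}

def supports_thinking(model_name: str) -> bool:
    """Check the table before attaching a thinking_config."""
    return MODELS.get(model_name, {}).get("thinking", False)
```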

Gemini-specific parameters

Use GeminiLLMParams for parameters beyond the common set:

from nucleusiq.agents import Agent
from nucleusiq.agents.config import AgentConfig
from nucleusiq.prompts.zero_shot import ZeroShotPrompt
from nucleusiq_gemini import BaseGemini, GeminiLLMParams, GeminiThinkingConfig, GeminiSafetySettings

llm = BaseGemini(model_name="gemini-2.5-flash")
prompt = ZeroShotPrompt().configure(system="You are a helpful assistant.")

params = GeminiLLMParams(
    temperature=0.7,
    max_output_tokens=2048,
    top_k=40,
    top_p=0.95,

    # Thinking mode (Gemini 2.5+ models)
    thinking_config=GeminiThinkingConfig(thinking_budget=2048),

    # Safety settings
    safety_settings=[
        GeminiSafetySettings(
            category="HARM_CATEGORY_DANGEROUS_CONTENT",
            threshold="BLOCK_ONLY_HIGH",
        )
    ],
)

config = AgentConfig(llm_params=params)
agent = Agent(
    name="gemini-params-demo",
    prompt=prompt,
    llm=llm,
    config=config,
)

Native server-side tools

Gemini provides built-in tools that execute on Google's servers. There is no separate charge for these tools; standard model token pricing applies.

from nucleusiq_gemini import GeminiTool

# Google Search — ground responses with real-time web results
search = GeminiTool.google_search()

# Code Execution — run Python in a secure sandbox
code = GeminiTool.code_execution()

# URL Context — fetch and understand web pages
url_ctx = GeminiTool.url_context()

# Google Maps — location-aware grounding
maps = GeminiTool.google_maps()

Use them like any other tool:

agent = Agent(
    name="search-bot",
    prompt=ZeroShotPrompt().configure(system="Answer with the latest info."),
    llm=llm,
    tools=[GeminiTool.google_search(), GeminiTool.code_execution()],
    config=AgentConfig(execution_mode=ExecutionMode.STANDARD),
)

Or call directly on the LLM:

result = await llm.call(
    model="gemini-2.5-flash",
    messages=[{"role": "user", "content": "What are the latest AI news?"}],
    tools=[GeminiTool.google_search()],
    max_output_tokens=1024,
)

Mixing native and custom tools

New in v0.7.5

The Gemini 2.5 API limitation

Google's generateContent API rejects requests that combine native tools (google_search, code_execution, url_context, google_maps) with custom function declarations in the same tools array:

400 INVALID_ARGUMENT
Built-in tools ({google_search}) and Function Calling cannot be combined.

This affects all Gemini 2.5 models. Google's native fix (tool combinations) is available only for Gemini 3 models in Preview. No other framework — LangChain, CrewAI, AutoGen, or even Google ADK — provides a transparent single-agent solution.

NucleusIQ v0.7.5 resolves this with a transparent proxy pattern. Just pass your tools and it works:

from nucleusiq.agents import Agent
from nucleusiq.agents.config import AgentConfig
from nucleusiq.agents.task import Task
from nucleusiq.prompts.zero_shot import ZeroShotPrompt
from nucleusiq_gemini import BaseGemini
from nucleusiq_gemini.tools.gemini_tool import GeminiTool
from nucleusiq.tools.decorators import tool

@tool
def convert_temperature(celsius: float) -> str:
    """Convert Celsius to Fahrenheit."""
    return f"{celsius}°C = {celsius * 9/5 + 32}°F"

agent = Agent(
    name="researcher",
    prompt=ZeroShotPrompt().configure(
        system="Search the web and do calculations. Provide accurate answers.",
    ),
    llm=BaseGemini(model_name="gemini-2.5-flash"),
    tools=[
        GeminiTool.google_search(),   # Native tool
        GeminiTool.code_execution(),  # Native tool
        convert_temperature,          # Custom tool
    ],
    config=AgentConfig(execution_mode=ExecutionMode.STANDARD, enable_tracing=True),
)

result = await agent.execute(
    Task(objective="Search for Tokyo's temperature, convert to Fahrenheit.")
)

for tc in result.tool_calls:
    print(f"{tc.tool_name}: {tc.duration_ms:.0f}ms, success={tc.success}")

How it works behind the scenes:

  1. BaseGemini.convert_tool_specs() detects mixed native + custom tools
  2. Native tools switch to proxy mode — they appear as function declarations to the LLM
  3. The LLM sees all tools as callable functions (no API rejection)
  4. When a native tool is called, the proxy makes a separate generateContent sub-call with the real native tool spec
  5. Results flow back as if from a normal tool call
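
The steps above can be sketched in plain Python. This is illustrative only, not the provider's actual implementation; the stub functions and registry are hypothetical stand-ins:

```python
def native_search_subcall(query: str) -> str:
    # Stand-in for the separate generateContent sub-call made with the
    # real google_search tool spec attached (step 4).
    return f"[grounded results for: {query}]"

def custom_convert(celsius: float) -> str:
    # An ordinary custom tool, executed locally.
    return f"{celsius}°C = {celsius * 9/5 + 32}°F"

# One flat registry: the model only ever sees callable function
# declarations, so the tools array is never rejected (steps 2-3).
registry = {
    "google_search": native_search_subcall,  # native tool in proxy mode
    "convert_temperature": custom_convert,   # custom tool
}

def dispatch(tool_name: str, **kwargs) -> str:
    """Route a model-issued function call to the proxy or a local tool (step 5)."""
    return registry[tool_name](**kwargs)
```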

Key properties:

  • Zero code changes — pass tools exactly as you would normally
  • All execution modes — Direct, Standard, and Autonomous all work
  • All Gemini models — 2.5-flash, 2.5-pro, and future models
  • Auto-detection — proxy activates only when tools are mixed; native-only or custom-only configurations have zero overhead
  • Provider-isolated — the proxy lives entirely in nucleusiq-gemini; the core framework is unaware

Timing expectations

Native tools in proxy mode (e.g. google_search) take 1.7-4.4 seconds per call because each one makes a real API round-trip. Custom tools execute locally in microseconds. This is expected: the proxy trades one sub-call per native tool invocation for the ability to use all tools together.
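
Building on the result.tool_calls loop shown earlier, per-tool timing can be aggregated to see where time goes. The tuples below are illustrative data standing in for ToolCall records:

```python
from collections import defaultdict

# Illustrative (tool_name, duration_ms) pairs, shaped like result.tool_calls entries.
calls = [
    ("google_search", 2100.0),
    ("convert_temperature", 0.3),
    ("google_search", 1800.0),
]

# Sum durations per tool: proxy-mode native tools dominate, local tools are negligible.
totals = defaultdict(float)
for name, ms in calls:
    totals[name] += ms

for name, ms in sorted(totals.items(), key=lambda kv: -kv[1]):
    print(f"{name}: {ms:.0f}ms total")
```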

Gearbox strategy (3 modes)

All three execution modes work identically with Gemini. The tabs below assume you have already defined prompt (for example ZeroShotPrompt().configure(system="...")) and llm (BaseGemini(...)) as in Quick start.

Direct: a single LLM call. Best for Q&A, classification, and summarization.

config = AgentConfig(execution_mode=ExecutionMode.DIRECT)
agent = Agent(name="fast", prompt=prompt, llm=BaseGemini(model_name="gemini-2.5-flash"), config=config)
result = await agent.execute({"id": "gemini-guide-2", "objective": "What is quantum computing?"})

Standard: iterative tool calling (up to 80 tool invocations per run by default, unless you set max_tool_calls). Best for tasks that need external data.

from nucleusiq.tools.decorators import tool

@tool
def get_weather(city: str) -> str:
    """Get weather for a city."""
    return "22°C, Sunny"

config = AgentConfig(execution_mode=ExecutionMode.STANDARD)
agent = Agent(name="weather", prompt=prompt, llm=llm, config=config, tools=[get_weather])
result = await agent.execute({"id": "gemini-guide-3", "objective": "What's the weather in Tokyo?"})

Autonomous: task decomposition, Critic verification, and a Refiner loop. Best for complex, multi-step tasks.

from nucleusiq_gemini.tools.gemini_tool import GeminiTool

config = AgentConfig(
    execution_mode=ExecutionMode.AUTONOMOUS,
    require_quality_check=True,
    max_iterations=5,
)
agent = Agent(
    name="researcher",
    prompt=prompt,
    llm=llm,
    config=config,
    tools=[GeminiTool.google_search(), GeminiTool.code_execution()],
)
result = await agent.execute({"id": "gemini-guide-4", "objective": "Compare Python and Rust for AI applications."})

Structured output

Pass a Pydantic model to get typed responses:

from pydantic import BaseModel

class MovieReview(BaseModel):
    title: str
    rating: float
    summary: str

result = await llm.call(
    model="gemini-2.5-flash",
    messages=[{"role": "user", "content": "Review the movie Inception"}],
    response_format=MovieReview,
    max_output_tokens=512,
)
print(result.title, result.rating)

Streaming

async for event in agent.execute_stream({"id": "s1", "objective": "Write a poem about the ocean"}):
    if event.type.value == "token":
        print(event.token, end="", flush=True)
    elif event.type.value == "thinking":
        print(f"[thinking] {event.message}", end="")
    elif event.type.value == "complete":
        print(f"\nTokens: {event.metadata.get('usage', {})}")

Multimodal attachments

Gemini natively supports images, PDFs, and files:

from nucleusiq.agents.attachments import Attachment, AttachmentType
from nucleusiq.agents.task import Task

task = Task(
    id="analyze",
    objective="Describe this image",
    attachments=[
        Attachment(type=AttachmentType.IMAGE_URL, data="https://example.com/photo.jpg"),
    ],
)
result = await agent.execute(task)

Supported attachment types: IMAGE_URL, IMAGE_BASE64, PDF, FILE_BYTES, FILE_BASE64, TEXT.
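
For the base64 attachment types, the raw file bytes need base64 encoding before being passed as data. A minimal stdlib helper (illustrative; `encode_for_attachment` is a hypothetical name, and the Attachment usage in the comment follows the pattern above):

```python
import base64

def encode_for_attachment(raw: bytes) -> str:
    """Base64-encode raw file bytes for IMAGE_BASE64 / FILE_BASE64 attachments."""
    return base64.b64encode(raw).decode("ascii")

# Then, following the pattern above:
#   Attachment(type=AttachmentType.IMAGE_BASE64, data=encode_for_attachment(raw))
payload = encode_for_attachment(b"fake-image-bytes")
```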

Error handling

The provider maps all Gemini SDK errors to NucleusIQ's framework-level exceptions:

from nucleusiq.llms.errors import RateLimitError, AuthenticationError, LLMError

try:
    result = await agent.execute({"id": "gemini-guide-5", "objective": "Hello"})
except RateLimitError as e:
    print(f"Rate limited: retry after backoff (status {e.status_code})")
except AuthenticationError:
    print("Check your GEMINI_API_KEY")
except LLMError as e:
    print(f"LLM error from {e.provider}: {e}")

Built-in retry with exponential backoff handles transient errors automatically (rate limits, server errors, connection issues). Auth and validation errors fail fast.
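
The provider's exact retry parameters are not documented here, but an exponential backoff schedule generally looks like this (an illustrative sketch with assumed base, factor, and cap values, not the provider's actual configuration):

```python
def backoff_schedule(base: float = 1.0, factor: float = 2.0,
                     attempts: int = 5, cap: float = 30.0) -> list[float]:
    """Delays in seconds before each retry, capped; jitter omitted for clarity."""
    return [min(cap, base * factor ** i) for i in range(attempts)]
```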

Provider portability

Switch between providers with zero agent code changes:

from nucleusiq.agents import Agent
from nucleusiq.agents.config import AgentConfig, ExecutionMode
from nucleusiq.prompts.zero_shot import ZeroShotPrompt

prompt = ZeroShotPrompt().configure(system="You are a helpful assistant.")
config = AgentConfig(execution_mode=ExecutionMode.STANDARD)

# Gemini
from nucleusiq_gemini import BaseGemini
llm = BaseGemini(model_name="gemini-2.5-flash")

# OpenAI (same agent shape) — swap llm only:
# from nucleusiq_openai import BaseOpenAI
# llm = BaseOpenAI(model_name="gpt-4o")

agent = Agent(
    name="my-agent",
    prompt=prompt,
    llm=llm,
    config=config,
    tools=[],
    plugins=[],
)

Tools, plugins, memory, and streaming all work identically across providers.

See also