# Memory
NucleusIQ agents support conversation memory across turns. Choose a strategy that fits your workload. All strategies work identically with any LLM provider (OpenAI, Gemini, MockLLM).
## Strategies
| Strategy | Use case | How it works |
|---|---|---|
| `FULL_HISTORY` | Short conversations | Keeps all messages |
| `SLIDING_WINDOW` | Chat applications | Keeps last N messages |
| `SUMMARY` | Long conversations | Summarizes older turns via LLM |
| `SUMMARY_WINDOW` | Long-running assistants | Summary of old + recent window |
| `TOKEN_BUDGET` | Hard cost control | Keeps messages within token budget |
## Configuration
```python
from nucleusiq.agents import Agent
from nucleusiq.agents.config import AgentConfig, ExecutionMode
from nucleusiq.prompts.zero_shot import ZeroShotPrompt
from nucleusiq.memory import MemoryFactory, MemoryStrategy
from nucleusiq_openai import BaseOpenAI

memory = MemoryFactory.create_memory(
    MemoryStrategy.SLIDING_WINDOW,
    window_size=10,
)

agent = Agent(
    name="memory-demo",
    prompt=ZeroShotPrompt().configure(system="You are a helpful assistant."),
    llm=BaseOpenAI(model_name="gpt-4.1-mini"),
    memory=memory,
    config=AgentConfig(execution_mode=ExecutionMode.STANDARD),
)
```
## Strategy examples
```python
from nucleusiq.memory import MemoryFactory, MemoryStrategy

# Keep everything (default)
full = MemoryFactory.create_memory(MemoryStrategy.FULL_HISTORY)

# Last 12 messages
windowed = MemoryFactory.create_memory(MemoryStrategy.SLIDING_WINDOW, window_size=12)

# Hard token limit
budgeted = MemoryFactory.create_memory(MemoryStrategy.TOKEN_BUDGET, max_tokens=4096)

# Summarize old messages (requires an LLM)
summary = MemoryFactory.create_memory(MemoryStrategy.SUMMARY)

# Summary + recent window (best of both)
hybrid = MemoryFactory.create_memory(MemoryStrategy.SUMMARY_WINDOW, window_size=8)
```
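To illustrate what a hard token budget means in practice, here is a hedged sketch of budget-based trimming in plain Python. The heuristic token estimator and the drop-oldest-first policy are assumptions for illustration, not NucleusIQ's internals:

```python
def estimate_tokens(text: str) -> int:
    # Rough heuristic: ~4 characters per token (real tokenizers differ)
    return max(1, len(text) // 4)

def trim_to_budget(messages: list[dict], max_tokens: int) -> list[dict]:
    """Keep the newest messages whose combined estimate fits max_tokens."""
    kept: list[dict] = []
    total = 0
    for msg in reversed(messages):  # walk from newest to oldest
        cost = estimate_tokens(msg["content"])
        if total + cost > max_tokens:
            break  # this and all older messages are dropped
        kept.append(msg)
        total += cost
    return list(reversed(kept))  # restore chronological order

history = [
    {"role": "user", "content": "x" * 40},       # ~10 tokens
    {"role": "assistant", "content": "y" * 40},  # ~10 tokens
    {"role": "user", "content": "z" * 8},        # ~2 tokens
]
print(trim_to_budget(history, max_tokens=12))  # keeps only the last two messages
```

Unlike `SLIDING_WINDOW`, which bounds the message *count*, a token budget bounds cost directly, so a few long messages can evict many short ones.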
Recommendations:

- `SLIDING_WINDOW` for most chat apps
- `TOKEN_BUDGET` when you need hard cost limits
- `SUMMARY_WINDOW` for long-running assistants that need both context and efficiency
## File-aware metadata
All strategies store attachment metadata alongside messages. User messages with attachments get an `[Attached: ...]` summary prefix so attachment context carries across turns.
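A minimal sketch of the prefixing idea described above — the helper name and exact prefix format are illustrative assumptions, and the actual NucleusIQ formatting may differ:

```python
def with_attachment_prefix(content: str, attachments: list[str]) -> str:
    """Prepend an '[Attached: ...]' summary when attachments are present.

    Illustrative only: mirrors the behavior described in the docs,
    not the library's actual implementation.
    """
    if not attachments:
        return content
    return f"[Attached: {', '.join(attachments)}] {content}"

print(with_attachment_prefix("Summarize this report", ["report.pdf"]))
# → [Attached: report.pdf] Summarize this report
```

Because the prefix lives in the stored message text, later turns retain the attachment context even after the raw file content is no longer in the window.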
## See also
- Agents — Agent configuration
- Attachments — Multimodal inputs
- Usage tracking — Monitor token consumption