Memory and RAG Integration

Learn how to add persistent memory and Retrieval-Augmented Generation (RAG) capabilities to your agents for context-aware conversations and knowledge-based responses.

🎯 Overview

AgenticGoKit v1beta provides flexible memory integration that enables:

Conversation History - Remember past interactions
Knowledge Retrieval - Access external knowledge bases
Context Augmentation - Enhance prompts with relevant information
Vector Search - Semantic similarity matching
Production-Ready Backends - In-memory and PostgreSQL with pgvector

🚀 Quick Start

Basic In-Memory Storage

package main

import (
    "context"
    "log"
    "github.com/agenticgokit/agenticgokit/v1beta"
    _ "github.com/agenticgokit/agenticgokit/plugins/memory/memory" // Register provider
)

func main() {
    // Create agent with memory
    agent, err := v1beta.NewBuilder("MemoryAgent").
        WithPreset(v1beta.ChatAgent).
        WithLLM("openai", "gpt-4").
        WithMemory(
            v1beta.WithMemoryProvider("memory"),
        ).
        Build()
    if err != nil {
        log.Fatal(err)
    }
    
    // First interaction - agent has no context
    result1, _ := agent.Run(context.Background(), "My name is Alice")
    // Output: "Nice to meet you, Alice!"
    
    // Second interaction - agent remembers
    result2, _ := agent.Run(context.Background(), "What's my name?")
    // Output: "Your name is Alice."
}

💾 Memory Providers

AgenticGoKit v1beta includes two production-ready memory providers:

1. In-Memory Provider (Development/Testing)

Perfect for development, testing, and prototyping:

import (
    "github.com/agenticgokit/agenticgokit/v1beta"
    _ "github.com/agenticgokit/agenticgokit/plugins/memory/memory"
)

agent, _ := v1beta.NewBuilder("DevAgent").
    WithPreset(v1beta.ChatAgent).
    WithMemory(
        v1beta.WithMemoryProvider("memory"),
    ).
    Build()

Pros:

✅ No external dependencies
✅ Fast and simple
✅ Good for testing and development

Cons:

❌ Lost on restart
❌ Not suitable for production
❌ No persistence across sessions

Use Cases:

Local development
Unit testing
Prototyping
Simple demos

2. PostgreSQL with pgvector (Production)

Enterprise-grade storage with vector search capabilities:

import (
    "github.com/agenticgokit/agenticgokit/v1beta"
    _ "github.com/agenticgokit/agenticgokit/plugins/memory/pgvector"
)

agent, _ := v1beta.NewBuilder("ProductionAgent").
    WithPreset(v1beta.ResearchAgent).
    WithMemory(
        v1beta.WithMemoryProvider("pgvector"),
    ).
    Build()

Pros:

✅ Persistent storage
✅ Horizontally scalable
✅ Vector similarity search
✅ ACID compliance
✅ Production-ready with connection pooling

Database Setup:

sql

-- Install pgvector extension
CREATE EXTENSION IF NOT EXISTS vector;

-- Tables are automatically created by the provider:
-- - personal_memory (conversation context)
-- - key_value_store (structured data)
-- - chat_history (message history)
-- - documents (document metadata)
-- - knowledge_base (RAG embeddings)

Configuration via Environment:

bash

# Set database connection
export AGENTFLOW_MEMORY_CONNECTION="postgresql://user:password@localhost:5432/agentdb"
export AGENTFLOW_MEMORY_PROVIDER="pgvector"
export AGENTFLOW_MEMORY_DIMENSIONS=1536  # OpenAI ada-002

Configuration via TOML:

toml

[memory]
provider = "pgvector"
connection = "postgresql://user:password@localhost:5432/agentdb"
max_results = 10
dimensions = 1536
auto_embed = true

[memory.embedding]
provider = "openai"
model = "text-embedding-ada-002"
cache_embeddings = true
max_batch_size = 100
timeout_seconds = 30

🔍 RAG (Retrieval-Augmented Generation)

RAG enhances agent responses by retrieving relevant context from memory before generating responses.

Basic RAG Configuration

import (
    "context"
    "github.com/agenticgokit/agenticgokit/v1beta"
    "github.com/agenticgokit/agenticgokit/core"
    _ "github.com/agenticgokit/agenticgokit/plugins/memory/memory"
)

func main() {
    agent, _ := v1beta.NewBuilder("RAGAgent").
        WithPreset(v1beta.ResearchAgent).
        WithMemory(
            v1beta.WithMemoryProvider("memory"),
            v1beta.WithRAG(2000, 0.3, 0.7), // maxTokens, personalWeight, knowledgeWeight
        ).
        Build()
    
    // Get memory provider to ingest documents
    memProvider := core.GetMemoryProvider("memory")
    
    // Ingest knowledge
    ctx := context.Background()
    doc := core.Document{
        ID:      "geo-001",
        Title:   "World Capitals",
        Content: "The capital of France is Paris. The capital of Germany is Berlin.",
        Source:  "geography-textbook",
        Type:    core.DocumentTypeText,
        Tags:    []string{"geography", "capitals"},
    }
    
    memProvider.IngestDocument(ctx, doc)
    
    // Agent automatically retrieves relevant context
    result, _ := agent.Run(ctx, "What is the capital of France?")
    // Agent uses retrieved knowledge to answer accurately
}

RAG Configuration Options

// Configure RAG weights and limits
agent, _ := v1beta.NewBuilder("SmartAgent").
    WithPreset(v1beta.ResearchAgent).
    WithMemory(
        v1beta.WithMemoryProvider("pgvector"),
        v1beta.WithRAG(
            4000, // maxTokens - maximum context size
            0.3,  // personalWeight - weight for conversation history
            0.7,  // knowledgeWeight - weight for knowledge base
        ),
    ).
    Build()

RAG Parameters:

maxTokens: Maximum tokens for RAG context (2000-8000 recommended)
personalWeight: Weight for personal memory/conversation history (0.0-1.0)
knowledgeWeight: Weight for knowledge base documents (0.0-1.0)

📚 Document Ingestion

Load documents into the knowledge base for RAG:

Ingesting Text Documents

import (
    "context"
    "fmt"
    "os"
    "path/filepath"
    "github.com/agenticgokit/agenticgokit/core"
)

func ingestTextFile(memProvider core.Memory, filePath string) error {
    content, err := os.ReadFile(filePath)
    if err != nil {
        return err
    }
    
    // Split into chunks for better retrieval
    chunks := chunkText(string(content), 1000, 200) // size, overlap
    
    ctx := context.Background()
    for i, chunk := range chunks {
        doc := core.Document{
            ID:         fmt.Sprintf("%s-chunk-%d", filePath, i),
            Title:      filepath.Base(filePath),
            Content:    chunk,
            Source:     filePath,
            Type:       core.DocumentTypeText,
            ChunkIndex: i,
            ChunkTotal: len(chunks),
        }
        
        if err := memProvider.IngestDocument(ctx, doc); err != nil {
            return err
        }
    }
    
    return nil
}

func chunkText(text string, chunkSize, overlap int) []string {
    var chunks []string
    start := 0
    
    for start < len(text) {
        end := start + chunkSize
        if end > len(text) {
            end = len(text)
        }
        
        chunks = append(chunks, text[start:end])
        start += chunkSize - overlap
    }
    
    return chunks
}

Ingesting PDF Documents

import (
    "github.com/ledongthuc/pdf"
    "github.com/agenticgokit/agenticgokit/core"
)

func ingestPDF(memProvider core.Memory, filePath string) error {
    f, r, err := pdf.Open(filePath)
    if err != nil {
        return err
    }
    defer f.Close()
    
    ctx := context.Background()
    totalPages := r.NumPage()
    
    for pageNum := 1; pageNum <= totalPages; pageNum++ {
        p := r.Page(pageNum)
        if p.V.IsNull() {
            continue
        }
        
        text, _ := p.GetPlainText(nil)
        
        doc := core.Document{
            ID:      fmt.Sprintf("%s-page-%d", filePath, pageNum),
            Title:   filepath.Base(filePath),
            Content: text,
            Source:  filePath,
            Type:    core.DocumentTypePDF,
            Metadata: map[string]interface{}{
                "page":        pageNum,
                "total_pages": totalPages,
            },
        }
        
        if err := memProvider.IngestDocument(ctx, doc); err != nil {
            return err
        }
    }
    
    return nil
}

Ingesting Web Pages

import (
    "github.com/gocolly/colly"
    "strings"
)

func ingestWebPage(memProvider core.Memory, url string) error {
    c := colly.NewCollector()
    
    var contentBuilder strings.Builder
    var title string
    
    c.OnHTML("title", func(e *colly.HTMLElement) {
        title = e.Text
    })
    
    c.OnHTML("article, main, .content", func(e *colly.HTMLElement) {
        contentBuilder.WriteString(e.Text)
    })
    
    if err := c.Visit(url); err != nil {
        return err
    }
    
    ctx := context.Background()
    doc := core.Document{
        ID:      url,
        Title:   title,
        Content: contentBuilder.String(),
        Source:  url,
        Type:    core.DocumentTypeWeb,
        Tags:    []string{"webpage"},
    }
    
    return memProvider.IngestDocument(ctx, doc)
}

Batch Ingestion

For efficient ingestion of multiple documents:

func batchIngest(memProvider core.Memory, filePaths []string) error {
    docs := make([]core.Document, 0, len(filePaths))
    
    for i, path := range filePaths {
        content, err := os.ReadFile(path)
        if err != nil {
            return err
        }
        
        docs = append(docs, core.Document{
            ID:      fmt.Sprintf("doc-%d", i),
            Title:   filepath.Base(path),
            Content: string(content),
            Source:  path,
            Type:    core.DocumentTypeText,
        })
    }
    
    ctx := context.Background()
    return memProvider.IngestDocuments(ctx, docs)
}

🔄 Session Management

Manage separate conversation sessions:

import (
    "github.com/agenticgokit/agenticgokit/v1beta"
    "github.com/agenticgokit/agenticgokit/core"
)

func sessionExample() {
    agent, _ := v1beta.NewBuilder("SessionAgent").
        WithPreset(v1beta.ChatAgent).
        WithMemory(
            v1beta.WithMemoryProvider("memory"),
            v1beta.WithSessionScoped(),
        ).
        Build()
    
    // Get memory provider
    memProvider := core.GetMemoryProvider("memory")
    
    // Session 1
    ctx1 := memProvider.SetSession(context.Background(), "user-123")
    agent.Run(ctx1, "My favorite color is blue")
    
    // Session 2 (different user)
    ctx2 := memProvider.SetSession(context.Background(), "user-456")
    agent.Run(ctx2, "My favorite color is red")
    
    // Query session 1 context
    agent.Run(ctx1, "What's my favorite color?")
    // Output: "Your favorite color is blue."
    
    // Clear a session
    memProvider.ClearSession(ctx1)
}

🎨 Memory Patterns

Pattern 1: Context-Aware Conversations

Enable context awareness for better conversations:

agent, _ := v1beta.NewBuilder("ChatAgent").
    WithPreset(v1beta.ChatAgent).
    WithMemory(
        v1beta.WithMemoryProvider("memory"),
        v1beta.WithContextAware(),
    ).
    Build()

Pattern 2: Session-Scoped Memory

Isolate memory per user session:

agent, _ := v1beta.NewBuilder("MultiUserAgent").
    WithPreset(v1beta.ChatAgent).
    WithMemory(
        v1beta.WithMemoryProvider("pgvector"),
        v1beta.WithSessionScoped(),
    ).
    Build()

Pattern 3: RAG with Both Personal and Knowledge

Combine conversation history with knowledge base:

agent, _ := v1beta.NewBuilder("HybridAgent").
    WithPreset(v1beta.ResearchAgent).
    WithMemory(
        v1beta.WithMemoryProvider("pgvector"),
        v1beta.WithContextAware(),
        v1beta.WithSessionScoped(),
        v1beta.WithRAG(4000, 0.3, 0.7),
    ).
    Build()

🎯 Best Practices

1. Choose the Right Provider

// Development & Testing
v1beta.WithMemoryProvider("memory") // Fast, no setup

// Production
v1beta.WithMemoryProvider("pgvector") // Persistent, scalable, vector search

2. Optimize Chunk Size

// Too small - too many chunks, higher retrieval overhead
chunkSize := 100 // ❌

// Too large - less relevant retrieval
chunkSize := 10000 // ❌

// Optimal - balanced granularity
chunkSize := 1000
chunkOverlap := 200 // ✅

3. Configure RAG Weights

// For personal assistant (prioritize conversation history)
v1beta.WithRAG(2000, 0.7, 0.3) // High personal weight

// For knowledge assistant (prioritize knowledge base)
v1beta.WithRAG(4000, 0.2, 0.8) // High knowledge weight

// Balanced
v1beta.WithRAG(3000, 0.5, 0.5) // Equal weights

4. Use Metadata for Better Retrieval

doc := core.Document{
    Content: "Important information",
    Metadata: map[string]interface{}{
        "category":   "technical",
        "importance": "high",
        "timestamp":  time.Now(),
        "author":     "expert",
    },
    Tags: []string{"documentation", "api", "v1beta"},
}

5. Connection Pooling (PostgreSQL)

The pgvector provider automatically configures connection pooling:

// Automatic configuration:
// - MaxConns: 25
// - MinConns: 5
// - MaxConnLifetime: 1 hour
// - HealthCheckPeriod: 1 minute

🐛 Troubleshooting

Issue: Memory Not Persisting

Cause: Using in-memory provider

Solution: Use pgvector for persistence

// ❌ Not persistent
v1beta.WithMemoryProvider("memory")

// ✅ Persistent
v1beta.WithMemoryProvider("pgvector")

Issue: Slow Vector Search

Cause: Missing pgvector indexes

Solution: Ensure indexes are created (automatic with pgvector provider)

sql

-- Automatically created by pgvector provider:
CREATE INDEX idx_personal_memory_embedding 
    ON personal_memory USING ivfflat (embedding vector_cosine_ops);

CREATE INDEX idx_knowledge_embedding 
    ON knowledge_base USING ivfflat (embedding vector_cosine_ops);

Issue: Connection Errors

Cause: Invalid connection string or missing environment variables

Solution: Set proper configuration

bash

export AGENTFLOW_MEMORY_CONNECTION="postgresql://user:pass@localhost:5432/db"
export AGENTFLOW_MEMORY_PROVIDER="pgvector"

Issue: Embeddings Not Generated

Cause: Missing embedding provider configuration

Solution: Configure embedding service

toml

[memory.embedding]
provider = "openai"
model = "text-embedding-ada-002"
api_key = "${OPENAI_API_KEY}"  # Uses environment variable

📋 Complete Example

Here's a complete example combining all features:

package main

import (
    "context"
    "log"
    "github.com/agenticgokit/agenticgokit/v1beta"
    "github.com/agenticgokit/agenticgokit/core"
    _ "github.com/agenticgokit/agenticgokit/plugins/memory/pgvector"
)

func main() {
    // Create agent with full RAG capabilities
    agent, err := v1beta.NewBuilder("RAGAssistant").
        WithPreset(v1beta.ResearchAgent).
        WithLLM("openai", "gpt-4").
        WithMemory(
            v1beta.WithMemoryProvider("pgvector"),
            v1beta.WithContextAware(),
            v1beta.WithSessionScoped(),
            v1beta.WithRAG(4000, 0.3, 0.7),
        ).
        Build()
    if err != nil {
        log.Fatal(err)
    }
    
    // Get memory provider
    memProvider := core.GetMemoryProvider("pgvector")
    
    // Ingest knowledge documents
    docs := []core.Document{
        {
            ID:      "doc-1",
            Title:   "Product Documentation",
            Content: "Our product features X, Y, and Z...",
            Source:  "docs/product.md",
            Type:    core.DocumentTypeMarkdown,
            Tags:    []string{"product", "documentation"},
        },
        {
            ID:      "doc-2",
            Title:   "API Reference",
            Content: "API endpoints include /api/v1/users...",
            Source:  "docs/api.md",
            Type:    core.DocumentTypeMarkdown,
            Tags:    []string{"api", "reference"},
        },
    }
    
    ctx := context.Background()
    if err := memProvider.IngestDocuments(ctx, docs); err != nil {
        log.Fatal(err)
    }
    
    // Create session for user
    sessionCtx := memProvider.SetSession(ctx, "user-123")
    
    // Run queries - agent will use RAG to retrieve relevant context
    result, err := agent.Run(sessionCtx, "What are the product features?")
    if err != nil {
        log.Fatal(err)
    }
    
    log.Printf("Response: %s", result.Content)
}

📚 Next Steps

Configuration - Advanced memory configuration
Custom Handlers - Custom memory integration
Tool Integration - Memory-aware tools
Performance - Memory optimization tips

Ready for custom behavior? Continue to Custom Handlers →

Memory and RAG Integration ​

🎯 Overview ​

🚀 Quick Start ​

Basic In-Memory Storage ​

💾 Memory Providers ​

1. In-Memory Provider (Development/Testing) ​

2. PostgreSQL with pgvector (Production) ​

🔍 RAG (Retrieval-Augmented Generation) ​

Basic RAG Configuration ​

RAG Configuration Options ​

📚 Document Ingestion ​

Ingesting Text Documents ​

Ingesting PDF Documents ​

Ingesting Web Pages ​

Batch Ingestion ​

🔄 Session Management ​

🎨 Memory Patterns ​

Pattern 1: Context-Aware Conversations ​

Pattern 2: Session-Scoped Memory ​

Pattern 3: RAG with Both Personal and Knowledge ​

🎯 Best Practices ​

1. Choose the Right Provider ​

2. Optimize Chunk Size ​

3. Configure RAG Weights ​

4. Use Metadata for Better Retrieval ​

5. Connection Pooling (PostgreSQL) ​

🐛 Troubleshooting ​

Issue: Memory Not Persisting ​

Issue: Slow Vector Search ​

Issue: Connection Errors ​

Issue: Embeddings Not Generated ​

📋 Complete Example ​

📚 Next Steps ​