Quick Start Guide¶

Get started with HTM in just 5 minutes! This guide will walk you through building your first HTM-powered application.

Prerequisites

Make sure you've completed the Installation Guide before starting this tutorial.

HTM Quick Start Workflow (diagram)

Step 1: Initialize HTM with HTM.new() and set the robot name
Step 2: Add memories with remember() to store knowledge
Step 3: Recall memories with recall() to search and retrieve
Step 4: Use context with assemble_context() for LLM prompts

HTM Memory System (diagram)

Working Memory (fast): token-limited (128K), in-memory storage, immediate LLM access, O(1) lookups
Long-Term Memory (durable): unlimited storage, PostgreSQL, RAG search (vector + text), permanent storage

Memories are stored in both tiers, evicted from working memory when it is full, and recalled from long-term memory when needed.

Quick example:

htm = HTM.new(robot_name: "My Assistant")
htm.remember("Remember this fact", tags: ["fact"])
memories = htm.recall("fact", timeframe: "today")

Your First HTM Application¶

Let's build a simple coding assistant that remembers project decisions and preferences.

Step 1: Create Your Project¶

Create a new Ruby file:

# my_first_htm_app.rb
require 'htm'

puts "My First HTM Application"
puts "=" * 60

Step 2: Initialize HTM¶

Create an HTM instance for your robot:

# Configure HTM globally (optional - defaults to Ollama for local development)
# HTM uses RubyLLM which supports: :ollama, :openai, :anthropic, :gemini, :azure, :bedrock, :deepseek
HTM.configure do |config|
  config.embedding.provider = :ollama           # or :openai, etc.
  config.embedding.model = 'nomic-embed-text'   # provider-specific model
  config.tag.provider = :ollama
  config.tag.model = 'gemma3:latest'
end

# Initialize HTM with a robot name
htm = HTM.new(
  robot_name: "Code Helper",
  working_memory_size: 128_000    # 128k tokens
)

puts "✓ HTM initialized for '#{htm.robot_name}'"
puts "  Robot ID: #{htm.robot_id}"
puts "  Working Memory: #{htm.working_memory.max_tokens} tokens"

What's happening here?

robot_name: A human-readable name for your AI robot
working_memory_size: Maximum tokens for active context (128k is typical)
Configuration is set globally via the HTM.configure block

Robot Identity

Each HTM instance represents one robot. The robot_id is an integer database ID used to track which robot created each memory.

Step 3: Add Your First Memory¶

Add a project decision to HTM's memory:

puts "\n1. Adding a project decision..."

node_id = htm.remember(
  "We decided to use PostgreSQL for the database " \
  "because it provides excellent time-series optimization and " \
  "native vector search with pgvector.",
  tags: ["database:postgresql", "architecture:decisions"],
  metadata: { category: "architecture", priority: "high" }
)

puts "✓ Decision added to memory (node #{node_id})"

Memory Components:

Content: The actual memory text (first argument)
Tags: Hierarchical tags for categorization (e.g., "database:postgresql")
Metadata: Arbitrary key-value data stored as JSONB

Automatic Embeddings

HTM automatically generates vector embeddings for the memory content in the background. You don't need to handle embeddings yourself!

Step 4: Add More Memories¶

Let's add a few more memories:

puts "\n2. Adding user preferences..."

htm.remember(
  "User prefers using the debug_me gem for debugging instead of puts statements.",
  tags: ["debugging:ruby", "preferences:coding-style"],
  metadata: { category: "preference" }
)

puts "✓ Preference added"

puts "\n3. Adding a code pattern..."

htm.remember(
  "For database queries, use connection pooling with the connection_pool gem " \
  "to handle concurrent requests efficiently.",
  tags: ["database:performance", "ruby:patterns"],
  metadata: { category: "code-pattern" }
)

puts "✓ Code pattern added"

Tags create relationships: use hierarchical tags to build a navigable knowledge graph. Tags such as database:postgresql and database:performance are connected through their shared database prefix.
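To see how a shared prefix connects tags, here is a minimal sketch in plain Ruby. It does not use the htm gem; the helper names tag_levels and group_by_root are hypothetical and only illustrate the idea of splitting tags on ":" and grouping them by their top-level segment.

```ruby
# Plain-Ruby illustration; tag_levels and group_by_root are hypothetical helpers,
# not part of the HTM API.
tags = ["database:postgresql", "database:performance", "debugging:ruby", "ruby:patterns"]

# Split a hierarchical tag into its levels, e.g. "database:postgresql" -> ["database", "postgresql"]
def tag_levels(tag)
  tag.split(":")
end

# Group tags by their top-level segment so related tags end up together.
def group_by_root(tags)
  tags.group_by { |tag| tag_levels(tag).first }
end

groups = group_by_root(tags)
groups.each { |root, members| puts "#{root}: #{members.join(', ')}" }
```

Both database tags land in the same group, which is the relationship the shared prefix expresses.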

Step 5: Look Up a Specific Memory¶

Look up a memory by its node ID:

puts "\n4. Looking up specific memory..."

# Use the node_id returned from remember()
node = HTM::Models::Node.find_by(id: node_id)

if node
  puts "✓ Found memory:"
  puts "  ID: #{node.id}"
  puts "  Content: #{node.content[0..100]}..."
  puts "  Tags: #{node.tags.pluck(:name).join(', ')}"
  puts "  Created: #{node.created_at}"
else
  puts "✗ Memory not found"
end

Step 6: Recall Memories by Topic¶

Use HTM's powerful recall feature to find relevant memories:

puts "\n5. Recalling memories about 'database'..."

memories = htm.recall(
  "database",                      # Topic (first positional argument)
  timeframe: "last week",          # Natural language time filter
  limit: 10,                       # Max results
  strategy: :hybrid,               # Search strategy (vector + full-text)
  raw: true                        # Return full node data
)

puts "✓ Found #{memories.length} relevant memories:"
memories.each_with_index do |mem, idx|
  puts "  #{idx + 1}. #{mem['content'][0..60]}..."
end

Search Strategies:

:vector: Semantic similarity search using embeddings
:fulltext: Keyword-based PostgreSQL full-text search (default)
:hybrid: Combines both for best results (recommended)
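As a rough intuition for how a hybrid strategy can combine the two signals, the sketch below blends a cosine similarity with a crude keyword score. This is not HTM's implementation: the three-dimensional embeddings are made up, keyword_score is a stand-in for PostgreSQL full-text ranking, and the 0.5 weight is an arbitrary assumption.

```ruby
# Toy documents with hand-written "embeddings" (invented for this illustration).
DOCS = [
  { content: "We decided to use PostgreSQL for the database", embedding: [0.9, 0.1, 0.0] },
  { content: "User prefers the debug_me gem for debugging",   embedding: [0.1, 0.9, 0.1] },
  { content: "Use connection pooling for database queries",   embedding: [0.8, 0.2, 0.1] }
].freeze

# Cosine similarity between two equal-length vectors.
def cosine_similarity(a, b)
  dot  = a.zip(b).sum { |x, y| x * y }
  norm = Math.sqrt(a.sum { |x| x * x }) * Math.sqrt(b.sum { |x| x * x })
  norm.zero? ? 0.0 : dot / norm
end

# Fraction of query words that appear in the text (stand-in for full-text ranking).
def keyword_score(query, text)
  words = query.downcase.split
  return 0.0 if words.empty?

  text_words = text.downcase.scan(/\w+/)
  words.count { |word| text_words.include?(word) }.to_f / words.size
end

# Weighted blend of both signals, highest score first.
def hybrid_search(query, query_embedding, docs, weight: 0.5)
  scored = docs.map do |doc|
    score = weight * cosine_similarity(query_embedding, doc[:embedding]) +
            (1 - weight) * keyword_score(query, doc[:content])
    [doc[:content], score]
  end
  scored.sort_by { |_, score| -score }
end

results = hybrid_search("database", [1.0, 0.0, 0.0], DOCS)
results.each { |content, score| puts format("%.3f  %s", score, content) }
```

A document that matches on both meaning and keywords ranks above one that matches on neither, which is the practical benefit of combining the strategies.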

Timeframe Options:

"last week" - Last 7 days
"yesterday" - Previous day
"last 30 days" - Last month
"this month" - Current calendar month
Date ranges: 7.days.ago..Time.now (7.days.ago requires ActiveSupport; in plain Ruby use (Time.now - 7 * 24 * 3600)..Time.now)
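The timeframe options above all resolve to a span of time. The following sketch shows one way such strings could map to Ruby ranges using only the standard library. The timeframe_to_range helper is hypothetical and is not how HTM parses timeframes internally; it only makes the table concrete.

```ruby
require "date"

SECONDS_PER_DAY = 24 * 60 * 60

# Hypothetical helper for illustration only; HTM handles timeframe strings itself.
def timeframe_to_range(timeframe, now: Time.now)
  today = now.to_date
  case timeframe
  when Range          then timeframe                                  # already a range
  when "last week"    then (now - 7 * SECONDS_PER_DAY)..now           # last 7 days
  when "last 30 days" then (now - 30 * SECONDS_PER_DAY)..now
  when "yesterday"    then (today - 1).to_time...today.to_time        # previous calendar day
  when "this month"   then Time.new(now.year, now.month, 1)..now      # current calendar month
  else
    raise ArgumentError, "unsupported timeframe: #{timeframe.inspect}"
  end
end

range = timeframe_to_range("last week")
puts "last week covers #{range.begin} to #{range.end}"
```

Passing an explicit range, such as (Time.now - 3600)..Time.now, goes through unchanged.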

Step 7: Create Context for Your LLM¶

Generate a context string optimized for LLM consumption:

puts "\n6. Creating context for LLM..."

context = htm.working_memory.assemble_context(
  strategy: :balanced,             # Balance frequency and recency
  max_tokens: 50_000               # Optional token limit
)

puts "✓ Context created: #{context.length} characters"
puts "\nContext preview:"
puts context[0..300]
puts "..."

Context Strategies:

:recent: Most recently accessed memories first (LRU)
:frequent: Most frequently accessed memories first (LFU)
:balanced: Combines frequency × recency (recommended)

This context can be directly injected into your LLM prompt:

# Example: Using context with your LLM
prompt = <<~PROMPT
  You are a helpful coding assistant.

  Here's what you remember from past conversations:
  #{context}

  User: What database did we decide to use for the project?
PROMPT

# response = your_llm.chat(prompt)
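To make the three context strategies listed earlier more concrete, here is a plain-Ruby sketch of how entries might be ordered and packed into a token budget. The Entry struct, the order_entries and assemble helpers, and the balanced formula (access count divided by one plus the age in hours) are assumptions chosen for illustration; HTM's actual scoring may differ.

```ruby
# Hypothetical in-memory entry; not an HTM class.
Entry = Struct.new(:content, :last_accessed, :access_count, :tokens, keyword_init: true)

now = Time.now
entries = [
  Entry.new(content: "A", last_accessed: now - 60,     access_count: 1,  tokens: 10),
  Entry.new(content: "B", last_accessed: now - 7_200,  access_count: 10, tokens: 10),
  Entry.new(content: "C", last_accessed: now - 86_400, access_count: 12, tokens: 10)
]

# Order entries according to the chosen strategy, best candidate first.
def order_entries(entries, strategy, now: Time.now)
  case strategy
  when :recent   then entries.sort_by { |e| -e.last_accessed.to_f }
  when :frequent then entries.sort_by { |e| -e.access_count }
  when :balanced
    entries.sort_by do |e|
      age_hours = (now - e.last_accessed) / 3600.0
      -(e.access_count / (1.0 + age_hours))   # frequency x recency (assumed formula)
    end
  else
    raise ArgumentError, "unknown strategy: #{strategy.inspect}"
  end
end

# Take entries in strategy order, skipping any that would exceed the budget.
def assemble(entries, strategy, max_tokens:, now: Time.now)
  used = 0
  picked = []
  order_entries(entries, strategy, now: now).each do |e|
    next if used + e.tokens > max_tokens

    picked << e.content
    used += e.tokens
  end
  picked.join("\n\n")
end

%i[recent frequent balanced].each do |strategy|
  puts "#{strategy}: #{assemble(entries, strategy, max_tokens: 20, now: now).inspect}"
end
```

With a budget that fits only two of the three entries, each strategy keeps a different pair, which is why the choice of strategy matters once working memory is larger than the token limit.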

Step 8: Check Memory Statistics¶

View statistics about your memory usage:

puts "\n7. Memory Statistics:"

# Working memory stats
wm = htm.working_memory
puts "  Working memory:"
puts "    Nodes: #{wm.node_count}"
puts "    Tokens: #{wm.token_count} / #{wm.max_tokens}"
puts "    Utilization: #{wm.utilization_percentage}%"

# Long-term memory stats via models
puts "  Long-term memory:"
puts "    Total nodes: #{HTM::Models::Node.count}"
puts "    Total tags: #{HTM::Models::Tag.count}"
puts "    Active robots: #{HTM::Models::Robot.count}"
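The working memory figures above follow from a simple token budget. The sketch below is a toy model, not HTM's working memory: the TinyWorkingMemory class, its oldest-first eviction, and the four-characters-per-token estimate are all assumptions made to show how node count, token count, and utilization relate.

```ruby
# Toy token-budgeted store; hypothetical, not part of the htm gem.
class TinyWorkingMemory
  attr_reader :max_tokens

  def initialize(max_tokens:)
    @max_tokens = max_tokens
    @nodes = {} # id => token count, oldest first (Ruby hashes keep insertion order)
  end

  # Adds a node, evicting the oldest nodes until it fits. Returns the evicted ids.
  def add(id, content)
    tokens = estimate_tokens(content)
    raise ArgumentError, "content exceeds working memory size" if tokens > max_tokens

    evicted = []
    while token_count + tokens > max_tokens
      oldest_id = @nodes.keys.first
      @nodes.delete(oldest_id)
      evicted << oldest_id
    end
    @nodes[id] = tokens
    evicted
  end

  def node_count
    @nodes.size
  end

  def token_count
    @nodes.values.sum
  end

  def utilization_percentage
    (token_count * 100.0 / max_tokens).round(2)
  end

  private

  # Rough heuristic: about four characters per token (an assumption).
  def estimate_tokens(text)
    (text.length / 4.0).ceil
  end
end

wm = TinyWorkingMemory.new(max_tokens: 10)
wm.add(1, "a" * 20)
wm.add(2, "b" * 20)
evicted = wm.add(3, "c" * 8)
puts "Nodes: #{wm.node_count}, tokens: #{wm.token_count}/#{wm.max_tokens}, evicted: #{evicted.inspect}"
puts "Utilization: #{wm.utilization_percentage}%"
```

In this model eviction only removes a node from the in-memory store; a durable copy elsewhere would be unaffected.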

Complete Example¶

Here's the complete script:

#!/usr/bin/env ruby
# my_first_htm_app.rb
require 'htm'

puts "My First HTM Application"
puts "=" * 60

# Step 1: Configure HTM (optional; defaults to Ollama), then initialize it
# Supports: :ollama, :openai, :anthropic, :gemini, :azure, :bedrock, :deepseek
HTM.configure do |config|
  config.embedding.provider = :ollama
  config.embedding.model = 'nomic-embed-text'
  config.tag.provider = :ollama
  config.tag.model = 'gemma3:latest'
end

htm = HTM.new(
  robot_name: "Code Helper",
  working_memory_size: 128_000
)

puts "✓ HTM initialized for '#{htm.robot_name}'"

# Step 2: Add memories
htm.remember(
  "We decided to use PostgreSQL for the database.",
  tags: ["database:postgresql", "architecture:decisions"],
  metadata: { priority: "high" }
)

htm.remember(
  "User prefers using the debug_me gem for debugging.",
  tags: ["debugging:ruby", "preferences"],
  metadata: { category: "preference" }
)

puts "✓ Memories added"

# Step 3: Recall memories
memories = htm.recall(
  "database",
  timeframe: "last week",
  strategy: :hybrid
)

puts "✓ Found #{memories.length} memories about 'database'"

# Step 4: Create context
context = htm.working_memory.assemble_context(strategy: :balanced)
puts "✓ Context created: #{context.length} characters"

# Step 5: View statistics
puts "✓ Total nodes: #{HTM::Models::Node.count}"

puts "\n" + "=" * 60
puts "Success! Your first HTM application is working."

Run it:

ruby my_first_htm_app.rb

Multi-Robot Example¶

HTM's "hive mind" feature allows multiple robots to share memory. Here's how:

require 'htm'

# Create two different robots
robot_a = HTM.new(robot_name: "Code Assistant")
robot_b = HTM.new(robot_name: "Documentation Writer")

# Robot A adds a memory
robot_a.remember(
  "The API documentation is stored in the docs/ directory.",
  tags: ["docs:api", "project:structure"]
)

puts "Robot A added memory"

# Robot B can access the same memory!
memories = robot_b.recall(
  "documentation",
  timeframe: "last week",
  strategy: :hybrid
)

puts "Robot B found #{memories.length} memories"
# Robot B sees Robot A's memory!

# Count matching memories per robot.
# A hash condition such as content: 'documentation' is an exact match and would
# not find the sentence stored above, so match on a substring instead.
HTM::Models::RobotNode.joins(:node)
  .where("nodes.content ILIKE ?", "%documentation%")
  .group(:robot_id)
  .count
  .each do |robot_id, count|
    robot = HTM::Models::Robot.find(robot_id)
    puts "  #{robot.name}: #{count} memories"
  end

Use cases for multi-robot:

Collaborative coding teams of AI agents
Customer service handoffs between agents
Research assistants building shared knowledge
A learning AI taught by multiple instructors
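The sharing behaviour in this section can be pictured as several robots writing to one store that records who added each memory. The sketch below is a plain-Ruby toy with hypothetical SharedStore and ToyRobot classes; it does not use the htm gem or a database, and its substring search stands in for real recall.

```ruby
# Hypothetical shared store; every robot reads and writes the same list.
class SharedStore
  def initialize
    @memories = []
  end

  def add(robot_name, content)
    @memories << { robot: robot_name, content: content }
  end

  # Case-insensitive substring search (a stand-in for real recall).
  def search(term)
    @memories.select { |m| m[:content].downcase.include?(term.downcase) }
  end

  # How many matching memories each robot contributed.
  def counts_by_robot(term)
    search(term).group_by { |m| m[:robot] }.transform_values(&:size)
  end
end

# Hypothetical robot that delegates to the shared store.
class ToyRobot
  attr_reader :name

  def initialize(name, store)
    @name = name
    @store = store
  end

  def remember(content)
    @store.add(name, content)
  end

  def recall(term)
    @store.search(term).map { |m| m[:content] }
  end
end

store   = SharedStore.new
robot_a = ToyRobot.new("Code Assistant", store)
robot_b = ToyRobot.new("Documentation Writer", store)

robot_a.remember("The API documentation is stored in the docs/ directory.")
found = robot_b.recall("documentation")
puts "Robot B found #{found.length} memories"
puts store.counts_by_robot("documentation").inspect
```

Because attribution is stored alongside the content, a handoff between robots keeps both the knowledge and its origin.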

Working with Relationships¶

Build a knowledge graph using hierarchical tags:

# Add parent concept
htm.remember(
  "Databases store and organize data persistently.",
  tags: ["knowledge:databases"]
)

# Add child concept with shared tag hierarchy
htm.remember(
  "PostgreSQL is a powerful open-source relational database.",
  tags: ["knowledge:databases:postgresql", "tech:database"]
)

# Add another related concept
htm.remember(
  "PostgreSQL provides robust relational database capabilities.",
  tags: ["knowledge:databases:postgresql:features", "tech:database"]
)

# View tag hierarchy
puts HTM::Models::Tag.tree_string
# knowledge
#   └── databases
#       └── postgresql
#           └── features

# Find memories tagged exactly 'knowledge:databases'
# (find_by matches the full tag name, not a prefix)
nodes = HTM::Models::Tag.find_by(name: 'knowledge:databases')&.nodes
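Collecting everything under a branch of the tag hierarchy means comparing tags by prefix, segment by segment, rather than by exact name. The plain-Ruby sketch below shows that matching rule. The under_prefix? and contents_under helpers and the knowledge:databases-legacy tag are hypothetical and exist only to illustrate why the ":" boundary matters.

```ruby
# True when the tag is the prefix itself or sits below it in the hierarchy.
# Checking for "prefix:" avoids matching unrelated tags that merely share leading characters.
def under_prefix?(tag, prefix)
  tag == prefix || tag.start_with?("#{prefix}:")
end

# Sample data: content => tags (the last entry is invented for the example).
tagged = {
  "Databases store and organize data persistently." => ["knowledge:databases"],
  "PostgreSQL is a powerful open-source relational database." => ["knowledge:databases:postgresql", "tech:database"],
  "PostgreSQL provides robust relational database capabilities." => ["knowledge:databases:postgresql:features", "tech:database"],
  "Archived notes." => ["knowledge:databases-legacy"]
}

# Return the contents that carry at least one tag under the given prefix.
def contents_under(tagged, prefix)
  tagged.select { |_content, tags| tags.any? { |tag| under_prefix?(tag, prefix) } }.keys
end

matches = contents_under(tagged, "knowledge:databases")
matches.each { |content| puts content }
```

The same rule could be expressed in SQL with a LIKE pattern on the tag name, at the cost of handling the ":" boundary in the pattern.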

Forget (Explicit Deletion)¶

HTM follows a "never forget" philosophy with soft delete by default:

# Soft delete (recoverable) - default behavior
node_id = htm.remember("Temporary note")
htm.forget(node_id)                    # Soft delete
htm.restore(node_id)                   # Restore it!

# Permanent delete requires confirmation
htm.forget(node_id, soft: false, confirm: :confirmed)

puts "✓ Memory permanently deleted"

Soft Delete by Default

The forget() method performs a soft delete by default (it sets a deleted_at timestamp), so the memory can be restored with restore(). Permanent deletion requires soft: false, confirm: :confirmed. Working memory evictions do not delete anything: an evicted memory remains in long-term storage.
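The soft-delete behaviour can be modelled in a few lines of plain Ruby. The SoftDeleteStore class below is a hypothetical in-memory stand-in, not HTM's implementation; it mirrors the keyword arguments shown in this section (soft: and confirm:) to show why a soft-deleted record is recoverable while a confirmed hard delete is not.

```ruby
# Hypothetical in-memory model of soft delete; not part of the htm gem.
class SoftDeleteStore
  Record = Struct.new(:content, :deleted_at)

  def initialize
    @records = {}
    @next_id = 1
  end

  def remember(content)
    id = @next_id
    @next_id += 1
    @records[id] = Record.new(content, nil)
    id
  end

  # Soft delete stamps deleted_at; hard delete removes the record and needs confirmation.
  def forget(id, soft: true, confirm: nil)
    record = @records.fetch(id)
    if soft
      record.deleted_at = Time.now
    else
      raise ArgumentError, "permanent deletion requires confirm: :confirmed" unless confirm == :confirmed

      @records.delete(id)
    end
    true
  end

  # Clearing the timestamp makes the record visible again.
  def restore(id)
    @records.fetch(id).deleted_at = nil
    true
  end

  # Ids of records that are not soft-deleted.
  def visible_ids
    @records.reject { |_id, record| record.deleted_at }.keys
  end

  def stored?(id)
    @records.key?(id)
  end
end

store = SoftDeleteStore.new
note_id = store.remember("Temporary note")
store.forget(note_id)
puts "Visible after soft delete: #{store.visible_ids.inspect}, still stored: #{store.stored?(note_id)}"
store.restore(note_id)
puts "Visible after restore: #{store.visible_ids.inspect}"
```

Requiring an explicit confirmation value for the hard delete makes an accidental permanent removal much less likely than a bare boolean flag would.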

Next Steps¶

Congratulations! You've learned the basics of HTM. Here's what to explore next:

Explore Advanced Features¶

User Guide: Deep dive into all HTM features
API Reference: Complete API documentation
Architecture Guide: Understand HTM's internals

Build Real Applications¶

Try building:

Personal AI Assistant: Remember user preferences and habits
Code Review Bot: Track coding patterns and past decisions
Research Assistant: Build a knowledge graph from documents
Customer Service Bot: Maintain conversation history

Experiment with Different Configurations¶

# Try different memory sizes
htm = HTM.new(
  robot_name: "Large Memory Bot",
  working_memory_size: 256_000  # 256k tokens
)

# Try different providers or models
HTM.configure do |config|
  # Use OpenAI for production
  config.embedding.provider = :openai
  config.embedding.model = 'text-embedding-3-small'

  # Or use Ollama locally with a different model
  # config.embedding.provider = :ollama
  # config.embedding.model = 'mxbai-embed-large'
end

# Try different recall strategies
memories = htm.recall(
  "important decisions",
  timeframe: "last month",
  strategy: :vector  # Pure semantic search
)

Performance Optimization¶

For production applications:

Use connection pooling (built-in)
Tune working memory size based on your LLM's context window
Adjust importance scores to prioritize critical memories
Use appropriate timeframes to limit search scope
Monitor memory statistics regularly

Join the Community¶

GitHub: https://github.com/madbomber/htm
Issues: Report bugs or request features
Discussions: Share your HTM projects

Common Patterns¶

Pattern 1: Conversation Memory¶

# Store user messages
htm.remember(
  "User: How do I optimize database queries?",
  tags: ["conversation:question"],
  metadata: { role: "user", timestamp: Time.now.to_i }
)

# Store assistant responses
htm.remember(
  "Assistant: Use indexes and connection pooling.",
  tags: ["conversation:answer"],
  metadata: { role: "assistant", timestamp: Time.now.to_i }
)

Pattern 2: Learning from Code¶

# Extract patterns from code reviews
htm.remember(
  "Always validate user input before database queries.",
  tags: ["security:validation", "patterns:best-practice"],
  metadata: { source: "code-review" }
)

Pattern 3: Decision Tracking¶

# Document architectural decisions
htm.remember(
  "Decision: Use microservices architecture. " \
  "Reasoning: Better scalability and independent deployment.",
  tags: ["adr", "architecture:microservices"],
  metadata: { category: "architecture", priority: "critical" }
)

Troubleshooting Quick Start¶

Issue: "Connection refused" error (Ollama)¶

Solution: Make sure Ollama is running:

curl http://localhost:11434/api/version
# If this fails, start Ollama with: ollama serve

Issue: "API key invalid" error (cloud providers)¶

Solution: Verify your API key is set:

echo $OPENAI_API_KEY  # or ANTHROPIC_API_KEY, GEMINI_API_KEY, etc.

Issue: "Database connection failed"¶

Solution: Verify your HTM_DATABASE__URL is set:

echo $HTM_DATABASE__URL
# Should show your connection string
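The environment checks in this troubleshooting section can also be run from Ruby without printing secret values to the terminal. The sketch below uses a hypothetical env_status helper and checks the variable names mentioned in this guide; adjust the list to the providers you actually use.

```ruby
# Variables mentioned in this guide; trim the list to what your setup needs.
VARIABLES_TO_CHECK = %w[HTM_DATABASE__URL OPENAI_API_KEY ANTHROPIC_API_KEY GEMINI_API_KEY].freeze

# Returns { "NAME" => true/false } without exposing the values themselves.
# Accepts any hash-like object so it can be exercised without touching ENV.
def env_status(names, env = ENV)
  names.to_h { |name| [name, !env[name].to_s.strip.empty?] }
end

env_status(VARIABLES_TO_CHECK).each do |name, present|
  puts "#{present ? 'set    ' : 'missing'}  #{name}"
end
```

A variable that is set but blank is reported as missing, which catches a common copy-and-paste mistake in shell profiles.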

Issue: Embeddings taking too long¶

Solution for Ollama: Check that the model is downloaded:

ollama list | grep nomic-embed-text
# Should show nomic-embed-text model

Solution for cloud providers: Check your internet connection and API status.

Issue: Memory not found during recall¶

Solution: Check your timeframe. If you just added a memory, use a recent timeframe:

# Instead of "last week", use:
memories = htm.recall(
  "your topic",
  timeframe: (Time.now - 3600)..Time.now  # Last hour
)

Additional Resources¶

Installation Guide: Complete setup instructions
User Guide: Comprehensive feature documentation
API Reference: Detailed API documentation
Examples: Real-world code examples

Happy coding with HTM! 🚀