
Oreag

Oreag is an Experience-Augmented Generation (EAG) system that goes beyond traditional RAG by feeding contextual user-experience signals into the retrieval and generation pipeline. Built on LangChain, it produces personalized, context-aware responses that adapt to each user's history, preferences, and interaction patterns.

EAG vs RAG — The Difference
🗃️ Standard RAG
Retrieves documents based on query similarity only
No user context or history considered
Same response for everyone with the same query
Stateless — no memory between sessions
✨ Oreag (EAG)
Retrieves with query + user experience context
Adapts to individual history and preferences
Personalized responses per user profile
Persistent context across conversations
Project Overview

Oreag extends the standard RAG paradigm by injecting user experience signals — past queries, preferred topics, interaction depth, and session memory — into both the retrieval scoring and the generation prompt. This creates a self-improving loop where the system becomes more accurate and relevant with every interaction, without fine-tuning the base model.

Key Features
🎯

Experience-Aware Retrieval

Retrieval scoring is weighted by user experience signals — surfacing documents that are not just semantically similar but contextually relevant to the user.

🧠

Persistent User Memory

User profiles store interaction history, topic preferences, and engagement signals — feeding into every subsequent query for adaptive personalization.
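Such a profile could be sketched as a small data structure. The class and method names below are illustrative, not the project's actual API:

```python
from dataclasses import dataclass, field
from collections import Counter

@dataclass
class UserProfile:
    """Hypothetical per-user experience profile (illustrative schema)."""
    user_id: str
    topic_counts: Counter = field(default_factory=Counter)  # topic -> interaction count
    history: list = field(default_factory=list)             # past queries, oldest first

    def record_interaction(self, query: str, topics: list[str]) -> None:
        """Log a query and bump the counts for the topics it touched."""
        self.history.append(query)
        self.topic_counts.update(topics)

    def preference(self, topic: str) -> float:
        """Share of all topic hits that belong to this topic (0.0-1.0)."""
        total = sum(self.topic_counts.values())
        return self.topic_counts[topic] / total if total else 0.0

profile = UserProfile(user_id="u42")
profile.record_interaction("What is RAG?", ["rag"])
profile.record_interaction("Vector DB indexing tips", ["vector-db", "rag"])
print(profile.preference("rag"))  # 2 of 3 topic hits
```

The `preference` scores are what later feeds into retrieval weighting.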

🔗

LangChain Pipeline

Built on LangChain for modular, composable retrieval chains — enabling easy swapping of retrievers, LLMs, and memory backends.
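The swap-ability this buys can be shown with a plain-Python structural interface; this is a sketch of the composability idea, not LangChain's actual classes:

```python
from typing import Protocol

class Retriever(Protocol):
    """Anything with a .retrieve() method satisfies this interface."""
    def retrieve(self, query: str) -> list[str]: ...

class KeywordRetriever:
    """Naive keyword matcher standing in for a real vector-search retriever."""
    def __init__(self, docs: list[str]):
        self.docs = docs

    def retrieve(self, query: str) -> list[str]:
        words = query.lower().split()
        return [d for d in self.docs if any(w in d.lower() for w in words)]

def answer(query: str, retriever: Retriever) -> str:
    # The pipeline only depends on the interface, so retrievers
    # (or LLMs, or memory backends) can be swapped without other changes.
    context = " | ".join(retriever.retrieve(query))
    return f"[context: {context}] -> answer for '{query}'"

docs = ["RAG retrieves documents", "EAG adds experience signals"]
print(answer("experience signals", KeywordRetriever(docs)))
```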

📦

Vector Store Integration

Connects to vector databases for semantic document search, augmented with metadata filters driven by the user's experience profile.
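A profile-driven metadata filter might look like the following; the `topic` and `level` fields are an assumed document schema, not a specific vector-DB API:

```python
def filter_by_profile(docs: list[dict], preferred_topics: set[str],
                      max_level: int) -> list[dict]:
    """Keep documents whose metadata matches the user's experience profile.

    Documents are assumed to carry a `topic` tag and a difficulty `level`;
    both field names are illustrative.
    """
    return [d for d in docs
            if d["topic"] in preferred_topics and d["level"] <= max_level]

corpus = [
    {"id": 1, "topic": "rag",    "level": 1},
    {"id": 2, "topic": "rag",    "level": 3},  # too advanced for this user
    {"id": 3, "topic": "agents", "level": 1},  # outside preferred topics
]
hits = filter_by_profile(corpus, preferred_topics={"rag"}, max_level=2)
print([d["id"] for d in hits])  # -> [1]
```

In practice this filter would be pushed down into the vector store's query, so only matching documents are even scored for similarity.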

⚙️

LLMOps Ready

Built with production observability in mind — prompt versioning, response logging, and evaluation hooks for continuous quality monitoring.
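One way to make prompt versions traceable is to log each interaction as a structured record keyed by a hash of the template; the schema below is an assumption for illustration:

```python
import hashlib
import time

def log_interaction(prompt_template: str, version: str,
                    rendered_prompt: str, response: str) -> dict:
    """Build one structured log record for later evaluation (illustrative schema)."""
    return {
        "ts": time.time(),
        "prompt_version": version,
        # hashing the template makes silent drift between versions detectable
        "template_sha": hashlib.sha256(prompt_template.encode()).hexdigest()[:12],
        "prompt": rendered_prompt,
        "response": response,
    }

record = log_interaction("Answer as {level}: {q}", "v2",
                         "Answer as expert: What is EAG?", "EAG is ...")
print(record["prompt_version"], record["template_sha"])
```

Records like this can be shipped to any log store and replayed through evaluation hooks to monitor quality per prompt version.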

🔄

Self-Improving Loop

Each interaction refines the user's experience graph — making retrieval more accurate and generation more relevant over time without retraining.

EAG Pipeline
1

Query + Context Encoding

User query is encoded into a semantic vector. The user's experience profile (history, preferences, recent topics) is encoded as a context signal alongside it.
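Encoding "query plus context" could be as simple as mixing the two embeddings; the blend function and its `beta` weight are illustrative assumptions:

```python
def blend(query_vec: list[float], profile_vec: list[float],
          beta: float = 0.25) -> list[float]:
    """Mix the query embedding with a user-context embedding.

    `beta` is the assumed weight of the experience signal;
    beta = 0 reduces this to plain RAG encoding.
    """
    return [(1 - beta) * q + beta * p for q, p in zip(query_vec, profile_vec)]

q = [1.0, 0.0]        # toy query embedding
p = [0.0, 1.0]        # toy profile embedding
print(blend(q, p))    # -> [0.75, 0.25]
```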

2

Experience-Weighted Retrieval

Vector similarity search is re-ranked using experience weights — documents aligned with the user's expertise level and topic history score higher.
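This re-ranking step can be sketched as a weighted sum of similarity and an experience weight; `alpha` and the per-topic weights are assumed knobs, not documented parameters:

```python
import math

def cosine(a: list[float], b: list[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

def rerank(query_vec, docs, topic_weights, alpha=0.7):
    """Score = alpha * similarity + (1 - alpha) * experience weight.

    `topic_weights` maps a doc's topic to how strongly the user's
    history favours it (0-1); `alpha` is an assumed mixing knob.
    """
    scored = []
    for doc in docs:
        sim = cosine(query_vec, doc["vec"])
        exp = topic_weights.get(doc["topic"], 0.0)
        scored.append((alpha * sim + (1 - alpha) * exp, doc["id"]))
    return [doc_id for _, doc_id in sorted(scored, reverse=True)]

docs = [
    {"id": "a", "vec": [1.0, 0.0], "topic": "agents"},
    {"id": "b", "vec": [0.9, 0.1], "topic": "rag"},
]
# a user steeped in RAG pulls doc "b" above the slightly closer "a"
print(rerank([1.0, 0.0], docs, {"rag": 1.0}))  # -> ['b', 'a']
```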

3

Context-Augmented Prompt Construction

Retrieved documents are injected into a prompt template that also includes the user's experience context — guiding the LLM to generate personalized responses.
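A minimal sketch of such a template, with the user-context fields (`level`, `topics`) as assumed profile attributes:

```python
def build_prompt(docs: list[str], query: str,
                 level: str, topics: list[str]) -> str:
    """Fill an illustrative prompt template with sources plus user context."""
    context = "\n".join(f"- {d}" for d in docs)
    return (
        f"You are answering a user at {level} level, "
        f"interested in {', '.join(topics)}.\n"
        f"Use only these sources:\n{context}\n"
        f"Question: {query}"
    )

prompt = build_prompt(["EAG re-ranks by experience."],
                      "How does EAG rank docs?",
                      level="intermediate", topics=["rag"])
print(prompt)
```

The experience context lines are what steer the LLM toward the user's knowledge level; without them the template degrades to a standard RAG prompt.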

4

LLM Generation

The LangChain pipeline passes the augmented prompt to the LLM, which generates a response calibrated to the user's knowledge level and interests.

5

Experience Profile Update

Post-generation, the interaction is logged and the user's profile is updated — feeding back into future retrieval to continuously improve relevance.
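The feedback step could update topic weights with decay, so recent interactions count more than old ones; the decay scheme and the `engagement` signal are illustrative assumptions:

```python
def update_weights(weights: dict, topics: list[str],
                   engagement: float, decay: float = 0.9) -> dict:
    """Decay all existing topic weights, then boost the topics just touched.

    `engagement` (0-1) could come from dwell time or a thumbs-up;
    the names and the decay scheme are illustrative.
    """
    new = {t: w * decay for t, w in weights.items()}
    for t in topics:
        new[t] = new.get(t, 0.0) + engagement
    return new

w = {"rag": 1.0}
w = update_weights(w, ["vector-db"], engagement=0.8)
print(w)  # -> {'rag': 0.9, 'vector-db': 0.8}
```

Because the update only touches a per-user weight table, relevance improves over time without retraining or fine-tuning the base model, as the loop described above intends.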

Tech Stack
Python
LangChain
RAG
EAG
LLMOps
Vector Database
OpenAI / Ollama
Semantic Search
User Experience Graph
Persistent Memory
Prompt Engineering