About deep-research-agent

Multi-agent autonomous research system using LangGraph and LangChain. Generates citation-backed reports with credibility scoring and web search

t

Published by

tarun7r

Visit View Profile

README.md

View on GitHub

Deep Research Agent

A production-ready multi-agent autonomous research system built with LangGraph and LangChain. Four specialized agents collaborate to conduct comprehensive research on any topic, generating detailed citation-backed reports with credibility scoring and quality metrics.

Supports: Local models (Ollama, llama.cpp) and Cloud APIs (Google Gemini, OpenAI)

Demo

https://github.com/user-attachments/assets/df8404c6-7423-4a49-864a-bd4d59885c1b

Watch the full demo video to see the Deep Research Agent in action, showcasing the multi-agent workflow, real-time progress updates, and comprehensive report generation.

Features

Core Capabilities

Feature	Description
Multi-Agent Architecture	Four specialized autonomous agents orchestrated by LangGraph's StateGraph
Autonomous Research	Search agent dynamically decides queries, sources, and extraction depth
Credibility Scoring	Automatic source evaluation (0-100) based on domain authority
Quality Validation	Section-level validation with retry logic and exponential backoff
Multi-Format Export	Reports in Markdown, HTML, and plain text
LLM Usage Tracking	Real-time monitoring of API calls, tokens, and costs
Research Caching	7-day TTL file-based caching with MD5 topic hashing
Web Interface	Interactive Chainlit UI with real-time progress

Production-Ready Features

Feature	Description
Circuit Breaker	Automatic failure detection and recovery for external services
Connection Pooling	HTTP/2 with persistent connections via httpx
Checkpointing	Workflow state persistence for crash recovery
Typed Exceptions	Domain-specific error handling for better debugging
Dependency Injection	Testable agent architecture with injectable LLMs
Search Provider Abstraction	Extensible search backend (DuckDuckGo, with easy addition of others)

Architecture

High-Level Flow

Deep Research Agent Flow Diagram

Agent Responsibilities

ResearchPlanner

Analyzes research topics and generates 3-5 SMART objectives
Creates targeted search queries covering different aspects
Designs report outline with up to 8 sections
Uses structured JSON output for reliability

ResearchSearcher (Autonomous Agent)

LangChain-powered autonomous agent using create_agent()
Dynamically decides which queries to execute
Uses web_search and extract_webpage_content tools
All sources scored for credibility and filtered (default threshold: 40)
Circuit breaker protection against service failures

ResearchSynthesizer

Analyzes aggregated results with credibility awareness
Prioritizes HIGH-credibility sources (score ≥70)
Resolves contradictions using credibility hierarchy
Progressive truncation handles token limits

ReportWriter

Generates structured sections with academic tone
Adds proper citations (APA, MLA, Chicago, IEEE)
Validates section quality with retry on failure
Compiles final markdown with references

Installation

Prerequisites

Python 3.11+
pip or uv package manager
One of:
- Ollama (local models)
- llama.cpp (local models, maximum performance)
- Google Gemini API (cloud)
- OpenAI API (cloud)

Quick Start

# Clone the repository
git clone https://github.com/tarun7r/deep-research-agent.git
cd deep-research-agent

# Create virtual environment
python -m venv .venv
source .venv/bin/activate

# Install dependencies
pip install -r requirements.txt

# Run
python main.py

Using Ollama (Recommended for Local)

# Install Ollama
curl -fsSL https://ollama.com/install.sh | sh

# Pull a model
ollama pull qwen2.5:7b

# Configure .env
MODEL_PROVIDER=ollama
MODEL_NAME=qwen2.5:7b
SUMMARIZATION_MODEL=qwen2.5:7b

Using llama.cpp (Maximum Performance)

# Download GGUF model
huggingface-cli download Qwen/Qwen2.5-7B-Instruct-GGUF \
  qwen2.5-7b-instruct-q4_k_m.gguf --local-dir ./models

# Start server with tool calling
./llama-server -m ./models/qwen2.5-7b-instruct-q4_k_m.gguf \
  --host 0.0.0.0 --port 8080 -ngl 35 --ctx-size 4096 --jinja

# Configure .env
MODEL_PROVIDER=llamacpp
MODEL_NAME=qwen2.5-7b-instruct-q4_k_m
LLAMACPP_BASE_URL=http://localhost:8080

Using Cloud APIs

# Gemini
MODEL_PROVIDER=gemini
GEMINI_API_KEY=your_api_key_here
MODEL_NAME=gemini-2.5-flash

# OpenAI
MODEL_PROVIDER=openai
OPENAI_API_KEY=your_api_key_here
MODEL_NAME=gpt-4o-mini

Usage

Command Line

# Interactive mode
python main.py

# Direct topic
python main.py "Impact of quantum computing on cryptography"

Web Interface

chainlit run app.py --host 127.0.0.1 --port 8000

Features:

Real-time progress with stage indicators
Quality metrics and LLM usage statistics
Multiple format downloads (MD, HTML, TXT)
Research history tracking

Programmatic API

import asyncio
from src.graph import run_research

async def research():
    # Basic usage
    state = await run_research(
        topic="Your research topic",
        verbose=True,
        use_cache=True
    )

    # Access results
    print(state["final_report"])
    print(f"Sources: {len(state['search_results'])}")
    print(f"Findings: {len(state['key_findings'])}")
    print(f"Tokens: {state['total_input_tokens'] + state['total_output_tokens']:,}")

asyncio.run(research())

With Persistence (Crash Recovery)

from src.graph import run_research_with_persistence, resume_research

# Run with SQLite persistence
state = await run_research_with_persistence(
    topic="Your topic",
    thread_id="my-research-001"
)

# Resume interrupted workflow
state = await resume_research(thread_id="my-research-001")

Configuration

Environment Variables

# =============================================================================
# MODEL PROVIDER (required)
# =============================================================================
MODEL_PROVIDER=gemini              # Options: ollama, llamacpp, gemini, openai

# =============================================================================
# PROVIDER-SPECIFIC SETTINGS
# =============================================================================

# Ollama
MODEL_NAME=qwen2.5:7b
SUMMARIZATION_MODEL=qwen2.5:7b
OLLAMA_BASE_URL=http://localhost:11434

# llama.cpp
MODEL_NAME=qwen2.5-7b-instruct-q4_k_m
LLAMACPP_BASE_URL=http://localhost:8080

# Gemini
GEMINI_API_KEY=your_api_key_here
MODEL_NAME=gemini-2.5-flash
SUMMARIZATION_MODEL=gemini-2.5-flash

# OpenAI
OPENAI_API_KEY=your_api_key_here
OPENAI_BASE_URL=https://api.openai.com  # Optional
MODEL_NAME=gpt-4o-mini
SUMMARIZATION_MODEL=gpt-4o-mini

# =============================================================================
# SEARCH SETTINGS (optional)
# =============================================================================
MAX_SEARCH_QUERIES=3               # Number of search queries
MAX_SEARCH_RESULTS_PER_QUERY=3     # Results per query
MIN_CREDIBILITY_SCORE=40           # Filter threshold (0-100)

# =============================================================================
# REPORT SETTINGS (optional)
# =============================================================================
MAX_REPORT_SECTIONS=8              # Maximum sections in report
CITATION_STYLE=apa                 # Options: apa, mla, chicago, ieee

Model Provider Comparison

Provider	Cost	Privacy	Speed	Setup
Ollama	Free	Local	Fast	Easy
llama.cpp	Free	Local	Fastest	Manual
Gemini	Free tier	Cloud	Fast	API key
OpenAI	Pay-per-use	Cloud	Fast	API key

Project Structure

deep-research-agent/
├── src/
│   ├── __init__.py           # Package initialization
│   ├── config.py             # Configuration management (Pydantic)
│   ├── state.py              # State models (ResearchState, etc.)
│   ├── agents.py             # Agent implementations with DI
│   ├── graph.py              # LangGraph workflow + checkpointing
│   ├── callbacks.py          # Progress callback system
│   ├── llm_tracker.py        # Token and cost tracking
│   ├── exceptions.py         # Typed domain exceptions
│   │
│   ├── prompts/              # Extracted prompt templates
│   │   ├── __init__.py
│   │   ├── planner.py        # Planning prompts
│   │   ├── searcher.py       # Search prompts
│   │   ├── synthesizer.py    # Synthesis prompts
│   │   └── writer.py         # Writing prompts
│   │
│   └── utils/
│       ├── __init__.py
│       ├── tools.py          # LangChain @tool functions
│       ├── web_utils.py      # httpx client, circuit breaker, search providers
│       ├── cache.py          # Research caching (7-day TTL)
│       ├── credibility.py    # Source credibility scoring
│       ├── citations.py      # Citation formatting
│       ├── exports.py        # Multi-format export
│       └── history.py        # Research history
│
├── outputs/                  # Generated reports
├── .cache/
│   ├── research/             # Cached results
│   ├── checkpoints/          # Workflow checkpoints (SQLite)
│   └── research_history.json
│
├── assets/                   # Documentation assets
├── main.py                   # CLI entry point
├── app.py                    # Chainlit web interface
├── requirements.txt          # Dependencies
├── pyproject.toml            # Project metadata
├── LICENSE                   # MIT License
└── README.md

Key Components

Exception Hierarchy

DeepResearchError
├── ConfigurationError
├── PlanningError
├── SearchError
│   └── RateLimitError
├── ContentExtractionError
├── SynthesisError
├── ReportGenerationError
├── CircuitOpenError
└── LLMError

Credibility Scoring

Sources are scored (0-100) based on:

Factor	Points
Trusted domain (.edu, .gov, academic)	+30
HTTPS enabled	+5
Academic/research path	+10
Suspicious TLD (.xyz, .tk)	-20
No HTTPS	-10

Default filter threshold: 40 (configurable via MIN_CREDIBILITY_SCORE)

Circuit Breaker States

CLOSED ──► (5 failures) ──► OPEN ──► (30s timeout) ──► HALF_OPEN ──► (success) ──► CLOSED
                              │                            │
                              └──────── (failure) ◄────────┘

API Reference

Core Functions

# Main research function
async def run_research(
    topic: str,
    verbose: bool = True,
    use_cache: bool = True,
    use_checkpoints: bool = True,
    thread_id: Optional[str] = None
) -> Dict[str, Any]

# With SQLite persistence
async def run_research_with_persistence(
    topic: str,
    verbose: bool = True,
    use_cache: bool = True,
    thread_id: Optional[str] = None
) -> Dict[str, Any]

# Resume interrupted workflow
async def resume_research(
    thread_id: str,
    additional_input: Optional[Dict[str, Any]] = None
) -> Dict[str, Any]

# Check workflow state
async def get_workflow_state(thread_id: str) -> Optional[Dict[str, Any]]

# List saved threads
def list_research_threads() -> List[str]

Response Structure

{
    "research_topic": str,
    "plan": ResearchPlan,
    "search_results": List[SearchResult],
    "credibility_scores": List[Dict],
    "key_findings": List[str],
    "report_sections": List[ReportSection],
    "final_report": str,
    "current_stage": str,
    "error": Optional[str],
    "iterations": int,
    "llm_calls": int,
    "total_input_tokens": int,
    "total_output_tokens": int,
    "llm_call_details": List[Dict]
}

Output Format

Reports follow this structure:

# [Research Topic]

**Deep Research Report**

## Executive Summary
[Overview with source count and section count]

## Research Objectives
1. [Objective 1]
2. [Objective 2]
...

---

## [Section 1 Title]
[Content with inline citations [1], [2]]

## [Section 2 Title]
[Content with inline citations [3], [4]]

---

## References
1. [Formatted citation - APA/MLA/Chicago/IEEE]
2. [Formatted citation]
...

---

**Note:** X high-credibility sources were prioritized.

Development

Running Tests

pytest tests/ -v

Adding a New Search Provider

# src/utils/web_utils.py
class GoogleSearchProvider(SearchProvider):
    @property
    def name(self) -> str:
        return "google"

    async def search(self, query: str, max_results: int) -> List[SearchResult]:
        # Implementation
        pass

# Register in WebSearchTool
tool = WebSearchTool(providers=[
    DuckDuckGoProvider(),
    GoogleSearchProvider()  # Fallback
])

Customizing Prompts

Edit files in src/prompts/:

planner.py - Research planning strategy
searcher.py - Search agent instructions
synthesizer.py - Synthesis methodology
writer.py - Report writing style

Contributing

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Commit changes (git commit -m 'Add amazing feature')
Push to branch (git push origin feature/amazing-feature)
Open a Pull Request

License

MIT License - See LICENSE file for details.

Acknowledgments

Built with:

LangGraph - Workflow orchestration
LangChain - LLM framework
Chainlit - Web interface
httpx - Async HTTP client
DuckDuckGo - Web search

Supports:

Ollama & llama.cpp - Local models
Google Gemini & OpenAI - Cloud APIs

Contact

GitHub: tarun7r
LinkedIn: Tarun Sai Goddu
Hugging Face: tarun7r
Email: [email protected]

deep-research-agent