Morgana

Originated from open source, give back to open source.

Morgana is a deep research studio that orchestrates multi-agent investigations, source-grounded synthesis, and polished deliverables. It pairs large language models with specialized tools for search, crawling, code execution, and content generation while keeping humans in the loop where it matters.

This project is inspired by the excellent DeerFlow initiative. Morgana keeps the spirit of that work while re-imagining the architecture, agent orchestration, and user experience for a more opinionated deep research workflow.

Highlights

LangGraph Deep Agent Core – the researcher node is powered by LangGraph's Deep Agent architecture, combining long-form prompts, a planning tool, sub-agents, middleware, and a virtual file system so investigations can branch, persist intermediate artifacts, and rejoin when ready.
Task-Aware Planning – Morgana persists structured TODO plans, status files, and research artifacts for every thread. The planner seamlessly rehydrates context, spawns focused sub-tasks, and keeps execution state off the main context window.
Modern Web Experience – a refreshed UI centers the chat workspace, introduces responsive layouts for research sidebars, and ships with theme controls, replay support, and polished animation inspired by production agent tooling.
Model Flexibility – any LiteLLM-compatible model can be plugged in. The hosted deployment defaults to OpenAI GPT-4.1 Nano for the deep agent to balance context hunger with cost, while heavier models remain available for local runs.
Observability by Design – LangGraph Studio, structured checkpoints, and replayable chat sessions make it straightforward to audit, debug, and iterate on workflows.

Demo

Video

DeepResearch.mp4

The demo covers:

Integration with Model Context Protocol (MCP) services
End-to-end deep research culminating in an illustrated report
Podcast generation built from the generated research narrative

Replays

Quick Start

Morgana (a.k.a. DeepResearchAgent) ships with a Python backend and a Next.js web experience. The tooling below mirrors the workflow we use for local development.

Recommended Tools

uv: Creates and manages project virtual environments and dependencies automatically.
nvm: Makes it simple to install and switch between Node.js versions.
pnpm: Efficient package manager for the web workspace.

Environment Requirements

Python: Version 3.12 or newer
Node.js: Version 22 or newer

Installation

# Clone the repository
git clone https://github.com/obinopaul/DeepResearchAgent.git
cd DeepResearchAgent

# Install Python dependencies (uv will create the virtualenv for you)
uv sync

# Prepare environment configuration
cp .env.example .env
cp conf.yaml.example conf.yaml

# The .env file holds API keys (Tavily, Brave Search, RAG providers, Volcengine TTS, etc.)
# conf.yaml controls LLM routing; use Ollama or local models for offline development.

# (Optional) install Marp for PPT generation
brew install marp-cli

Install the web dependencies with pnpm if you plan to run the full-stack experience:

cd DeepResearchAgent/web
pnpm install

Configurations

See the Configuration Guide for an exhaustive walkthrough of every environment flag and configuration value. Update the sample files before booting the stack.

Console UI

The quickest way to experience Morgana is through the console application:

uv run main.py

Web UI

For the full chat and research experience, start both backend and frontend services:

# macOS/Linux
./bootstrap.sh -d
# Windows
bootstrap.bat -d

By default the API binds to 127.0.0.1 for safety. Change the host to 0.0.0.0 inside the bootstrap script if you are deploying remotely, and secure the environment before exposing it.

Then visit http://localhost:3000.

Supported Search Engines

Web Search APIs

Configure the SEARCH_API variable in .env to choose a provider:

Tavily (default) – purpose-built for AI assistants (TAVILY_API_KEY required)
DuckDuckGo – privacy-focused and keyless
Brave Search – privacy-first with advanced filters (BRAVE_SEARCH_API_KEY)
ArXiv – scientific publications without an API key
Searx/SearxNG – self-hosted metasearch (SEARX_HOST required)

# Choose one: tavily, duckduckgo, brave_search, arxiv, searx
SEARCH_API=tavily

Private Knowledge Base

Morgana supports private retrieval sources such as RAGFlow and VikingDB so research can stay grounded in your corpus.

# Example RAGFlow configuration (see .env.example)
RAG_PROVIDER=ragflow
RAGFLOW_API_URL="http://localhost:9388"
RAGFLOW_API_KEY="ragflow-xxx"
RAGFLOW_RETRIEVAL_SIZE=10
RAGFLOW_CROSS_LANGUAGES=English,Chinese,Spanish,French,German,Japanese,Korean

Features

Deep Agent Core

Built on LangGraph's Deep Agent architecture with dedicated planner, TODO tool, and hierarchical sub-agents for every research thread.
Researcher sub-agents persist plans, intermediate notes, and execution progress to a virtual file system, letting long-form tasks resume without blowing up the context window.
Configurable middleware lets you inject validation, guardrails, or custom tracing into the agent loop.

Model Strategy

Compatible with any LiteLLM provider or OpenAI-compatible API.
Defaults to OpenAI GPT-4.1 Nano for hosted research tasks to keep costs predictable while still supporting heavier models for desktop runs.
Supports popular open-source models such as Qwen when configured via the provided guides.

Modern Web Experience

Responsive, centered chat layout with adaptive research sidebars.
Built-in theme toggle, animated loading states, and replay support for showcasing multi-agent runs.
Auth-friendly header components with quick links to project assets.

Tools and MCP Integrations

Web search, crawling, and structured extraction (Tavily, Brave, Jina, and more)
Native RAGFlow integration for private document retrieval
MCP clients extend Morgana into knowledge graphs, enterprise APIs, and internal tooling

Human Collaboration

Natural-language plan editing keeps humans steering long tasks ([EDIT PLAN], [ACCEPTED], etc.)
Optional auto-accept flow when you want to run without interruptions
Notion-style editing experience for generated reports, including AI-assisted rewrite actions

Content Creation

Podcast scripts and audio generation for every report
Automated slide and presentation exports (Marp-based)
Customizable templates for downstream publishing

Architecture

Morgana implements a modular LangGraph workflow with clear boundaries between planning, research, execution, and reporting.

See it live at deep-research-agent-taupe.vercel.app

The pipeline consists of:

Coordinator – entry point that receives user requests, orchestrates the graph, and routes state transitions.
Planner – builds structured research plans, rehydrates existing TODOs, and decides when to iterate versus execute.
Deep Research Team – hierarchical agent stack:
- Deep Researcher (LangGraph Deep Agent) performs web work, manages sub-agents, interacts with the file system, and tracks TODO status.
- Coder executes Python tool invocations for data wrangling or code-level investigation.
- Additional sub-agents can be added per workflow via configuration.
Reporter – synthesizes findings, enriches them with images, citations, and media, and hands results to the UI for editing or podcast generation.

Every agent interaction is checkpointed. Plans, notes, and data artifacts are stored via the LangGraph checkpoint saver so you can replay, inspect, and resume long-running sessions.

Workflow Schemas

The diagrams below mirror the visual walkthrough in docs/workflow_architecture.md and are included here for quick reference.

flowchart LR
   Start([User Request])
   Coordinator[Coordinator Node\nChat intake & routing]
   Background{Enable background\ninvestigation?}
   Investigator[Background Investigator\nWeb search & crawl]
   Planner[Planner Node\nTrustCall plan builder]
   Iteration{Plan accepted?}
   Maxed{Max iterations reached?}
   ResearchTeam[Research Team Hub\nStep dispatcher]
   Researcher[Researcher Node\nDeep Agent executor]
   Coder[Coder Node\nPython & data tooling]
   Reporter[Reporter Node\nReport & media synthesis]
   End([Final Deliverables])

   Start --> Coordinator
   Coordinator --> Background
   Background -->|Yes| Investigator
   Background -->|No| Planner
   Investigator --> Planner
   Planner --> Iteration
   Iteration -->|Needs edits| Planner
   Iteration -->|Accepted| ResearchTeam
   Planner --> Maxed
   Maxed -->|Yes| Reporter
   Maxed -->|No| ResearchTeam
   ResearchTeam -->|Research step| Researcher
   ResearchTeam -->|Processing step| Coder
   Researcher --> ResearchTeam
   Coder --> ResearchTeam
   ResearchTeam -->|Plan complete| Reporter
   Reporter --> End

flowchart TB
   Entry[Pending plan steps\nfrom Research Team]
   Chunk[Chunk steps in pairs\ntwo at a time]
   Brief[Compose execution brief\nuser query, step summary, resources]
   Timer[[Research Timer Middleware\nLangChain middleware]]
   Orchestrator[LangGraph Deep Agent\nrecursion limit 1000]
   subgraph SubAgents[Specialist Sub-Agents]
      Research[Research agent\nweb & MCP tooling]
      Critique[Critique agent\nreport redlining]
      Strategist[Query strategist\nprompt refinement]
      Synthesizer[Insight synthesizer\nevidence scoring]
      Architect[Exploration architect\nfollow-up design]
      Auditor[Evidence auditor\ncitation checks]
   end
   Tools[(Search, Crawl, RAG,\nPython REPL, MCP servers)]
   Result[Capture pair response\nand update step status]
   Aggregator[Accumulate execution results\nfor every pair]
   Synthesis[Invoke reporter LLM\nfor final synthesis]
   Update[Write observations, reports,\nand plan step metadata]
   Return[Return to Research Team node]

   Entry --> Chunk --> Brief
   Brief --> Timer --> Orchestrator
   Orchestrator --> SubAgents
   SubAgents --> Orchestrator
   Orchestrator --> Tools
   Tools --> Orchestrator
   Orchestrator --> Result --> Aggregator
   Aggregator -->|More pairs pending| Chunk
   Aggregator -->|All pairs complete| Synthesis --> Update --> Return

Middleware Stack

Morgana relies on a layered middleware pipeline inspired by the deep-dive in docs/linkedin_post.md:

FilesystemMiddleware (src/agents/deep_agents/middleware/filesystem.py) – exposes ls, read_file, write_file, and edit_file tools with optional long-term storage so agents can persist and revisit artifacts throughout an investigation.
SubAgentMiddleware (src/agents/deep_agents/middleware/subagents.py) – spins up the specialist crew (research, critique, strategist, synthesizer, architect, auditor) and injects adaptive summarization guards for each sub-agent.
AdaptiveSummarizationMiddleware (src/agents/deep_agents/middleware/summarization.py) – enforces token budgets, rebalances history, and compresses transcripts only when the context window is truly at risk.
ResearchTimerMiddleware (src/agents/deep_agents/middleware/timer.py) – keeps the deep agent accountable with time-boxed nudges and final wrap-up messages when steps linger.
PatchToolCallsMiddleware (src/agents/deep_agents/middleware/patch_tool_calls.py) – repairs out-of-band tool call responses so downstream reducers always see well-formed tool events.
TodoListMiddleware and HumanInTheLoopMiddleware – manage structured TODO artifacts and enable review/approval checkpoints before the system commits to a plan.
Prompt caching & tracing layers – Anthropic prompt caching (when available) and LangGraph checkpointing round out observability so every run stays debuggable.

Each middleware can be composed or extended per agent; see src/agents/deep_agents/graph.py for the exact stack assembled for the researcher and its sub-agents.

Text-to-Speech Integration

Morgana ships with Volcengine-based text-to-speech so reports can be turned into audio summaries.

curl --location 'http://localhost:8000/api/tts' \
  --header 'Content-Type: application/json' \
  --data '{
    "text": "This is a test of the text-to-speech functionality.",
    "speed_ratio": 1.0,
    "volume_ratio": 1.0,
    "pitch_ratio": 1.0
  }' \
  --output speech.mp3

Configure your TTS credentials in .env before invoking the endpoint.

Development

Testing

make test               # run all tests
pytest tests/integration/test_workflow.py
make coverage

Code Quality

make lint
make format

Debugging with LangGraph Studio

LangGraph Studio can visualize Morgana's workflow in real time. The repository includes langgraph.json so you can boot the studio with a single command.

macOS

curl -LsSf https://astral.sh/uv/install.sh | sh
uvx --refresh --from "langgraph-cli[inmem]" --with-editable . --python 3.12 langgraph dev --allow-blocking

Windows / Linux

pip install -e .
pip install -U "langgraph-cli[inmem]"
langgraph dev

The CLI prints the API endpoint, Studio UI, and docs URL. Open the Studio UI link to inspect state transitions, provide live feedback during planning, and replay runs.

Enabling LangSmith Tracing

Set the following variables in .env to stream traces to LangSmith:

LANGSMITH_TRACING=true
LANGSMITH_ENDPOINT="https://api.smith.langchain.com"
LANGSMITH_API_KEY="xxx"
LANGSMITH_PROJECT="xxx"

Then launch langgraph dev to capture traces locally and within LangSmith.

Checkpointing

LangGraph checkpoint saver supports Postgres and MongoDB. In-memory stores cache streaming messages before persistence.
Set LANGGRAPH_CHECKPOINT_SAVER=true and configure LANGGRAPH_CHECKPOINT_DB_URL in .env.

LANGGRAPH_CHECKPOINT_SAVER=true
LANGGRAPH_CHECKPOINT_DB_URL="mongodb://localhost:27017/"
# LANGGRAPH_CHECKPOINT_DB_URL="postgresql://localhost:5432/postgres"
# LANGGRAPH_CHECKPOINT_DB_URL="postgresql://postgres:postgres@localhost:${POSTGRES_HOST_PORT:-5433}/checkpointing_db"      # host -> dockerized Postgres
# LANGGRAPH_CHECKPOINT_DB_URL="postgresql://postgres:postgres@checkpoint-db:5432/checkpointing_db"  # backend container -> Postgres container

Default database: checkpoint_db. Collections: checkpoint_writes_aio, checkpoints_aio, chat_streams.
For Postgres, prefer langgraph-checkpoint-postgres==2.0.21 until issue #5557 is resolved (TypeError: Object of type HumanMessage is not JSON serializable).
Psycopg typically requires libpq. Install the binary extra if you do not have system libraries available:

pip install psycopg[binary]

See the Psycopg installation guide for platform support.

Docker

Run Morgana in containers after preparing .env and conf.yaml.

docker build -t morgana-api .
docker run -d -t -p 127.0.0.1:8000:8000 --env-file .env --name morgana-api-app morgana-api
docker stop morgana-api-app

The repository also includes a compose file to launch backend and frontend together:

docker compose build
docker compose up

checkpoint-db runs Postgres 15 with persistent storage and a health check. Override POSTGRES_USER, POSTGRES_PASSWORD, POSTGRES_DB, and POSTGRES_HOST_PORT in .env if you need custom credentials or port bindings before running docker compose up.
When you run the backend on the host (e.g., uv run server.py), connect via postgresql://<user>:<password>@localhost:${POSTGRES_HOST_PORT:-5433}/<db>. When the backend runs inside the backend container, use postgresql://<user>:<password>@checkpoint-db:5432/<db> so it resolves over the compose network.

Add authentication and harden MCP/Python REPL access when deploying to production.

Examples

Research Reports

OpenAI Sora Report – features, access, prompt engineering, and ethics
examples/openai_sora_report.md
Google's Agent to Agent Protocol – emergent agent communication patterns
examples/what_is_agent_to_agent_protocol.md
What is MCP? – Model Context Protocol and other meanings across domains
examples/what_is_mcp.md
Bitcoin Price Fluctuations – market trends, regulation, technical indicators
examples/bitcoin_price_fluctuation.md
What is LLM? – architecture, training, applications, and ethics
examples/what_is_llm.md
How to Use Claude for Deep Research? – prompt engineering and workflows
examples/how_to_use_claude_deep_research.md
AI Adoption in Healthcare – organizational readiness and digital infrastructure
examples/AI_adoption_in_healthcare.md
Quantum Computing vs. Cryptography – vulnerabilities and post-quantum strategies
examples/Quantum_Computing_Impact_on_Cryptography.md
Cristiano Ronaldo: Performance Highlights – achievements and international record
examples/Cristiano_Ronaldo's_Performance_Highlights.md

Run Morgana with your own queries:

uv run main.py "What factors are influencing AI adoption in healthcare?"
uv run main.py --max_plan_iterations 3 "How does quantum computing impact cryptography?"
uv run main.py --interactive
uv run main.py
uv run main.py --help

Interactive Mode

uv run main.py --interactive

Choose English or Chinese, pick a built-in topic or provide your own, and Morgana will generate a full research report.

Human in the Loop

Plan review – Morgana presents the generated plan before execution.
Feedback loop – respond with [ACCEPTED] or [EDIT PLAN] ... to refine it.
Auto-accept – set auto_accepted_plan: true in API calls to bypass review.
API feedback – supply the feedback parameter to guide revisions programmatically.

{
  "messages": [{ "role": "user", "content": "What is quantum computing?" }],
  "thread_id": "my_thread_id",
  "auto_accepted_plan": false,
  "feedback": "[EDIT PLAN] Include more about quantum algorithms"
}

Command Line Arguments

query – research query to run
--interactive – launch guided interactive mode
--max_plan_iterations – cycle planning before execution (default 1)
--max_step_num – constrain plan length (default 3)
--debug – verbose logging

FAQ

See docs/FAQ.md for common questions and troubleshooting tips.

License

Released under the MIT License.

Acknowledgments

Morgana stands on the shoulders of the open-source ecosystem. Thank you to:

LangChain for the foundational LLM framework.
LangGraph for stateful agent orchestration.
Novel for the Notion-style editor powering report revisions.
RAGFlow for private knowledge base retrieval.
DeerFlow whose authors laid the groundwork that Morgana extends.

We continue to contribute back by documenting changes, improving usability, and keeping the project aligned with the needs of deep research practitioners.

Name		Name	Last commit message	Last commit date
Latest commit History 45 Commits
archive		archive
assets		assets
docs		docs
examples		examples
src		src
tests		tests
web		web
.dockerignore		.dockerignore
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
CONTRIBUTING		CONTRIBUTING
Dockerfile		Dockerfile
GEMINI.md		GEMINI.md
LICENSE		LICENSE
Makefile		Makefile
README.md		README.md
bootstrap.bat		bootstrap.bat
bootstrap.sh		bootstrap.sh
browser-freeze-analysis.md		browser-freeze-analysis.md
conf.yaml.example		conf.yaml.example
debug_interrupt.ipynb		debug_interrupt.ipynb
docker-compose.yml		docker-compose.yml
langgraph.json		langgraph.json
main.py		main.py
pre-commit		pre-commit
pyproject.toml		pyproject.toml
question.txt		question.txt
requirements.txt		requirements.txt
server.py		server.py
test.ipynb		test.ipynb
test_fix.py		test_fix.py
uv.lock		uv.lock

License

obinopaul/DeepResearchAgent

Folders and files

Latest commit

History

Repository files navigation

Morgana

Highlights

Demo

Video

Replays

Table of Contents

Quick Start

Recommended Tools

Environment Requirements

Installation

Configurations

Console UI

Web UI

Supported Search Engines

Web Search APIs

Private Knowledge Base

Features

Deep Agent Core

Model Strategy

Modern Web Experience

Tools and MCP Integrations

Human Collaboration

Content Creation

Architecture

Workflow Schemas

Middleware Stack

Text-to-Speech Integration

Development

Testing

Code Quality

Debugging with LangGraph Studio

macOS

Windows / Linux

Enabling LangSmith Tracing

Checkpointing

Docker

Examples

Research Reports

Interactive Mode

Human in the Loop

Command Line Arguments

FAQ

License

Acknowledgments

About

Resources

License

Contributing

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages