Deep Agent: AI-Powered Real Estate Orchestrator

Welcome to the Deep Agent Real Estate Orchestrator – a cutting-edge implementation of modern AI and ML. This project demonstrates a production-grade, multi-agent system designed to perform complex reasoning, valuation, and analysis combining classical ML, RAG, vision AI, and geospatial analysis

🚀 Key Innovations & Architecture

This project is built around the ReAct (Reasoning and Acting) framework, managed by a supervisor agent that intelligently routes queries to specialized sub-agents. It leverages the emerging Model Context Protocol (MCP) and Agent to Agent protocol (A2A) to provide standardized, highly-performant tools to the AI via a FastAPI backend.

1. Multi-Agent Orchestration (LangGraph)

Hierarchical Routing: A supervisor agent synthesizes user requests and conceptually delegates tasks to specialized domain agents (Zoning, Vision, Valuation, Market Expert).
LangSmith Observability: Full tracing of thought processes, tool invocations, and agent latency for rapid debugging and MLOps monitoring.

2. MCP and A2A Integration

Standardized Tool Access: Built a modular API layer using bare-metal FastAPI to expose programmatic tools to the AI.
Pydantic Validation: Strict input/output schema validation ensuring agents never hallucinate tool parameters and execute deterministic backend code.
Intent Parsing & Semantic Routing: The top-level Supervisor Node intercepts raw user queries and semantically determines which specialized sub-agent (or combination of agents) is required to fulfill the request.
Specialized Worker Nodes: The Supervisor dispatches precise context payloads to isolated worker nodes, including:
- Zoning Expert: Dedicated to querying the RAG Pinecone index.
- Vision Assessor: Dedicated to processing LMM image classification.
- Valuation Analyst: Dedicated to querying the XGBoost pipeline.
- Market Guide: Dedicated to querying the fine-tuned LLM.
State Graph Orchestration: LangGraph maintains a strict State object that flows between nodes, ensuring that an agent can request more information from another agent before synthesizing a final, comprehensive response back to the user. This enforces separation of concerns and dramatically reduces hallucinations at scale.

3. Advanced RAG & Vector Search

Pinecone Vector Database: High-dimensional semantic search over heavy, unstructured local zoning laws (ingested, chunked, and vectorized).
Contextual Chunking: Optimized retrieval augmented generation (RAG) using text-embedding-3-small and RecursiveCharacterTextSplitter.

4. Large Multimodal Models (LMM) in Production

Automated Damage Assessment: Integrating GPT-4o-mini's visual capabilities to process raw property imagery, resizing and standardizing on the fly.
Metadata Extraction: Extracting severity metrics and generating multimodal embeddings upserted directly into the vector DB.

5. Classical ML Meets Agentic AI

Fair Market Value (FMV) Modeling: Building a highly robust XGBoost Regressor, bridging the gap between Generative AI and Classical ML.
Optuna Hyperparameter Tuning: Automated, intelligent grid searching with Early Stopping to optimize RMSE and R² scores.
Explainable AI (XAI): Utilizing SHAP (SHapley Additive exPlanations) and Permutation Importance so the Agent can confidently explain its valuation logic to users.

6. Geospatial & Amenity Intelligence

OpenStreetMap (OSM) API: Live geographic querying using Haversine distance formulas to calculate neighborhood amenity metrics.
Algorithmic Walkability: Creating dynamic walkability scores based on radius thresholds to augment the FMV model and Agent Market Reports.

7. Domain-Specific Fine-Tuning

Hyper-Local Expertise: Curating specialized Orlando market datasets to fine-tune GPT-4o-mini, drastically reducing hallucinations and giving the Market Expert agent an unparalleled understanding of hyper-local real estate dynamics.

🛠️ Technology Stack

Category	Technologies
Agentic Frameworks	LangChain, LangGraph, OpenAI Assistant API
Foundation Models	GPT-4o, GPT-4o-mini, `text-embedding-3-small`
Machine Learning	XGBoost, Scikit-Learn, Optuna, SHAP
Vector DB / Search	Pinecone
Backend & APIs	MCP, A2A, FastAPI, Pydantic
Data Pipeline	Pandas, Numpy, Pillow (PIL), OpenStreetMap
DevOps / MLOps	`uv` (Dependency Management), LangSmith, `.env` config

📂 Project Structure

sturdy-barnacle/
├── src/
│   └── deep_agent/                 # Placeholder for productionized multi-agent core
├── notebooks/                      # R&D, Model Training, & Agent Prototyping
│   ├── agent.py                    # Core ReAct agent implementation
│   ├── mcp_server.py               # MCP FastAPI entrypoint
│   ├── tools.py                    # Tool definitions and Pydantic schemas
│   ├── agent2agent.ipynb           # Multi-agent orchestrator logic
│   ├── xgboost-fmv-model-v2.ipynb  # Advanced XGBoost modeling & SHAP explainability
│   ├── ingest_vectorize_zoning.ipynb # RAG ingestion pipeline
│   ├── open_street_maps_api_test.ipynb # Geospatial data engineering
│   ├── mcp_layer.ipynb             # Server-client protocol testing
│   └── resize_metadata_tag_prop_images.ipynb # Vision processing pipeline
├── data/                           # RAG source files, Housing data, Images
├── tests/                          # Test suite (Integration, Unit)
├── .env                            # Environment Variables (OpenAI, Pinecone, LangSmith)
├── pyproject.toml                  # Modern uv-managed project dependencies
└── README.md                       # Project documentation

💻 Getting Started

Prerequisites

Python 3.12+
uv (Rapid Python Package Installer & Resolver)
Accounts for: OpenAI, Pinecone, LangSmith

1. Installation

Clone the repository and install dependencies using uv for hyper-fast resolution:

git clone https://github.com/Tejas-TA/sturdy-barnacle.git
cd sturdy-barnacle
uv sync

2. Environment Configuration

Create a .env file in the root directory (never commit this) with the following keys:

OPENAI_API_KEY=your_openai_key
PINECONE_API_KEY=your_pinecone_key
PINECONE_ENV=your_pinecone_env
LANGCHAIN_TRACING_V2=true
LANGCHAIN_API_KEY=your_langsmith_key
LANGCHAIN_PROJECT=orlando_real_estate

3. Exploring the AI Pipeline

The notebooks/ directory contains the bleeding-edge R&D workflows. I recommend stepping through them in this order:

RAG Vectorization: ingest_vectorize_zoning.ipynb
Machine Learning: xgboost-fmv-model-v2.ipynb
Computer Vision: resize_metadata_tag_prop_images.ipynb
Agent Orchestration: agent2agent.ipynb

4. Running the MCP Server

To expose the agentic tools to your environment locally:

uvicorn notebooks.mcp_server:app --reload --port 8000

🌟 Future Roadmap

Finalize migration of notebook prototypes into the src/ modular structure.
Increase Test-Driven Development (TDD) coverage in tests/ directory to harden MCP tools.
Seamless GUI integration for human-in-the-loop (HITL) approval on Fair Market Valuation predictions.
Containerize architecture using Docker / Kubernetes for one-click Kubernetes deployment.

This project exemplifies the seamless integration of Agentic Orchestration, Predictive Modeling, and Multimodal Vision AI-setting a benchmark for intelligent automation.

Name		Name	Last commit message	Last commit date
Latest commit History 54 Commits
.venv/Scripts		.venv/Scripts
data		data
notebooks		notebooks
output		output
src/deep_agent		src/deep_agent
tests		tests
.gitignore		.gitignore
README.md		README.md
agent2agent_streamlit_HITL_demo.mp4		agent2agent_streamlit_HITL_demo.mp4
agentic_rag_mcp_demo.gif		agentic_rag_mcp_demo.gif
gradio_RAG_chat_history_demo.mp4		gradio_RAG_chat_history_demo.mp4
orlando_fmv_model_v2_documentation.docx		orlando_fmv_model_v2_documentation.docx
pyproject.toml		pyproject.toml
uv.lock		uv.lock

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Deep Agent: AI-Powered Real Estate Orchestrator

🚀 Key Innovations & Architecture

1. Multi-Agent Orchestration (LangGraph)

2. MCP and A2A Integration

3. Advanced RAG & Vector Search

4. Large Multimodal Models (LMM) in Production

5. Classical ML Meets Agentic AI

6. Geospatial & Amenity Intelligence

7. Domain-Specific Fine-Tuning

🛠️ Technology Stack

📂 Project Structure

💻 Getting Started

Prerequisites

1. Installation

2. Environment Configuration

3. Exploring the AI Pipeline

4. Running the MCP Server

🌟 Future Roadmap

About

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Deep Agent: AI-Powered Real Estate Orchestrator

🚀 Key Innovations & Architecture

1. Multi-Agent Orchestration (LangGraph)

2. MCP and A2A Integration

3. Advanced RAG & Vector Search

4. Large Multimodal Models (LMM) in Production

5. Classical ML Meets Agentic AI

6. Geospatial & Amenity Intelligence

7. Domain-Specific Fine-Tuning

🛠️ Technology Stack

📂 Project Structure

💻 Getting Started

Prerequisites

1. Installation

2. Environment Configuration

3. Exploring the AI Pipeline

4. Running the MCP Server

🌟 Future Roadmap

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Uh oh!

Contributors

Uh oh!

Languages