This repository evaluates turnkey RAG (Retrieval Augmented Generation) solutions from AWS and GCP, provides working code examples for practitioners, and explores techniques for improving retrieval quality and user experience.
Compare AWS Bedrock Knowledge Base and GCP Vertex AI Search across two dimensions:
- Ease of deployment — time and complexity required to go from raw documents to a working RAG endpoint, including infrastructure setup, permissions, and data ingestion
- Quality of responses — accuracy of document retrieval and relevance of generated answers against a standardized evaluation dataset
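The retrieval-quality dimension can be approximated with a simple top-k hit rate. The function below is a minimal sketch, not the repository's actual scoring code; the function name and result format are illustrative:

```python
def hit_rate(results, expected_doc_ids, k=5):
    """Fraction of questions whose expected source document appears
    in the top-k retrieved results.

    results: list of lists of retrieved document IDs, one list per question.
    expected_doc_ids: expected source document ID per question
                      (None for questions with no answer in the corpus).
    """
    scored = 0
    hits = 0
    for retrieved, expected in zip(results, expected_doc_ids):
        if expected is None:  # "no answer" questions need a separate check
            continue
        scored += 1
        if expected in retrieved[:k]:
            hits += 1
    return hits / scored if scored else 0.0
```

Answer relevance is harder to automate and is typically judged by a human or an LLM grader; this sketch covers only the retrieval half.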
All code is written to be readable and reusable. Each cloud implementation is self-contained in its own directory, with modular Python utilities that can be adapted to other projects. Notebooks walk through every step of the pipeline from data loading to querying, making it straightforward to adapt the code for a new dataset or cloud environment.
Beyond baseline RAG, the notebooks investigate techniques that can improve retrieval and answer quality.
4. Provide simple RAG examples that help builders develop intuition about how these tools work and how they might be improved
Because RAG tools often operate on large amounts of unstructured data and measuring their performance is challenging, it's helpful to have a small number of digestible examples for building intuition about how these platforms perform.
| Presentation Covering the Work | Test Questions and RAG responses |
|---|---|
```
rag_search/
├── data/
│   └── rag_eval_dataset/            # Shared evaluation dataset (CSV + metadata JSON)
├── aws_bedrock/                     # AWS Bedrock Knowledge Base implementation
│   ├── README.md
│   ├── S10_load_data_s3.ipynb
│   ├── S20_build_bedrock_knowledgebase.ipynb
│   ├── S30_search_bedrock_knowledgebase.ipynb
│   ├── s3_data_load.py
│   ├── bedrock_kb_security_build.py
│   ├── bedrock_kb_build.py
│   └── bedrock_kb_query.py
└── gcp_vertexai/                    # GCP Vertex AI Search implementation
    ├── README.md
    ├── S10_load_data_bq.ipynb
    ├── S12_load_data_gcs.ipynb
    ├── S20_build_vertexai_search_app.ipynb
    ├── S30_query_vertexai_searchapp.ipynb
    ├── gcs_data_load.py
    ├── vai_search_app_build.py
    └── vai_search_app_query.py
```
Each cloud directory contains its own README.md with detailed descriptions of every notebook and Python module, authentication setup, and configuration parameters.
Both implementations are tested against the Single-Topic RAG Evaluation Dataset originally created by Samuel Matsuo Harris, available on Kaggle.
The dataset contains 120 question-answer pairs across 20 documents, designed to test three retrieval scenarios:
- Questions with no answer in the document corpus (40)
- Questions requiring a single passage from one document (40)
- Questions requiring multiple passages from one document (40)
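A quick sanity check when working with the dataset is to count questions per scenario. The sketch below uses only the standard library; the column name `scenario` and the scenario labels are assumptions — check the dataset's metadata JSON for the actual schema:

```python
import csv
import io
from collections import Counter

def scenario_counts(csv_text):
    """Count questions per retrieval scenario in the evaluation CSV.

    Assumes a 'scenario' column; verify against the dataset's
    metadata JSON before relying on this.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return Counter(row["scenario"] for row in reader)

# Hypothetical rows illustrating the three scenario types
sample = """question,scenario
Who wrote X?,single_passage
What links A and B?,multi_passage
Is Y true?,no_answer
"""
```

For the full dataset, each of the three counts should come out to 40.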
Located in aws_bedrock/. Uses Amazon Bedrock Knowledge Bases backed by OpenSearch Serverless with Amazon Titan Embed Text v2 embeddings. Documents are stored in S3 with JSON metadata sidecars. See aws_bedrock/README.md for details.
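The JSON metadata sidecars follow the Bedrock Knowledge Bases convention of a `<document>.metadata.json` file containing a top-level `metadataAttributes` object. The helper below is a local sketch of producing such a sidecar (the function name is illustrative; see `s3_data_load.py` for how the repository actually handles uploads):

```python
import json
from pathlib import Path

def write_metadata_sidecar(doc_path, attributes):
    """Write a metadata sidecar next to a source document.

    Bedrock Knowledge Bases picks up '<doc>.metadata.json' files whose
    top-level key is 'metadataAttributes'; the attribute names here
    are up to you and become filterable fields after ingestion.
    """
    sidecar = Path(str(doc_path) + ".metadata.json")
    sidecar.write_text(json.dumps({"metadataAttributes": attributes}, indent=2))
    return sidecar
```

Both the document and its sidecar must land in the same S3 prefix for the ingestion job to associate them.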
Located in gcp_vertexai/. Uses Google Cloud Vertex AI Search with Enterprise tier and LLM add-on. Documents are stored in GCS and optionally indexed in BigQuery. The search engine is configured with a custom schema enabling metadata filtering and faceted navigation. See gcp_vertexai/README.md for details.
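Metadata filtering in Vertex AI Search uses filter expressions such as `topic: ANY("sports")`, where the field names must match the engine's custom schema. The helper below sketches how such expressions could be assembled; it is illustrative, not taken from `vai_search_app_query.py`:

```python
def build_filter(**field_values):
    """Build a Vertex AI Search filter expression from a mapping of
    field name -> allowed values, e.g. topic: ANY("sports","finance").

    The ANY(...) syntax follows the Vertex AI Search filter docs for
    text fields; field names must exist in the engine's schema.
    """
    clauses = []
    for field, values in field_values.items():
        quoted = ",".join(f'"{v}"' for v in values)
        clauses.append(f"{field}: ANY({quoted})")
    return " AND ".join(clauses)
```

The resulting string would be passed as the `filter` field of a search request.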
Developed by Numantic Solutions


