This installation guide is for R2R Core. For solo developers or teams prototyping, we highly recommend starting with R2R Light.

R2R Local System Installation

This guide will walk you through installing and running R2R on your local system without using Docker. This method allows for more customization and control over individual components.

Local installation of R2R Core is challenging due to the numerous services it integrates. We strongly recommend using Docker to get started quickly.If you choose to proceed with a local installation, be prepared to set up and configure the following services:

Postgres with pgvector: A relational database with vector storage capabilities.
Unstructured.io: A complex system for file ingestion.
Hatchet: A RabbitMQ-based orchestration system.

Alternatively, you can use cloud versions of these services, but you’ll be responsible for enrolling in them and providing the necessary environment variables.Each of these components has its own requirements, potential compatibility issues, and configuration complexities. Debugging issues in a local setup can be significantly more challenging than using a pre-configured Docker environment.

Unless you have a specific need for a local installation and are comfortable with advanced system configuration, we highly recommend using the Docker setup method for a smoother experience.

Prerequisites

Before starting, ensure you have the following installed and/or available in the cloud:

Python 3.10 or higher
pip (Python package manager)
Git
Postgres + pgvector
Unstructured file ingestion
Hatchet workflow orchestration

Install the R2R CLI & Python SDK

First, install the R2R CLI and Python SDK:

pip install 'r2r[core ingestion-bundle hatchet]'

Environment Setup

R2R requires connections to various services. Set up the following environment variables based on your needs:

Cloud LLM Providers

Note, cloud providers are optional as R2R can be run entirely locally.

 # Set cloud LLM settings
 export OPENAI_API_KEY=sk-...
 # export ANTHROPIC_API_KEY=...
 # ...

Hatchet

R2R uses Hatchet for orchestration. When building R2R locally, you will need to either (a) register for Hatchet’s cloud service or (b) install it locally following their instructions, or (c) deploy the R2R docker and connect with the internally provisioned Hatchet service.

 # Set cloud LLM settings
 export HATCHET_CLIENT_TOKEN=...

Postgres+pgvector

With R2R you can connect to your own instance of Postgres+pgvector or a remote cloud instance.

 # Set Postgres+pgvector settings
 export POSTGRES_USER=$YOUR_POSTGRES_USER
 export POSTGRES_PASSWORD=$YOUR_POSTGRES_PASSWORD
 export POSTGRES_HOST=$YOUR_POSTGRES_HOST
 export POSTGRES_PORT=$YOUR_POSTGRES_PORT
 export POSTGRES_DBNAME=$YOUR_POSTGRES_DBNAME
 export R2R_PROJECT_NAME=$YOUR_PROJECT_NAME # see note below

The R2R_PROJECT_NAME environment variable defines the tables within your Postgres database where the selected R2R project resides. If the specified tables do not exist then they will be created by R2R during initialization.

Unstructured

By default, R2R uses unstructured.io to handle file ingestion. Unstructured can be:

Installed locally by following their documented instructions
Connected to via the cloud

For cloud connections, set the following environment variable:

# Set Unstructured API key for cloud usage
export UNSTRUCTURED_API_KEY=your_api_key_here

Alternatively, R2R has its own lightweight end-to-end ingestion, which may be more appropriate in some situations. This can be specified in your server configuration by choosing r2r as your ingestion provider.

Running R2R

The full R2R installation does not use the default r2r.toml, instead it provides overrides through a pre-built custom configuration, full.toml.

After setting up your environment, you can start R2R using the following command:

# requires services for unstructured, hatchet, postgres
r2r serve --config-name=full

For local LLM usage:

r2r serve --config-name=full_local_llm

Python Development Mode

For those looking to develop R2R locally:

Install Poetry: Follow instructions on the official Poetry website.

Clone and install dependencies:

git clone git@github.com:SciPhi-AI/R2R.git
cd R2R/py
poetry install -E "core ingestion-bundle hatchet"

Setup environment: Follow the steps listed in the Environment Setup section above. Additionally, you may introduce a local .env file to make development easier, and you can customize your local r2r.toml to suit your specific needs.
Start your server:

poetry run r2r serve --config-name=core

Next Steps

After successfully installing R2R:

Verify Installation: Ensure all components are running correctly by accessing the R2R API at http://localhost:7272/v2/health.
Quick Start: Follow our R2R Quickstart Guide to set up your first RAG application.
In-Depth Tutorial: For a more comprehensive understanding, work through our R2R Walkthrough.
Customize Your Setup: Configure R2R components with the Configuration Guide.

If you encounter any issues during installation or setup, please use our Discord community or GitHub repository to seek assistance.

Getting Started

Setup

Deployment

Deep Dives

Local system

R2R Local System Installation

Prerequisites

Install the R2R CLI & Python SDK

Environment Setup

Running R2R

Python Development Mode

Next Steps

Getting Started

Setup

Deployment

Deep Dives

​R2R Local System Installation

​Prerequisites

​Install the R2R CLI & Python SDK

​Environment Setup

​Running R2R

​Python Development Mode

​Next Steps

R2R Local System Installation

Prerequisites

Install the R2R CLI & Python SDK

Environment Setup

Running R2R

Python Development Mode

Next Steps