PrivateGPT

Overview

PrivateGPT on Apolo gives you a turnkey, fully private solution to ingest, index, and query your data using LLMs, powered by embeddings and a PGVector backend. It is well suited for building chatbots over proprietary documents, internal knowledge search tools, and RAG workflows, all while maintaining complete data sovereignty. To learn more about PrivateGPT, refer to the main PrivateGPT page for Apolo Console.

Installing

Installing with Apolo CLI

For the approach of managing Service Deployment applications via the Apolo Console, see the instructions on the dedicated documentation page.

Usage

Step 1 — Obtain the application configuration file template:

apolo app-template get private-gpt -o myservice.yaml

Step 2 — Customize the application parameters. With PrivateGPT, you can deploy a RAG application in a few clicks, relying on a previously deployed PostgreSQL app for data storage, Apolo storage for binary files, vLLM for LLM inference, and Text Embeddings or vLLM for embeddings creation.

In this example, we will integrate with previously deployed applications. The easiest way is to build the configuration in the Apolo web console and then save it to the application configuration file. You can do it with this button (after specifying all the parameters):

Extract PrivateGPT app configuration file

Below is an example configuration file that deploys a PrivateGPT app:

# Example of private-gpt-v25.5.1-config.yaml

template_name: private-gpt
template_version: v25.5.1
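# Display name shown for this app instance in the Apolo console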
display_name: ysem-docs-pgpt
input:
    ingress_http:
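        # Enable platform authentication on the exposed HTTP endpoint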
        auth: true
    embeddings_api:
        type: app-instance-ref
        instance_id: 26914c96-a284-46c3-a3db-306fa89a4a4a
        path: embeddings_internal_api
    llm_chat_api:
        type: app-instance-ref
        instance_id: 9a5b6eef-1094-40cb-ba66-4a749e2c92d4
        path: chat_internal_api
    private_gpt_specific:
        llm_temperature: 0.1
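        # Must match the vector dimension produced by the deployed embeddings model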
        embeddings_dimension: 768
        llm_max_new_tokens: 5000
        llm_context_window: 8192
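        # Tokenizer matching the model served by the llm_chat_api application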
        llm_tokenizer_name: Qwen/Qwen2-7B-Instruct
    pgvector_user:
        type: app-instance-ref
        instance_id: cfd2ebe0-c6a5-4a86-a5b0-ebc0a9834ae0
        path: postgres_users/users/0
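    # Compute preset used to run the PrivateGPT service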
    preset:
        name: cpu-medium
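
The instance_id values above are placeholders referring to the embeddings, LLM, and PostgreSQL app instances already deployed in this project; replace them with the IDs of your own deployments. If those applications were installed in the same project, their IDs should be visible in the output of:

apolo app list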

Step 3 — Deploy the application in your Apolo project:

apolo app install -f private-gpt-v25.5.1-config.yaml

Monitor the application status using:

apolo app list

To uninstall the application, use:

apolo app uninstall <app-id>

If you want to see logs of the application, use:

apolo app logs <app-id>

For instructions on how to access the application, please refer to the Usage section in the web console.

Cleanup

When the application is no longer needed, you can remove it by clicking the "uninstall" button on the installed app details/status screen.

