Weaviate

Weaviate is a robust, open-source vector database that allows you to store and query data based on its meaning. It supports various modules for text, image, and multimodal vectorization, enabling semantic search, advanced filtering, and question-answering. Weaviate offers flexible deployment options and integrates seamlessly with popular machine learning models and frameworks, providing GraphQL, and REST for easy integration with your applications.

The Apolo Weaviate App delivers a fully‑managed cluster with:

  • REST, GraphQL & gRPC APIs

  • Persistent volume for data

  • Optional S3 backups

  • One‑click HTTPS ingress secured by basic‑auth

Key Features

Feature
How the Apolo App Helps

Semantic Search

Query with nearVector, nearText, hybrid BM25‑+‑vector, etc.

Multimodal Support

Bring your own embeddings for text, images, audio or mix.

Modular Pipeline

Add vectorizers & rerankers outside the DB; Apolo bundles everything in one namespace.

Horizontal Scaling

Change the Resource Preset and restart—no manual sharding.

Secure Ingress

Auto‑issued TLS cert + platform basic‑auth.

Automated Backups

Nightly snapshots to an Apolo Files bucket (toggleable).

The Apolo Platform ships a one‑click Weaviate App that encapsulates Helm deployment, persistent storage, ingress and (optionally) automatic backups.

Installing

Below are the detailed instructions for installing Service Deployment using Apolo Console. For instructions on how to install it using Apolo CLI, visit Apolo CLI Service Deployment.

Installing via Apolo Console

1 · Open the catalogue

Navigate to Apps ▸ All apps and locate Weaviate App.

2 · Fill the wizard

Section
Field
Example
Notes

Resource Preset

cpu-large

4 vCPU / 8 GiB

Adjust as needed.

Persistent Storage

size = 32 GiB

PVC for data & WAL.

Enable Backups

true

Creates storage:weaviate-backups bucket.

Enable HTTP Ingress

auth = true

Makes https://weaviate-<id>.apps.<cluster>.apolo.us.

Click Install. In Details your external endpoints (REST & GraphQL), basic‑auth creds and namespace appear once status is healthy.

Apolo CLI

Deploy the same config via YAML.

Explanation

  • preset.name — picks CPU/RAM preset.

  • persistence.size — volume size; backups on by default.

  • ingress_http.auth=false — exposes endpoints openly; set true for private mode.


Inputs / Outputs (schema v1)

Inputs

JSON Path

Default

Description

preset.name

Resource preset per replica.

persistence.size

32

Volume size (GiB).

persistence.enable_backups

true

Nightly S3 snapshots.

ingress_http.auth

true

Require basic‑auth.

Outputs

Key
Purpose

external_graphql_endpoint.*

Public /v1/graphql endpoint.

external_rest_endpoint.*

Public /v1 REST endpoint.

internal_graphql_endpoint.*

Private /v1/graphql endpoint.

internal_rest_endpoint.*

Private /v1 REST endpoint.

internal_grpc_endpoint.*

Private gRPC in‑cluster.

auth.username / auth.password

Basic‑auth creds (if enabled).


Usage

Quick connectivity test & schema bootstrap:

Example Python Scripts

This example demonstrates connecting to Weaviate, defining a schema, embedding documents using the NV-Embed-v2 model, storing them in Weaviate, and performing a similarity search.

This script demonstrates connecting to Weaviate, defining a schema, embedding documents using OpenAI embeddings, storing them in Weaviate via LlamaIndex, and performing a similarity search.\

References

Last updated

Was this helpful?