APIbenchmarks
Weaviate Cloud logo

Weaviate Cloud

Weaviate · Ranked #6 of 7 in Vector Database APIs

79.6/ 100
BStrong

Feature-rich vector DB with hybrid search and agentic query tooling, but only a 14-day free sandbox.

Best for

Hybrid search + AI-native query agents

Screenshot of Weaviate Cloud

Overview

Weaviate Cloud is the managed, fully-hosted offering of the open-source Weaviate vector database. It targets teams building semantic search, RAG, and generative-AI applications who want HNSW vector indexing, native hybrid (vector + BM25 keyword) search, and built-in vectorizer/generative module integrations (OpenAI, Cohere, Hugging Face, etc.) without operating their own Kubernetes clusters. The platform is developer-first: schema-driven collections, a GraphQL/REST/gRPC query surface, and first-class Python and TypeScript clients make it fast to stand up a production semantic-search backend. Weaviate has leaned into agentic and embedding features as well, bundling Weaviate Embeddings and "Agents" into the managed tiers, which differentiates it from pure index-only competitors.

In October 2025 Weaviate overhauled its cloud pricing, retiring the old "Serverless" and "Enterprise" labels in favor of Free, Flex (pay-as-you-go from $45/mo), Premium/Premium Dedicated (prepaid from ~$400/mo), with HIPAA-eligible Enterprise/BYOC options. Pricing is now metered on three dimensions, vector dimensions stored, hot storage, and backup storage, which makes flat-index and compression savings visible on the invoice but also makes budgeting harder for newcomers. SLAs scale with tier: 99.5% on Flex (shared GCP) up to 99.95% on Premium, and the company is SOC 2 Type II audited (via Drata) with HIPAA available on AWS Enterprise Cloud. Published ANN benchmarks are strong and transparent: on SIFT1M Weaviate reports 98.35% recall@10 at ~10,940 QPS with 1.44ms mean latency, and on the 1536-dim DBPedia-OpenAI set 97.24% recall@10 at ~5,639 QPS / 2.80ms mean.

Where Weaviate Cloud wins is breadth (hybrid search, multi-tenancy, modules, agents), excellent docs, and an active community that shortens the learning curve for core use cases. Where it loses is cost-at-scale and pricing legibility: the most consistent criticism across G2 and Reddit is that managed pricing climbs quickly into the thousands per month for tens of millions of high-dimensional vectors, and that metering jargon ("vector dimensions," AIUs) makes cost estimation hard, pushing a meaningful slice of users toward self-hosting the open-source core on rented high-memory hardware. Advanced operations (sharding, schema design, replication tuning) also carry a steeper learning curve than the beginner-friendly happy path suggests. Overall it is a credible, well-documented managed vector DB that is most compelling for mid-sized RAG/search workloads rather than hundreds-of-millions-of-vectors deployments.

How this score is derived

The APIbenchmarks Index is a weighted sum of four dimensions, each scored on an absolute 0–100 reference scale. See the methodology for every mapping.

DimensionScoreWeightContribution
Documentation & DXExtensive, well-structured docs at docs.weaviate.io with quickstarts, per-language client guides, a public ANN benchmark page, and a developer academy, widely praised in G2 reviews for clarity.
84
30%25.2
ReliabilityTiered uptime SLAs from 99.5% (Flex/shared GCP) to 99.95% (Premium), backed by SOC 2 Type II (Drata-audited) and HIPAA availability on AWS Enterprise Cloud.
80
25%20.0
Ecosystem & SDKsOpen-source core with official Python, TypeScript/JS, Go and Java clients, native vectorizer/generative modules (OpenAI, Cohere, Hugging Face), and AWS/GCP marketplace listings plus an active community.
80
25%20.0
AccessibilityFree always-on tier and a 14-day Sandbox lower the barrier to entry, but metered pricing dimensions (vector dimensions, storage, backups) and advanced features like sharding raise the learning curve at scale.
72
20%14.4
APIbenchmarks Index (ABI)79.6

Table 1. Derivation of the ABI for Weaviate Cloud. Contribution = score × weight; the index is their sum.

At a glance

Vendor
Weaviate
Pricing model
Usage-based (dimensions + storage)
Free tier
14-day sandbox only (auto-expires), no permanent free
Official SDKs
7 languages

Pricing

Free / Sandbox$0Always-free tier plus a 14-day Sandbox trial (no credit card); best-effort availability, auto-expires.
Flexfrom $45/moPay-as-you-go on shared GCP; 99.5% uptime SLA; metered from $0.00465/1M vector dimensions, $0.12/GiB storage, $0.0264/GiB backup. Includes RBAC, Embeddings, Agents, automated upgrades.
Premium (Shared)from $400/moPrepaid contract; 99.95% uptime SLA; lower per-unit metering (from $0.003875/1M vector dimensions); enhanced security and stronger support.
Premium Dedicatedfrom $400/moDedicated infrastructure, 99.95% SLA, business-critical support, lowest per-unit vector pricing (from $0.002718/1M dimensions).
Enterprise / BYOCCustomBring-Your-Own-Cloud and Enterprise Cloud on AWS; HIPAA-eligible for regulated healthcare workloads.

Key features

  • HNSW approximate-nearest-neighbor vector indexing with flat-index and compression (PQ/BQ/SQ) options
  • Native hybrid search combining dense vectors and BM25 keyword scoring
  • Built-in vectorizer modules (text2vec-openai, cohere, huggingface, etc.) and generative/RAG modules
  • Weaviate Embeddings and Agents bundled into managed tiers
  • Multi-tenancy for isolated per-tenant collections
  • GraphQL, REST and gRPC query interfaces
  • Role-based access control (RBAC)
  • Automated backups and managed version upgrades
  • Replication and sharding for horizontal scale
  • SOC 2 Type II compliance and HIPAA-eligible Enterprise/BYOC deployment

Official SDKs

Python (v4 client)JavaScript / TypeScript (v3 client)GoJava (v6 client)GraphQL APIREST APIgRPC API

Strengths & trade-offs

Strengths
  • +Native hybrid search (HNSW vector + BM25 keyword) with a single query, a standout vs index-only competitors
  • +Strong, transparent published ANN benchmarks (e.g. 98.35% recall@10 at ~10,940 QPS on SIFT1M)
  • +Built-in vectorizer and generative modules (OpenAI, Cohere, Hugging Face) plus bundled Weaviate Embeddings and Agents
  • +Open-source core means a credible self-host escape hatch and no hard vendor lock-in
  • +Clear documentation and an active, responsive community that shortens the learning curve for core use cases
  • +Free always-on tier plus 14-day Sandbox make it easy to start
Trade-offs
  • Managed cost scales steeply, tens of millions of high-dimensional vectors can run into thousands of dollars per month
  • Pricing model is hard to estimate; metering jargon (vector dimensions, AIUs) makes budgeting difficult, a frequent Reddit/G2 complaint
  • Advanced operations (sharding, schema design, replication tuning) have a steep learning curve for beginners
  • October 2025 repricing raised the paid entry point (old $25 Serverless to $45 Flex) and renamed tiers, causing churn in published guidance
  • Best uptime SLA caps at 99.95%, lower than some enterprise database competitors
  • Lower-tier Flex runs on shared GCP infrastructure with only a 99.5% SLA

What developers say

G2 4.6/5 · 29 reviews

Developers praise Weaviate's hybrid search, clean Python/REST APIs, and documentation, but consistently flag that managed cloud cost and pricing legibility get painful at scale.

Weaviate makes it incredibly easy to implement semantic search and generative AI applications, with smooth integration with Python and REST APIs and powerful support for hybrid search (vector + keyword).

Key figures

Recall@10 / QPS / mean latency (SIFT1M, 128-dim)98.35% recall, 10,940 QPS, 1.44ms mean, 3.13ms p99Weaviate official ANN benchmark
Recall@10 / QPS / mean latency (DBPedia-OpenAI, 1536-dim)97.24% recall, 5,639 QPS, 2.80ms mean, 4.43ms p99Weaviate official ANN benchmark
Recall@10 / QPS / mean latency (MSMARCO Snowflake, 8.8M, 768-dim)97.36% recall, 7,363 QPS, 2.15ms mean, 3.69ms p99Weaviate official ANN benchmark
Recall@10 / QPS / mean latency (Sphere DPR, 10M, 768-dim)96.06% recall, 3,523 QPS, 4.49ms mean, 7.73ms p99Weaviate official ANN benchmark
Uptime SLA (Flex / Premium)99.5% (Flex, shared GCP) to 99.95% (Premium)Weaviate pricing page
Flex price (vector dimensions / storage / backup)$0.00465/1M dims, $0.12/GiB storage, $0.0264/GiB backup; $45/mo minimumWeaviate pricing page

Compare Weaviate Cloud head to head

Sources

  1. https://weaviate.io/pricing
  2. https://weaviate.io/blog/weaviate-cloud-pricing-update
  3. https://docs.weaviate.io/weaviate/benchmarks/ann
  4. https://docs.weaviate.io/weaviate/client-libraries
  5. https://www.g2.com/products/weaviate/reviews
  6. https://www.eesel.ai/blog/weaviate-pricing
  7. https://aws.amazon.com/marketplace/pp/prodview-27nbweprm7hha

Figures last verified 2026-06-27. Spotted an error? corrections@apibenchmarks.com