Weaviate Cloud
Weaviate · Ranked #6 of 7 in Vector Database APIs
Feature-rich vector DB with hybrid search and agentic query tooling, but only a 14-day free sandbox.
Hybrid search + AI-native query agents

Overview
Weaviate Cloud is the managed, fully-hosted offering of the open-source Weaviate vector database. It targets teams building semantic search, RAG, and generative-AI applications who want HNSW vector indexing, native hybrid (vector + BM25 keyword) search, and built-in vectorizer/generative module integrations (OpenAI, Cohere, Hugging Face, etc.) without operating their own Kubernetes clusters. The platform is developer-first: schema-driven collections, a GraphQL/REST/gRPC query surface, and first-class Python and TypeScript clients make it fast to stand up a production semantic-search backend. Weaviate has leaned into agentic and embedding features as well, bundling Weaviate Embeddings and "Agents" into the managed tiers, which differentiates it from pure index-only competitors.
In October 2025 Weaviate overhauled its cloud pricing, retiring the old "Serverless" and "Enterprise" labels in favor of Free, Flex (pay-as-you-go from $45/mo), Premium/Premium Dedicated (prepaid from ~$400/mo), with HIPAA-eligible Enterprise/BYOC options. Pricing is now metered on three dimensions, vector dimensions stored, hot storage, and backup storage, which makes flat-index and compression savings visible on the invoice but also makes budgeting harder for newcomers. SLAs scale with tier: 99.5% on Flex (shared GCP) up to 99.95% on Premium, and the company is SOC 2 Type II audited (via Drata) with HIPAA available on AWS Enterprise Cloud. Published ANN benchmarks are strong and transparent: on SIFT1M Weaviate reports 98.35% recall@10 at ~10,940 QPS with 1.44ms mean latency, and on the 1536-dim DBPedia-OpenAI set 97.24% recall@10 at ~5,639 QPS / 2.80ms mean.
Where Weaviate Cloud wins is breadth (hybrid search, multi-tenancy, modules, agents), excellent docs, and an active community that shortens the learning curve for core use cases. Where it loses is cost-at-scale and pricing legibility: the most consistent criticism across G2 and Reddit is that managed pricing climbs quickly into the thousands per month for tens of millions of high-dimensional vectors, and that metering jargon ("vector dimensions," AIUs) makes cost estimation hard, pushing a meaningful slice of users toward self-hosting the open-source core on rented high-memory hardware. Advanced operations (sharding, schema design, replication tuning) also carry a steeper learning curve than the beginner-friendly happy path suggests. Overall it is a credible, well-documented managed vector DB that is most compelling for mid-sized RAG/search workloads rather than hundreds-of-millions-of-vectors deployments.
How this score is derived
The APIbenchmarks Index is a weighted sum of four dimensions, each scored on an absolute 0–100 reference scale. See the methodology for every mapping.
| Dimension | Score | Weight | Contribution |
|---|---|---|---|
| Documentation & DXExtensive, well-structured docs at docs.weaviate.io with quickstarts, per-language client guides, a public ANN benchmark page, and a developer academy, widely praised in G2 reviews for clarity. | 84 | 30% | 25.2 |
| ReliabilityTiered uptime SLAs from 99.5% (Flex/shared GCP) to 99.95% (Premium), backed by SOC 2 Type II (Drata-audited) and HIPAA availability on AWS Enterprise Cloud. | 80 | 25% | 20.0 |
| Ecosystem & SDKsOpen-source core with official Python, TypeScript/JS, Go and Java clients, native vectorizer/generative modules (OpenAI, Cohere, Hugging Face), and AWS/GCP marketplace listings plus an active community. | 80 | 25% | 20.0 |
| AccessibilityFree always-on tier and a 14-day Sandbox lower the barrier to entry, but metered pricing dimensions (vector dimensions, storage, backups) and advanced features like sharding raise the learning curve at scale. | 72 | 20% | 14.4 |
| APIbenchmarks Index (ABI) | 79.6 | ||
Table 1. Derivation of the ABI for Weaviate Cloud. Contribution = score × weight; the index is their sum.
At a glance
- Vendor
- Weaviate
- Pricing model
- Usage-based (dimensions + storage)
- Free tier
- 14-day sandbox only (auto-expires), no permanent free
- Official SDKs
- 7 languages
Pricing
| Free / Sandbox | $0 | Always-free tier plus a 14-day Sandbox trial (no credit card); best-effort availability, auto-expires. |
| Flex | from $45/mo | Pay-as-you-go on shared GCP; 99.5% uptime SLA; metered from $0.00465/1M vector dimensions, $0.12/GiB storage, $0.0264/GiB backup. Includes RBAC, Embeddings, Agents, automated upgrades. |
| Premium (Shared) | from $400/mo | Prepaid contract; 99.95% uptime SLA; lower per-unit metering (from $0.003875/1M vector dimensions); enhanced security and stronger support. |
| Premium Dedicated | from $400/mo | Dedicated infrastructure, 99.95% SLA, business-critical support, lowest per-unit vector pricing (from $0.002718/1M dimensions). |
| Enterprise / BYOC | Custom | Bring-Your-Own-Cloud and Enterprise Cloud on AWS; HIPAA-eligible for regulated healthcare workloads. |
Key features
- •HNSW approximate-nearest-neighbor vector indexing with flat-index and compression (PQ/BQ/SQ) options
- •Native hybrid search combining dense vectors and BM25 keyword scoring
- •Built-in vectorizer modules (text2vec-openai, cohere, huggingface, etc.) and generative/RAG modules
- •Weaviate Embeddings and Agents bundled into managed tiers
- •Multi-tenancy for isolated per-tenant collections
- •GraphQL, REST and gRPC query interfaces
- •Role-based access control (RBAC)
- •Automated backups and managed version upgrades
- •Replication and sharding for horizontal scale
- •SOC 2 Type II compliance and HIPAA-eligible Enterprise/BYOC deployment
Official SDKs
Strengths & trade-offs
- +Native hybrid search (HNSW vector + BM25 keyword) with a single query, a standout vs index-only competitors
- +Strong, transparent published ANN benchmarks (e.g. 98.35% recall@10 at ~10,940 QPS on SIFT1M)
- +Built-in vectorizer and generative modules (OpenAI, Cohere, Hugging Face) plus bundled Weaviate Embeddings and Agents
- +Open-source core means a credible self-host escape hatch and no hard vendor lock-in
- +Clear documentation and an active, responsive community that shortens the learning curve for core use cases
- +Free always-on tier plus 14-day Sandbox make it easy to start
- –Managed cost scales steeply, tens of millions of high-dimensional vectors can run into thousands of dollars per month
- –Pricing model is hard to estimate; metering jargon (vector dimensions, AIUs) makes budgeting difficult, a frequent Reddit/G2 complaint
- –Advanced operations (sharding, schema design, replication tuning) have a steep learning curve for beginners
- –October 2025 repricing raised the paid entry point (old $25 Serverless to $45 Flex) and renamed tiers, causing churn in published guidance
- –Best uptime SLA caps at 99.95%, lower than some enterprise database competitors
- –Lower-tier Flex runs on shared GCP infrastructure with only a 99.5% SLA
What developers say
G2 4.6/5 · 29 reviews
Developers praise Weaviate's hybrid search, clean Python/REST APIs, and documentation, but consistently flag that managed cloud cost and pricing legibility get painful at scale.
“Weaviate makes it incredibly easy to implement semantic search and generative AI applications, with smooth integration with Python and REST APIs and powerful support for hybrid search (vector + keyword).”
Key figures
| Recall@10 / QPS / mean latency (SIFT1M, 128-dim) | 98.35% recall, 10,940 QPS, 1.44ms mean, 3.13ms p99 | Weaviate official ANN benchmark ↗ |
| Recall@10 / QPS / mean latency (DBPedia-OpenAI, 1536-dim) | 97.24% recall, 5,639 QPS, 2.80ms mean, 4.43ms p99 | Weaviate official ANN benchmark ↗ |
| Recall@10 / QPS / mean latency (MSMARCO Snowflake, 8.8M, 768-dim) | 97.36% recall, 7,363 QPS, 2.15ms mean, 3.69ms p99 | Weaviate official ANN benchmark ↗ |
| Recall@10 / QPS / mean latency (Sphere DPR, 10M, 768-dim) | 96.06% recall, 3,523 QPS, 4.49ms mean, 7.73ms p99 | Weaviate official ANN benchmark ↗ |
| Uptime SLA (Flex / Premium) | 99.5% (Flex, shared GCP) to 99.95% (Premium) | Weaviate pricing page ↗ |
| Flex price (vector dimensions / storage / backup) | $0.00465/1M dims, $0.12/GiB storage, $0.0264/GiB backup; $45/mo minimum | Weaviate pricing page ↗ |
Compare Weaviate Cloud head to head
Sources
- https://weaviate.io/pricing
- https://weaviate.io/blog/weaviate-cloud-pricing-update
- https://docs.weaviate.io/weaviate/benchmarks/ann
- https://docs.weaviate.io/weaviate/client-libraries
- https://www.g2.com/products/weaviate/reviews
- https://www.eesel.ai/blog/weaviate-pricing
- https://aws.amazon.com/marketplace/pp/prodview-27nbweprm7hha
Figures last verified 2026-06-27. Spotted an error? corrections@apibenchmarks.com
