Grounded search

Search GenAIWiki

Search across models, tools, comparisons, tutorials, and glossary entries — with sources shown.

GenAIWiki

Grounded AI answer — wiki index sources only

Searching GenAIWiki index…

Full results from the index

All matches for “vector database RAG”, grouped by content type.

Tools

Qdrant

Vector database focused on high-performance similarity search with strong payload filtering, hybrid retrieval features, and both open-source and managed cloud options.

Strong match

Weaviate

Open source vector database with hybrid search, metadata filtering, and flexible deployment options across self-hosted clusters and managed cloud environments.

Strong match

Pinecone

Managed vector database for semantic search and RAG systems with metadata filtering, namespaces, and cloud-hosted reliability for production retrieval workloads.

Chroma

Chroma is an open-source embedding database designed for managing and searching embeddings efficiently. It provides robust performance with sub-100ms latency for retrieval tasks.

Milvus

An open-source vector database designed for high-performance similarity search and analysis of large-scale vector data. It handles millions of vectors efficiently with a query latency of under 100ms for similarity searches.

FAISS

FAISS (Facebook AI Similarity Search) is a library for efficient similarity search and clustering of dense vectors. It allows for millions of items to be searched with latency typically under 100ms for nearest neighbor searches.

LlamaIndex

Data framework for LLM applications focused on ingestion pipelines, indexing, retrieval, and query orchestration over private and enterprise content sources.

Want a cited narrative answer?

Ask GenAIWiki →

Tutorials

Graph RAG for Entity-Heavy Domains

Explore the use of Graph Retrieval-Augmented Generation (RAG) for domains with complex entities, requiring knowledge of graph databases and RAG techniques.

Strong match

Graph RAG for Entity-Heavy Domains: A Practical Guide

This tutorial delves into using Graph RAG (Retrieval-Augmented Generation) techniques for domains rich in entities, such as legal and healthcare sectors. Prerequisites include understanding of RAG and graph database concepts.

Strong match

Pgvector Index Tuning (HNSW vs IVF)

Learn how to tune pgvector indexes using HNSW and IVF algorithms for optimal performance. Prerequisites include familiarity with PostgreSQL and vector databases.

Golden-Set Design for RAG Faithfulness

Understand how to design a golden set for evaluating the faithfulness of Retrieval-Augmented Generation (RAG) models. Prerequisites include familiarity with RAG systems and evaluation metrics.

Optimizing Golden-Set Design for RAG in Healthcare Applications

Glossary

Vector search

Vector search retrieves semantically similar items by comparing embedding vectors rather than relying only on exact keyword overlap.

Strong match

graph-database

A database specifically designed to store and navigate relationships between data points using graph structures.

Strong match

variational-autoencoder

A generative model that learns to represent data in a latent space using variational inference.

support-vector-regression

An extension of support vector machines that predicts continuous values instead of categories.

graph-attention-network

A neural network architecture that employs attention mechanisms to process graph-structured data.

Models

text-embedding-3-large

text-embedding-3-large produces high-dimensional text embeddings for semantic search, clustering, and classification. Teams pair it with pgvector or SaaS vector DBs for RAG; output dimensions can be reduced with tradeoffs described in OpenAI documentation.

Strong match

Stable Diffusion XL

Stable Diffusion XL (SDXL) 1.0 is Stability AI's latent diffusion text-to-image model for native 1024x1024 generation. The base model can run standalone or feed an optional refiner for the final denoising steps, and the published weights support self-hosted Diffusers workflows.

Strong match

Cursor Composer 2.5

Cursor's Composer 2.5 model, documented in the official Cursor changelog as a coding model used in the Auto routing flow.

Snowflake Arctic

Snowflake Arctic is an enterprise-oriented open model emphasizing efficient training recipes and SQL-adjacent enterprise tasks inside the Snowflake ecosystem. It targets teams that want LLM features colocated with governed data in Snowflake Cortex.

DeepSeek-V3

DeepSeek-V3 is a large-scale language model family noted for strong coding and math performance under open or research-friendly terms (verify the exact license for your deployment). Teams adopt it for cost-sensitive research, self-hosted inference, or comparison against frontier APIs.