One Database to Rule Them All: Why PostgreSQL + pgvector Is the Only Backend AI Agents Need

Every AI platform has an opinion about its database. Most have the same opinion: "You need PostgreSQL for your relational data and Pinecone for your vectors." Or Weaviate. Or Qdrant. Or Milvus. Pick your flavor of separate vector database.

AACFlow has a different opinion: you need PostgreSQL. Period.

We run one database: PostgreSQL with the pgvector extension. Relational data (users, workspaces, workflows, executions, triggers, connectors) and vector data (embeddings for RAG, semantic search, agent memory) live in the same system. No separate vector database. No synchronization between two databases. No "which database has the latest data?" questions at 2 AM.

This is not a compromise. It's an architectural choice that has paid off in reliability, simplicity, and performance.

Why One Database?

The standard "PostgreSQL + Pinecone" architecture looks reasonable on a whiteboard. In production, it creates problems:

Two systems to operate. Backups, migrations, monitoring, scaling — all doubled. Two sets of credentials. Two connection pools. Two failure modes. When something breaks at 3 AM, you have to figure out which database broke.

No transactional consistency. You insert a document into PostgreSQL. You embed it and store the vector in Pinecone. The PostgreSQL insert succeeds but the Pinecone insert fails. Now you have a document without a vector. Or you delete a document from PostgreSQL but the Pinecone delete times out. Now you have a ghost vector that returns in search results. Either way, your data is inconsistent.

One Database to Rule Them All: Why PostgreSQL + pgvector Is the Only Backend AI Agents Need

Why One Database?

Related posts

The pgvector Extension: Vectors as First-Class Citizens

HNSW Indexes: Fast Similarity Search at Scale

Hybrid Search: SQL WHERE + Vector Similarity

Real Query Patterns from Production

The Drizzle ORM Layer: Type Safety From Schema to Query

Migration Strategy: How We Got Here

Performance: Handling Millions of Vectors

What We Don't Use (and Why)

The Philosophy: Fewer Moving Parts