OpenAI Scales PostgreSQL for 800 Million ChatGPT Users
OpenAI scales PostgreSQL to support 800 million ChatGPT users, handling millions of queries per second with a single primary and 50 read replicas.

OpenAI Scales PostgreSQL for 800 Million ChatGPT Users
OpenAI has unveiled its engineering accomplishments in scaling PostgreSQL to manage millions of queries per second (QPS), supporting 800 million ChatGPT users. This feat is achieved through a single primary Azure PostgreSQL flexible server instance, backed by nearly 50 read replicas across multiple global regions. The details were shared by engineer Bohan Zhang on OpenAI's official blog, highlighting optimizations in application-layer tweaks, database tuning, caching, rate limiting, and workload isolation. These efforts demonstrate PostgreSQL's capability for massive read-heavy workloads amid a 10x load growth over the past year (OpenAI).
Technical Breakdown: From Surge to Scale
The scaling journey began after ChatGPT's launch in late 2022, when traffic surged unexpectedly. OpenAI responded with rapid optimizations:
- Application Logic Refinement: Reduced inefficient queries.
- PostgreSQL Tuning: Adjusted parameters for better performance.
- Scaling: Vertically scaled instance sizes and horizontally expanded read replicas.
This setup maintains near-zero replication lag and low-latency reads across geo-distributed regions, providing substantial headroom for future growth (OpenAI).
Key Strategies
- Replicas and Replication: A single primary handles writes, with ~50 read replicas distributing read load globally on Azure infrastructure. High availability on the primary side ensures resilience.
- Caching and Rate Limiting: Aggressive caching offloads repeated reads, while rate limiting prevents overload spikes.
- Workload Isolation: Separating read/write traffic and isolating workloads minimizes contention.
Despite successes, challenges emerged, such as Multi-Version Concurrency Control (MVCC) causing write amplification, prompting a shift toward sharded databases for newer workloads (YouTube).
Historical Context: PostgreSQL Evolution
PostgreSQL has been a backbone for OpenAI's core products like ChatGPT and its API. However, the post-ChatGPT boom tested its limits. By 2024-2025, a 10x load growth demanded reinvention. Earlier efforts focused on basic vertical scaling, but 2025 optimizations unlocked "millions of QPS," far beyond conventional wisdom for PostgreSQL in read-heavy scenarios (OpenAI).
Competitor Comparison: PostgreSQL vs. Alternatives
OpenAI's PostgreSQL reliance contrasts with peers using NoSQL or custom solutions:
| Database Approach | Company/Example | Key Strengths | Limitations vs. OpenAI's PostgreSQL |
|---|---|---|---|
| PostgreSQL (Single Primary + Replicas) | OpenAI (ChatGPT) | ACID compliance, SQL familiarity | Write amplification, single-primary risks |
| Cassandra/ScyllaDB | Discord | Horizontal write scaling | Weaker consistency, complex queries |
| CockroachDB | Cockroach Labs | Distributed SQL, auto-sharding | Higher latency/cost at scale |
| Custom Spanner-like | Global consistency | Proprietary, integration hurdles | |
| TiDB | ByteDance | MySQL-compatible HTAP | Less mature for AI vector workloads |
OpenAI's approach excels in read-heavy AI inference, where PostgreSQL's reliability trumps NoSQL's eventual consistency. However, for write-intensive training data, sharding is incoming.
Strategic Timing Amid AI Data Explosion
This disclosure comes as ChatGPT reaches 800 million users—likely weekly actives—and OpenAI eyes profitability post-$157B valuation. It counters narratives of PostgreSQL "failing at scale," reframing it as an architecture lesson: "PostgreSQL didn't fail—my assumptions did" (OpenAI).
Implications for Industry
OpenAI's blueprint validates PostgreSQL for AI at planetary scale, challenging myths of inevitable NoSQL migration. Developers gain a playbook: prioritize read replicas, zero-lag replication, and hybrid sharding. For Azure users, it highlights flexible servers' untapped potential.
Critiques persist: Single-primary fragility demands vigilance, and full sharding may loom for writes. Yet, powering ChatGPT's global dominance proves relational databases endure in AI's data deluge (OpenAI).



