pgEdge Posts from Antony Pegg

Meeting High Availability Requirements in Non-Distributed PostgreSQL Deployments

Fri, 31 Oct 2025 18:25:07 GMT

High availability in PostgreSQL doesn't always require a globally distributed architecture. Sometimes you need reliable failover and replication within a single datacentre or region. pgEdge Enterprise Postgres handles this scenario with a production-ready PostgreSQL distribution that includes the tools you need for high availability out of the box.

What You Get

pgEdge Enterprise Postgres bundles PostgreSQL with components you'd otherwise install and configure separately. The distribution includes pgBouncer for connection pooling, pgBackRest for backup and restore, pgAudit for audit logging, and pgAdmin for database management. You also get PostGIS and pgVector if you need geographic or vector comparison capabilities.High availability support is added in through the addition of open-source PostgreSQL extensions from pgEdge (all available to work with on GitHub), like spock, lolor, and snowflake sequences.The package supports PostgreSQL versions 16, 17, and 18 running on Red Hat Enterprise Linux v9 and v10 (including Rocky, Alma, and Oracle Enterprise) on both x86 and ARM architectures. You can currently deploy it as a VM, and we'll be adding container and managed cloud editions soon (keep an eye on our social channels for updates along the way).

High Availability Through Physical and Logical Replication

pgEdge Enterprise Postgres supports physical replication with automated failover for traditional high availability setups. This works for the standard scenario where you need a primary database with standby replicas ready to take over if the primary fails.For more advanced scenarios, Spock comes bundled out-of-the-box to enable logical multi-master replication. Unlike physical replication that copies entire database clusters at the block level, Spock uses PostgreSQL's logical decoding to replicate individual table changes between nodes. This enables active-active deployments where multiple nodes can accept writes simultaneously.

Spock: Multi-Master Replication for High Availability

Spock provides multi-master replication for PostgreSQL 15 and later. The extension comes pre-integrated with pgEdge Enterprise Postgres, built against a patched 100% standard PostgreSQL installation that integrates necessary hooks directly into the database engine.

Core Capabilities

Spock replicates data between nodes using logical decoding. You can configure provider and subscriber nodes, add tables to replication sets, and create subscriptions that keep data synchronised across multiple PostgreSQL instances. The extension supports replicating tables from the same schemas across nodes.The extension tracks commit timestamps for conflict resolution. When multiple nodes update the same row simultaneously, Spock uses these timestamps to determine which change takes precedence. This is important for high availability because you can continue processing transactions on any available node without waiting for failover procedures.

Beyond Table Data

Spock integrates with additional extensions included in pgEdge Enterprise Postgres:LOLOR (Large Object Logical Replication) extends Spock to replicate PostgreSQL large objects. Standard logical replication doesn't handle large objects, but LOLOR adds this capability so you can replicate binary data stored using PostgreSQL's large object facility.Snowflake Sequences handles sequence replication across nodes. In a multi-master setup, you need sequences that won't generate conflicting values across different nodes. This extension provides sequences that work correctly in distributed scenarios where multiple nodes generate IDs simultaneously.

Schema Requirements

Spock requires tables to have identical structures across all nodes. Tables must have the same names, schemas, column definitions, and primary keys. Check constraints and not null constraints need to be the same or more permissive on the subscriber than on the provider.This strictness exists because logical replication applies changes based on row identity through the primary key. The extension needs matching schemas to reliably identify and update rows across nodes.

High Availability Without Downtime

With Spock configured across multiple nodes, you can route traffic to any node in your cluster. If one node fails, your application continues writing to the remaining nodes without downtime. You don't wait for a failover election or for a standby to get promoted to primary.This differs from physical replication where only the primary accepts writes. With Spock, every node can accept writes, giving you true active-active capability within your datacentre or region.

Connection Pooling and Workload Management

pgBouncer handles connection pooling to manage database connections efficiently. This is important for high availability because connection storms during failover events can overwhelm a database. With connection pooling in place, you can limit and manage connections to prevent resource exhaustion when traffic shifts between nodes.The bundled pgBouncer configuration works with the rest of the stack, so you don't need to figure out how to integrate a separate connection pooler with your replication setup.

Backup and Recovery

pgBackRest provides advanced backup and restore capabilities. In a high availability setup, you need reliable backups that work with your replication architecture. pgBackRest handles full and incremental backups, parallel backup and restore operations, and can work across multiple repositories.Having integrated backup tooling means your backup strategy accounts for your replication setup from the start, rather than bolting on backup solutions that might not understand your multi-node configuration.

The Open Source Approach

pgEdge Enterprise Postgres runs on 100% standard PostgreSQL (bar some small patches which do not affect compatibility) with no proprietary forks. The distribution is fully open source, licenced under the permissive PostgreSQL Licence. You can use any extensions or tools from the PostgreSQL ecosystem that you'd like.Need support for PostgreSQL and the included tools in your deployments? pgEdge offers 24x7x365 support subscriptions with access to PostgreSQL experts. Forward Deployed Engineer services are available for organisations that need dedicated assistance with architecture reviews, performance tuning, and ongoing guidance.

When to Use pgEdge Enterprise Postgres

Consider using pgEdge Enterprise Postgres when you need high availability within a single geographic region or datacentre, when you want to avoid managing multiple PostgreSQL installations and extensions separately, or when you need the option to scale to multi-master replication without changing your entire infrastructure.If you value having a tested, integrated stack over assembling individual components yourself, this package is designed for you. The logical replication capabilities through Spock add flexibility that physical replication can't provide, particularly for scenarios where you need multiple writable nodes or want to minimise downtime during maintenance windows.Read more about how to get started in the official pgEdge Enterprise Postgres docs , or check out the pgEdge GitHub page to browse our code repositories. Have any questions along the way? You can join our official Discord community channel - we're here to help!

When Failure Isn't an Option: Choosing Postgres for Critical Operations

Mon, 22 Sep 2025 04:44:50 GMT

just about any use case Using Postgres means you get total control over your data and how it’s managed; it’s the ultimate lens for understanding your infrastructure, reducing costs, and optimizing your workload for performance, high availability, and resiliency.With decades of development backing the project and a global community contributing from every industry and background, Postgres has become a solid choice as a data management solution for mission critical applications across any kind of workload, including geospatial, vector, time-series, IoT, OLTP, and OLAP.Postgres adoption is growing rapidly, and businesses that end up using it often have an “a-ha!” moment that leads them to understand that Postgres really does work for a diverse array of use cases.However, as enterprise demand for scalability and flexibility grows, it’s still common to see concerns arise, like:

Can Postgres Handle Our Uptime Requirements?

A survey published July 10, 2025 from Foundry focused on the evaluation of "PostgreSQL Usage in Mission-Critical Operations: From High Availability to Cloud Outages". 212 IT professionals working at companies (with 500+ employees) using Postgres were surveyed on their use of Postgres in development and production deployments.The conclusion: Postgres can be adapted to handle workloads that are in the terabytes in real-time, with fault tolerance, consistency, and availability. So much so, that in the survey it was found that the vast majority (62%) of organizations using Postgres have a hard requirement that there can be no more than four minutes of downtime a month (99.99%). 24% actually required that there be less than 30 seconds a month of downtime (99.999%).These results show Postgres is trusted by companies with extremely high availability standards to handle their data, across industries like FBSI, Software & Computing, and Manufacturing.Beyond the survey, you'll find Postgres powering well-known emerging platforms like Mastodon, established services like Groupon and Trivago, financial services companies like Revolut, and countless government institutions and international banks. Even the internet-based grocery delivery service Instacart recently announced they chose to switch from Elasticsearch to PostgreSQL and saw “nearly 80% savings on storage and indexing costs, reduced dead-end searches, and [overall improved] customer experience.”The common thread? These organizations chose Postgres not because it was free, but because it delivered the reliability, performance, and scalability their business demanded.

Choosing a Postgres HA Solution

Postgres is 100% free-and-open-source (under its own licensing). As a result, the Postgres ecosystem is vast - you should leverage it to make the most of the power of Postgres. Many distributed Postgres extensions are out there, but only a few are in complete alignment with the open-source core of Postgres, ensuring you’re not subject to unexpected limitations. (A great resource for comparing some of the options is PGScorecard.)The Foundry survey shows organizations are employing many different solutions for database failover and redundancy management. Of the solutions available:

41% are Built-in cloud provider solutions.

33% are commercial high availability products.

29% are open-source (including Patroni, CloudNativePG, repmgr, and pg_autofailover) or are custom-built.

But as with anything, it’s important to compare solutions and know the drawbacks.

Cloud Failures Do Happen

41% of solutions for handling Postgres failover and redundancy management are built into cloud provider solutions; it is worth noting that 21% of survey respondents directly experienced cloud region failures in the past 12 months that exceeded downtime goals. This problem is specific to the cloud and the nature of how it operates; if uptime is a hard requirement for your organization, you should consider the implementation of solutions such as multi-cloud or multi-region deployments.Among organizations with built-in cloud solutions, the list of cloud providers is topped by AWS.

AWS RDS: 55%

AWS cross-region backups: 55%

AWS Aurora Global Database: 45%

Azure Cosmos DB: 29%

Google Cloud SQL: 24%

Other cloud provider technologies: 12%

Postgres has the Ecosystem Advantage

Postgres extensibility is what separates it from other commercial options on the market. Extensibility means you're not locked into a single vendor's vision of your database requirements.

Need to add time-series capabilities? TimescaleDB extends Postgres without breaking compatibility.

Want full-text search? Postgres' built-in features rival dedicated search engines.

Interested in vector similarity search? pgvector (and many others, including pgvectorscale and pgai) handles AI workloads at scale.

But here's the critical part: stick with extensions that maintain Postgres compatibility. Avoid solutions that require proprietary SQL syntax or lock you into specific deployment patterns. The power of Postgres is that it remains Postgres, regardless of how you extend it.For distributed deployments and high availability scenarios, solutions like pgEdge provide multi-master replication while maintaining full Postgres compatibility. You get the benefits of a distributed system without the vendor lock-in or learning curve of platform-specific alternatives.

Enterprise-Grade Availability

If you have an enterprise or project with a use case for 99.99% of high availability and above, Postgres can deliver on your requirements. pgEdge Enterprise Postgres is a great example of a robust Postgres ecosystem that delivers low latency, ultra-high availability, redundancy, and reliability. When you consider a solution like pgEdge, you'll find you can run fully open, fully distributed Postgres with advantages over other database alternatives, such as Oracle's GoldenGate, CockroachDB, or AWS RDS. pgEdge Enterprise Postgres comes without vendor lock-in or the steep learning curves associated with platform-specific SQL syntax, or other obstacles to seamless integration.

More Than Just "Good Enough"

Postgres is no longer just "good enough"; it does so much more, and the thriving community behind the project ensures that it will continue to adjust to handle modern day use cases for years to come. Have specific requests or things you'd like to see implemented in the project? Consider contributing time and/or code to the Postgres project and be the change you want to see.The question isn't whether Postgres can handle your critical workloads; companies across every industry have already proven it can. The question is whether you're ready to make the move.When failure truly isn't an option, Postgres delivers. The technology is robust, the community is strong, the ecosystem is growing, and the operational costs are predictable. What's taking you so long to make the switch and join us?

Scaling Without Stopping: Inside pgEdge Distributed Postgres Zero-Downtime and Exception-Resilient Replication

Thu, 21 Aug 2025 06:16:00 GMT

pgEdge Distributed Postgres v25 introduces several new features that make managing and scaling distributed PostgreSQL clusters dramatically easier and more resilient. Among these, zero-downtime node addition and a new Apply-Replay mechanism for replication exception handling stand out for their ability to improve operational efficiency and system stability, allowing teams to scale and more effectively handle a wide range of runtime scenarios with minimal disruption.

What Is the Spock Extension?

Spock is pgEdge’s advanced logical replication extension for PostgreSQL. It powers active-active, multi-master clusters with support for row filtering, column projection, conflict handling, and more. Spock is a core component underpinning both the self-hosted pgEdge Distributed postreSQL: VM Edition and the managed pgEdge Distributed postreSQL: Cloud Edition from pgEdge.While Spock is descended from earlier projects like pgLogical and BDR 1, it has evolved far beyond its roots. Backed by a dedicated team of PostgreSQL experts, Spock has undergone continuous innovation and improvement to become a high-performance, enterprise-grade replication system built for distributed environments.

Feature Spotlight: Zero-Downtime Node Addition

Adding a new node to a live cluster has historically meant a tradeoff between downtime and complexity. In Spock 5.0, zero-downtime node addition eliminates this tradeoff entirely.This feature allows you to add a new PostgreSQL node to an existing Spock cluster without requiring any downtime on the origin or existing subscriber nodes. The feature works by creating a temporary replication slot and subscription, and allowing the new node to clone the origin’s state in parallel. Once synchronization is complete, the temporary slot is retired, and the new node is promoted to a fully active peer.

Benefits of Zero-Downtime Node Addition:

No service interruption or replication pause

Safe scaling of production clusters

Minimal manual intervention

Works with the standard Spock CLI or via scripted workflows

This workflow is based on a coordinated process that ensures the new node joins the cluster cleanly and consistently:

This approach ensures a seamless and accurate integration of the new node without interrupting activity in the existing cluster.This same workflow can be used to perform seamless, in-place major PostgreSQL version upgrades across your entire cluster. By introducing a new node running the desired higher version of PostgreSQL, and following the coordinated steps for data synchronization and slot management, you can bring an updated node into the cluster without interrupting read or write traffic. Once the new node is in place, older-version nodes can be removed or replaced one at a time, performing a rolling upgrade with zero downtime. This approach provides a safe, flexible method for upgrading infrastructure while maintaining full application availability.For users ready to implement zero-downtime node addition in their own environment, the Spock documentation offers a step-by-step guide to the full process.To further support implementation, the Spock GitHub repository provides several working examples:

A Python-based orchestration script, designed to run outside the database and coordinate the node addition via external automation tools.

A stored procedure version that performs the entire process within PostgreSQL itself using the dblink extension, offering a fully internal option ideal for controlled or restricted environments.

You can explore these examples in the samples/Z0DAN directory .For more complex or scripted rollouts, pgEdge also provides spockctrl, a lightweight command-line orchestrator written in C. Spockctrl accepts a structured JSON plan that defines the sequence of operations to perform, including a sample JSON file demonstrating how to add a new node. Both the tool and sample configurations can be found in the spockctrl directory .

Ensuring Seamless Node Addition with LSN Checkpointing

pgEdge’s Spock 5.0 extension includes functions that make seamless node addition with zero downtime possible. LSN checkpointing (using the spock.sync_event() and spock.wait_for_sync_event() functions) allows you to create a logical checkpoint in the WAL stream on the source node, and then monitor another node for the arrival of that checkpoint's LSN to ensure that all transactions have completed. When adding a node, you can use this to guarantee that schema or data changes have been fully replicated to your source node before you continue.Used in the context of zero-downtime node addition, these functions are critical. When you have confirmed that any in-flight transactions (from all nodes) have arrived on the designated source node, you can initiate a data copy to the new node. Without this precise synchronization, adding a node without interrupting cluster usage would not be possible; Spock's checkpointing guarantees safety and consistency as part of the overall orchestration strategy.

Feature Spotlight: Apply-Replay for Exception Handling

Spock has always been more resilient than earlier logical replication solutions. Neither pgLogical nor BDR 1 included any automatic error recovery. Spock, by contrast, introduced automated exception handling early in its lifecycle.

Exception Handling in Early Spock Versions

nitially, when a data conflict occurred during replication that couldn't be automatically resolved, the apply worker would restart itself, re-request the transaction from the origin, and continue from there. This was already a step above legacy approaches. Spock also allows you to configure how these exceptions are handled: you can pause replication and discard the problematic transaction, or Spock can step through each sub-transaction to isolate only the part of the transaction that caused the exception.Although better than the other open source solutions, this exception handling process came at a cost:

The apply worker had to terminate and restart

The transaction had to be re-fetched over the network

XID resources were consumed during replay

The overall process
could introduce some
replication lag and resource overhead

Taking a New Approach in Spock 5.0

With Spock 5.0, this process has been vastly improved via a new “Apply-Replay” mechanism.Now, when a replicated transaction encounters an exception, the apply worker does not terminate. Instead, Spock buffers transactions in memory up to a default of 4MB (configurable via a new GUC: spock.exception_replay_queue_size). If an exception occurs, the apply worker enters exception-handling mode and simply replays the buffered transaction from memory.

Benefits of Apply-Replay:

No worker restart required

No need to re-fetch from origin

Dramatic reduction in lag during exception handling

Reduced XID usage and less WAL churn

This enhancement significantly improves replication stability, especially in high-throughput environments or under intermittent network conditions where transient data conflicts could previously cause substantial delays.In a synthetic benchmark test involving 10,000 intentionally triggered conflicts that would cause an exception, the Apply-Replay feature resolved all exceptions in just 3 to 4 seconds—compared to over 5 minutes using the previous approach. This represents a dramatic leap in both speed and efficiency for exception handling in distributed PostgreSQL clusters.Large transactions that exceed the memory size will still use the old process, but we are currently at work making those transactions that exceed the allotted memory to be written to, and replayed, from disk, allowing us to completely retire the previous approach.

Why These Features Matter

Both zero-downtime node addition and Apply-Replay reflect a core philosophy behind Spock 5.0: to eliminate avoidable disruption and make high-availability PostgreSQL truly hands-off at scale. These improvements:

Reduce operational complexity

Lower the risk of human error

Improve cluster elasticity and resilience

Enable real-time distributed applications to stay online and in sync

Truly allow full zero-downtime for reads and writes across all nodes in a cluster while expanding or upgrading.

Final Thoughts

The pgEdge Spock 5.0 extension isn’t just a version bump—it is a major change to how robust, fast, and easy logical replication can be. Whether you’re managing global clusters or network edge deployments, the new features highlighted here will help your team scale smarter and operate more effectively. And these are just part of a broader set of enhancements: Spock 5.0 also includes improved automatic conflict resolution that can now handle more conflict scenarios without user intervention, along with other performance and usability upgrades that make it the most capable version yet.Spock 5.0 is available now as a fully integrated component of both the self-hosted pgEdge Distributed Postgres: VM Edition 25.2 and the managed SaaS pgEdge Distributed Postgres: Cloud Edition offering. Whether you’re running your infrastructure on-premises or in the cloud, you can take advantage of the powerful features described in this post, plus many others, to build fast, resilient, globally distributed PostgreSQL applications.To learn more about pgEdge’s distributed multi-master replication technology, visit the pgEdge website or explore the pgEdge documentation .

pgEdge Distributed PostgreSQL Now Available on Akamai Cloud

Wed, 18 Jun 2025 13:47:47 GMT

Today your applications face unprecedented demands: they must be always-on, globally responsive, and capable of serving users anywhere with the option of meeting complex data residency requirements. For web and AI applications that rely on PostgreSQL, we're excited to announce that the pgEdge Distributed PostgreSQL platform is now available on Akamai Cloud (formerly Linode), a cloud vendor that "brings core cloud computing and edge computing together, along with industry-leading security — all on the most distributed network on the planet." Together, pgEdge and Akamai Cloud create a powerful solution, bringing performance and availability for database infrastructure to the network edge.

Providing Consistency to Akamai Cloud with pgEdge

While Akamai has long made it easy for developers to place their applications close to users through their extensive global infrastructure (with over 4,100 points of presence), many databases have remained centralized, creating latency bottlenecks and single points of failure. Now, with pgEdge running on Akamai Cloud, you can deploy distributed active-active multi-master PostgreSQL databases at or near the edge to ensure your applications deliver consistently fast performance regardless of where your users are located.

How pgEdge Makes a Difference

pgEdge is a fully distributed PostgreSQL database optimized for high availability and low latency. As a true multi-master (active-active) distributed database, pgEdge facilitates read and/or write operations on every node in a cluster, providing several key advantages:100% Standard PostgreSQL: pgEdge maintains full compatibility with PostgreSQL, including complete language support, triggers, stored procedures, all data types, functions, operators, and the full SQL syntax. This means you can leverage existing PostgreSQL expertise and tools without modification.100% Open (Source Available): Our source code is completely open and available for review, ensuring transparency and security for deployments everywhere.Write Anywhere Architecture: Unlike traditional read-replica setups, pgEdge's multi-master architecture allows you to write to any node in your cluster. Our logical replication system keeps nodes synchronized while providing automatic conflict resolution and load distribution.Fault Tolerance: If one data center experiences an outage—whether from infrastructure failure or something as simple as a severed fiber optic cable—when applications are configured for zero-downtime operations, traffic can be automatically rerouted to another database node for optimal resilience.Interested in learning more about pgEdge on your own? Don’t forget, you can:

Real-World Impact

Customers, ranging from startups to Fortune 500 companies and governmental organizations like the European Parliament, leverage pgEdge's flexibility to address diverse use cases such as:

AI at the Edge

By distributing your PostgreSQL databases globally, you can:

Share state and context across geographic locations

Maintain data locality for compliance requirements

Reduce response times for AI inference requests

Scale AI workloads efficiently across regions

One of the most compelling use cases for pgEdge on Akamai Cloud is enabling AI at the edge. This distributed data layer opens up additional flexibility for handling complex AI use cases in a performant manner while maintaining application availability.

pgEdge Deployment Options for Akamai Cloud

pgEdge is certified to run on Akamai Cloud with flexible self-hosted deployment options:

Looking to the Future

The combination of pgEdge's distributed PostgreSQL capabilities and Akamai Cloud's global infrastructure represents a fundamental shift in how we think about database architecture. Instead of accepting the latency and availability trade-offs of a centralized database, you can deliver consistently fast, reliable data access to users anywhere in the world, self-hosting your instances with vendors like Akamai Cloud or receiving fully managed services with pgEdge Cloud.Whether you're building the next generation of web applications, deploying AI workloads at scale, or modernizing existing systems for global reach, pgEdge on Akamai Cloud provides the foundation you need to succeed.Ready to get started? Head to the Platform Download for Akamai page to start deploying distributed PostgreSQL on Akamai Cloud, contact our team to discuss your specific use case and requirements, or schedule a live demo to see how it all works. Stay tuned for our upcoming tutorial on setting up pgEdge specifically with Akamai Cloud, coming soon!

Unlocking The Power of Multi-Master: 7 Migration Design Considerations

Mon, 14 Apr 2025 06:25:09 GMT

From Sundials to Chronometers: The Shift to Multi-Master

Making the move from a traditional single-master database to a multi-master system is like trading in your sundial for a marine chronometer. A sundial is simple, reliable… and completely dependent on a single source of truth (the sun overhead). It works, but only if conditions are perfect and you're standing in the same place. A chronometer, on the other hand, lets you navigate the open seas, across longitudes, giving you freedom you never had before, but it demands precision, discipline, and an entirely new way of thinking about time.The same is true when moving from a single-master (single-write) database to a multi-master (also known as active-active) database. You gain the benefits of global availability, reduced latency, and higher resilience, but you are also now dealing with changes happening simultaneously across space ( on distributed nodes). To take complete advantage of a multi-master replication cluster, you may want to make some fundamental changes to your original underlying schema design, application distribution, and possibly data residency.Switching to a multi-master architecture isn’t just about changing how you replicate data, it’s about unlocking a fundamentally new capability for your applications. This blog outlines the limitations of a centralized system and how to move to a globally distributed, highly available writes and reads system.

Multi-Master vs Single-Master: The Big Picture

In a single-master database, there is only ever one source of truth at any given moment. Every , , or happens in a predictable, linear timeline. With multi-master replication, you are working with multiple timelines converging, synchronizing, and sometimes even colliding.This post covers a few of the most immediate and major considerations to help get you started.

Why Every Table Needs a Primary Key In Multi-Master Replication

Primary Keys aren’t just best practice, they are essential. In a multi-master system, replication depends on knowing exactly which row is which, even when written to different nodes at the same time. Without a primary key, your replication system has no guaranteed way of identifying and synchronizing rows correctly.Every replicated table should have a unique primary key. If you're missing them, you’re going to want to add them. BUT! Read on before you do.

Uniqueness Across Space: Generating Unique Primary Key IDs Safely

In a single-master system, simple auto-incremented sequences or UUIDs often suffice. On a multi-master cluster, they can become dangerous. Imagine two nodes both inserting new rows at exactly the same time — using the same sequence starting at 1. You'll end up with duplicate primary keys and immediate replication conflicts.Here’s a hypothetical scenario:A replicated table of “customers” currently has 200 rows in it, with primary key IDs from 1 - 200.

At 12:00 PM, Node A in Germany creates a new customer row locally, with the next auto-assigned ID of 201, with a customer name of “John Smith”

Simultaneously, Node B in California also creates a new row locally, also with the ID of 201, for a customer with the name of “Jane Doe”

Both new rows are then replicated to each other. Each node receives the other row, but they both have the same primary key of 201. So, is 201 meant to be John Smith, or Jane Doe?The solution is to use an ID generation strategy designed for distributed systems. There are a few ways this could be approached, such as using node-specific ID ranges or UUIDs (but be cautious of index bloat). For pgEdge, we recommend using Snowflake sequences. A Snowflake sequence is a globally unique, time-ordered ID that is unique across nodes. You can read more about them in the pgEdge documentation.Snowflake sequences are composite values that let you:

add or modify data in different regions while ensuring a unique transaction sequence.

preserve unique transaction identifiers without manual/administrative management of a numbering scheme.

accurately identify the order in which globally distributed transactions are performed.

pgEdge provides functions to conver t your PostgreSQL sequence key field to a Snowflake sequence field. After converting a table to use a Snowflake sequence, old keys remain in the original format, but new keys are a unique composite value that contains information about the row. For example, in the following query, you can see a mix of PostgreSQL-style sequences and Snowflake sequences:In the first column, you can see that the id assigned to each new row changes from a simple value to a more complex Snowflake sequence after the first seven rows - that change indicates the point at which the table was converted to use a Snowflake sequence for its primary key.Since a Snowflake sequence is a composite value, it provides a bonus; you can use a Snowflake function to extrapolate information from each unique id; for example:

Multi-Master Conflict Management

What are Conflicts and Why Do They Happen?

In a multi-master system, conflicts arise when two nodes make concurrent changes to the same data before they have a chance to synchronize. Examples include:

In a single-master system, like a single sundial, the linear timeline is enforced by WRITE transactions happening on a single node in the order in which they occur - conflicts can't happen. In a multi-master system however, like the ships roaming the seas with chronometers that must stay in sync with Greenwich Mean Time, there are now multiple, simultaneous writers that are geographically distributed. This means there is replication lag that must be dealt with.Timeline:Without timestamps: Write X might incorrectly overwrite Write Y.With timestamps: Write Y wins, as it happened later.

Approaches to Conflict Management

Resolution: Handling Conflicts After They Happen

The most common approach is to detect and resolve conflicts when synchronizing, through the use of accurate timestamps

Last-Update-Wins: The change with the latest timestamp wins.

Insert-Insert conflict resolution: When two inserts collide, the one with the latest timetamp is converted into a full-row update, ensuring no data is lost.

This makes accurate clocks critical. pgEdge ensures this by maintaining a monotonically increasing logical clock on each node, preventing clock drift from causing inconsistent conflict resolution.

Avoidance: Designing to Prevent Conflicts

A more elegant approach is, wherever possible, to avoid conflicts entirely through smart data modeling. Two of the main approaches here are:

CRDTs (Conflict-Free Replicated Data Types): Data structures designed to automatically merge without conflict.

Immutable Data Patterns: Prefer insert-only or append-only models where possible.

Summed-Value Fields Need Special Handling - The Delta_Apply CRDT

In pgEdge, one of the most practical tools for multi-master conflict avoidance is the mechanism. Instead of sending the final value, pgEdge can replicate the change itself (the delta), allowing each node to apply the adjustment rather than overwrite the value.Think about values that are naturally summed over time:

Bank account balances

Inventory quantities

Game scores

If Node A and Node B both adjust the same field concurrently, which value should win? If you rely on simple "last write wins" logic, you'll lose data.By replicating deltas instead of full values for numeric fields, you remove the possibility of overwriting concurrent increments or decrements. This is especially important for fields like balances, counters, and inventory levels.This ensures that all concurrent adjustments are merged correctly, rather than just overwritten. This one function can handle all numeric column types, without requiring any additional schema changes.

Why delta_apply Matters: A Concrete Example

Imagine you are managing a simple bank account balance replicated across two nodes (Node A and Node B). The account starts with a balance of $100.

Without delta_apply:

Each node independently updates its local copy of the balance based on concurrent transactions.In a multi-master system, as replication occurs each node will try to overwrite the other node’s balance with its own final value:

Node A will send: balance = $70

Node B will send: balance = $150

Depending on conflict resolution (typically Last-Update-Wins), you might end up with either $70 or $150, but the correct value should have been: $100 - $30 + $50 = $120

With delta_apply enabled:

When the system replicates the transactions, it no longer tries to send full values. Instead, it sends deltas:

Node A sends: delta = -30

Node B sends: delta = +50

The receiving nodes will now apply both changes, no matter the order:$100 - $30 + $50 = $120This is why is essential for fields that store a summed value. It preserves correctness even when updates happen concurrently across nodes.

Rethinking Backup & Restore in a Multi-Master World

In a single-master (or primary-replica) database, backup and restore are relatively straightforward:

You take a snapshot (base backup + WAL logs) from the primary node.

You can restore this snapshot to a new replica or a replacement primary.

But in a multi-master system, things are more complex, because:

There is no single "source of truth" — all nodes are simultaneously authoritative.

The replication state (including sequence numbers, logical replication positions, and timestamps) is part of the system’s integrity.

Restoring from an old backup can cause immediate replication conflicts or inconsistencies if not done carefully.

Why Standard Backups Can Go Wrong

Imagine restoring a node from a snapshot that is 1 hour old:

The node will start with stale data and outdated replication state.

Upon reconnecting, it may replicate old changes as if they were new, or it may incorrectly attempt to overwrite more recent updates from other nodes.

Worse, it can trigger primary key conflicts or timestamp regressions that violate system integrity.

Principles of Multi-Master Backup & Restore

Coordinated Backups

If you're using multi-master replication, your backups should be taken from all nodes (or at least a designated consistent set) at the same logical point in time, not just one node. This ensures you can rebuild the whole cluster without conflicting histories.

Consistent Restore

You cannot restore just one node in isolation and let it rejoin an existing cluster unless you are certain that:

the backup is recent enough to be safely replayed.

the logical replication state is reconciled with the other nodes.

Node Replacement Strategy

If you lose a single node, it’s generally safer to:

Remove the failed node from the cluster.

Deploy a fresh node from a recent backup.

Let the new node perform a sync from a healthy peer to catch up.

This avoids introducing stale data back into the cluster.

Rethinking Application Connectivity for Multi-Master

When using a traditional single-master replicated database, application connection patterns typically look like this:

In this model, developers often hard-code or configure their applications to direct all write traffic to a specific host (the master) and distribute read-only traffic across replicas.

What changes in Multi-Master?

In a multi-master system like pgEdge:

Every node is writable, not just readable.

Each node participates equally in accepting writes and replicating them globally.

This means you can now:

Write locally, significantly reducing round-trip latency.

Still read locally, as before.

Why This Matters

If you continue using a single-master connection strategy:

You may still be sending writes across the globe unnecessarily.

You might not fully benefit from the performance improvements of using local writes.

You are underutilizing the core feature of multi-master replication.

Practical Connection Changes You May Need

Topology-Aware Connection Strings

Configure applications to connect to the nearest node (e.g., via DNS, load balancer, or topology-aware connection string). By doing this, your applications gain the benefits of low-latency for both reads and writes.

Connection Pool Adjustments

In some frameworks, connection pools may be tuned under the assumption that writes are slow due to network latency introduced by remote write transactions. You may want to revisit timeouts, pool sizes, and retry logic now that writes can be local and fast.

Multi-Region Application Awareness

In multi-region deployments, you may want each regional deployment of your application to connect to its co-located pgEdge node:

Region A app → Region A database node

Region B app → Region B database node

Failover Considerations

Since all nodes are writable, application failover logic may also be simplified. Instead of failing over to a remote master, or being stuck waiting for a physical standby to be promoted, you may simply redirect the connection to another nearby writable node.Example: Before vs AfterBy adapting your application’s connection strategy, you unlock one of the biggest practical benefits of pgEdge and multi-master replication: low-latency writes anywhere.

Scaling AI Inference at the Edge

Thu, 20 Feb 2025 06:15:00 GMT

In the rapidly evolving landscape of artificial intelligence (AI), the demand for real-time data processing has never been more critical. Traditional cloud-based AI inference often introduces latency by transmitting data to centralized servers for analysis. In a distributed world, a centralized inference pipeline becomes a bottleneck. This delay can be detrimental in applications requiring immediate responses, such as autonomous vehicles, production line monitoring or real-time analytics. This challenge can now be addressed by moving AI inference closer to where the inference is used, distributing AI across your network to local devices near the data source. This approach significantly reduces latency, enhances data security, and improves overall system efficiency.However, implementing AI inference at the edge presents its own set of challenges, particularly concerning data availability and consistency across distributed environments. This is where pgEdge Distributed PostgreSQL becomes indispensable. By providing a distributed PostgreSQL architecture optimized for the network edge, pgEdge ensures that data remains consistently accessible across multiple nodes, even in the face of network issues or hardware failures. This multi-master (active-active) setup guarantees high availability, lowers response times, and facilitates seamless data replication and synchronization, which are crucial for maintaining the integrity of AI models and their inferences.

Key Benefits of pgEdge for AI Applications

High Availability

pgEdge's multi-master (active-active) architecture ensures that read and write operations can occur at any node within a geographically distributed cluster. This design eliminates single points of failure, providing continuous data availability even during maintenance or unexpected outages. This resilience is crucial for AI applications that demand uninterrupted access to data for real-time processing and decision-making.

Distributed Processing

By enabling data to be stored and processed across multiple locations, pgEdge facilitates distributed AI workloads. This allows for parallel processing of large datasets, enhancing the efficiency of tasks such as training machine learning models or executing complex inference algorithms. For instance, in a three-node cluster managing a 900,000-row table, each node can process 300,000 rows concurrently, significantly reducing overall processing time, with the resulting data automatically distributed across all nodes without needing to repeat the computations. This can be especially valuable adding and maintaining embeddings for large data sets with constant and geographically distributed read/write activity.

Data Consistency Across Nodes

pgEdge employs advanced replication and conflict resolution mechanisms to maintain data consistency across all nodes in an active-active Multi-Master configuration. This ensures that AI models operate on accurate and up-to-date information, which is essential for generating reliable predictions and insights. The platform's support for synchronous read replicas within regions further enhances data integrity, making it a dependable choice for mission-critical AI applications.

Flexibility in Deployment

pgEdge's architecture supports deployment across various cloud regions and data centers, as well as on-premise or in air-gapped deployments. This unparalleled flexibility and resilience is particularly beneficial for AI applications that require scalability and adaptability to different operational environments. By integrating pgEdge into their AI infrastructure, organizations can effectively overcome the data limitations associated with centralized AI inference, thereby achieving faster decision-making processes and enhanced user experiences.

Considerations for Distributed AI Compute

While an high availability, multi-master distributed data environment is an essential foundation of a distributed AI inference implementation, it's only half of the story. The AI Compute itself also needs to be distributed to realize the full benefits.When a distributed database is integrated with a single, centralized AI compute environment, you can still encounter latency, as all nodes are required to send data to and await responses from the centralized compute resource. This bottleneck undermines the key advantages of a distributed database.

Implementing Localized AI Compute Instances

To mitigate this issue, you can deploy a localized AI compute instance in proximity to each database node. This approach ensures that AI processing, such as vector generation, occurs locally, thereby minimizing latency and reducing the need for data transmission over potentially congested networks. By processing data closer to its source, your system can achieve faster inference times and improved overall performance.

Parallel Processing Through Distributed AI Compute

Distributing AI compute resources across multiple nodes not only alleviates centralized bottlenecks but also enables parallel processing of large datasets. For instance, a smart city project can process sensor data (e.g., traffic, weather, or transport) at nodes near data sources, enabling real-time decisions like rerouting traffic or adjusting bus schedules, with global synchronization of critical updates. This parallelism accelerates data processing and leverages the inherent scalability of distributed systems, leading to more efficient AI workflows. New data received at one node can be vectorized locally for immediate use by the local application, with other nodes receiving the new or updated embeddings through logical replication.

Integrating AI Compute with Databases

The goal of course is to get the Compute as close to the data, at the point of usage, as possible. The question is just how close do you want to get?In-Database Processing: Integrating AI capabilities directly within the database using extensions like PostgresML (PGML) allows for execution of machine learning tasks without the need for an external compute system. This tight integration reduces data movement and can enhance performance for certain workloads, but there are quite a few caveats with this approach. Extensions like PGML are usually limited to running on very specific OS flavours, and can come with heavy Python library and version dependencies. The additional AI workload will have a direct impact on hardware capability considerations for the database instance, especially if you intend to use GPU acceleration, as this will also require NVIDIA CUDA libraries.In a distributed environment, other limitations are likely to emerge. For example, in our local testing and experimentation with PGML, we found that it was using Primary Key Sequences in its model storage. With active-active replication, this will cause duplicate Primary Key conflicts on insert-insert scenarios (where two nodes insert a new row with the same Primary Key value). To make PGML function in a distributed environment, pgEdge patches it to use our snowflake sequence extension, so that if a HuggingFace model was first used on one node, after download, it would replicate successfully to the other nodes. PGML also caches the model as part of its usage, making it necessary to invalidate the cache on the receiver node so that it would be rebuilt and function correctly.Sidecar Deployments: Implementing AI models as sidecar services using frameworks such as ONNX or OLLAMA enables AI processing to occur alongside the database. This configuration offers flexibility, allowing for the use of specialized hardware or software environments tailored to AI tasks while maintaining close proximity to the database. This can be seen in extensions such as localAI (which uses ONNX), pg_vectorize, and TimescaleDB’s PGAI (both of whom use OLLAMA), that support remote calls to openAI or calls to a local framework/API. Maintaining a close proximity with the Compute environment also enables the ability to add asynchronous processes to update vectors when the underlying information is added or updated, without having to use triggers to invoke and wait for the return of the updated embedding.

Practical Use Cases

This post provides a rather high level overview, but let's end with a few practical use cases:

Accelerating Vector Search

Vector search is pivotal in AI applications, enabling similarity comparisons essential for recommendation systems and semantic search. pgEdge has integrated the pgvector extension, providing efficient storage and querying of vector embeddings directly within a distributed PostgreSQL database. This integration facilitates low-latency, distributed access to embeddings, ensuring that AI-powered search operations are both swift and scalable.

Parallelizing Vectorization of Large Datasets

Handling large datasets is a common challenge in AI workflows. pgEdge's distributed architecture enables an accelerated parallel vectorization of data across multiple nodes. For example, a global e-commerce platform can use a multi-master database to process transaction logs locally on regional nodes, identifying fraud or issues in real-time, while replicating key insights across the cluster for global visibility.

Real-Time Updates to AI Models and Embeddings

In AI applications, especially those involving real-time data processing, the ability to update models and embeddings promptly is crucial. pgEdge's multi-master replication ensures that updates made to AI models or vector embeddings on any node are propagated across the entire cluster in near real-time. This capability guarantees that all nodes operate with the most current data, enhancing the accuracy and reliability of AI-driven insights.

Enhancing AI Inference at the Edge

Deploying AI inference closer to end-users reduces latency and improves responsiveness. pgEdge's support for the pgvector extension allows AI inference and similarity search requests to be processed nearer to users, delivering faster search results regardless of their location.

Implementing Edge AI for Real-Time Analytics

Edge AI enables real-time data processing and analysis without constant reliance on cloud infrastructure. By bringing computation closer to the source of data, edge AI reduces latency, optimizes bandwidth usage, and enables faster decision-making.