Veeam v13: High Availability Cluster Setup from Scratch

Veeam v13 Series
Platform: Linux (Veeam Software Appliance only)
Applies to: Veeam Backup and Replication v13.0.1+ - Veeam Data Platform Premium license required
High Availability HA Cluster VBR v13 Linux Appliance Failover

How the HA cluster works

The v13 HA cluster is a two-node active/passive setup for the Linux Software Appliance. One node is primary and runs all backup and replication operations. The secondary sits in standby, continuously receiving a replicated copy of the primary's configuration database. If the primary goes down, you promote the secondary and work resumes against the same repositories.

Replication between nodes uses Patroni, the PostgreSQL high availability framework. The primary's configuration database is continuously streamed to the secondary as WAL logs. TCP must be open between nodes for this to work - if the TCP connection drops, WAL logs accumulate on the primary until the secondary reconnects. The secondary is read-only and cannot run backup jobs independently while in standby.

What replicates between nodes: the VBR configuration database and configuration state. What does not replicate: backup data. Your repositories are completely separate infrastructure. Both nodes can reach them, and they stay exactly where they are during a failover. The only thing that moves is which node is active.

A virtual cluster IP address floats between nodes. You always connect to that IP - or its DNS name - rather than to individual node addresses. When failover or switchover happens, the virtual IP migrates to the new primary. Everything pointing at the cluster IP keeps working without reconfiguration.

Version and license requirement

HA cluster requires VBR 13.0.1 or later. It was not included in the initial 13.0.0 release. You also need Veeam Data Platform Premium license applied to the primary node. The feature is not available in Foundation, Advanced, Essentials, Community Edition, or NFR licenses.

Requirements and hard limitations

Requirement Detail
License Veeam Data Platform Premium Edition, applied to primary node only
VBR version 13.0.1 or later on both nodes
Node platform Veeam Software Appliance (Linux) only on both nodes. Windows-based VBR cannot participate
Node count Exactly two nodes. No three-node configurations
Network Both nodes and the virtual cluster IP must be on the same subnet. Static IPs required for all three. IPv4 and IPv6 cannot be mixed in the same cluster configuration
DNS Both nodes must have the same DNS suffixes and correct DNS addresses. Forward and reverse DNS records for all three addresses (primary, secondary, cluster IP) must resolve correctly before assembly
TCP between nodes TCP must be open between primary and secondary for WAL log streaming
Local repositories Cannot be used within an HA cluster. The default local repository on the backup server is not usable. All repositories must be external
Management tool for assembly Windows-based VBR console only. The wizard cannot be run from the Web UI or Host Management console
Universal Storage API plug-ins Veeam does not automatically install these on HA nodes. Install needed plug-ins on both nodes manually

What to prepare before you start

You need three static IP addresses on the same subnet: the primary node IP, the secondary node IP, and the virtual cluster IP. The primary node IP likely already exists if you have a running appliance. Reserve the other two before proceeding.

In DNS, create A records for the primary node hostname, the secondary node hostname, and the cluster endpoint hostname. Then create matching PTR reverse records for all three IPs. Test all six lookups from the machine you will run the Windows console on before touching the wizard. The official docs are explicit: if DNS is wrong, cluster assembly will fail. Sorting out DNS after the fact is painful.

Deploy the secondary as a fresh Veeam Software Appliance using the same ISO version as the primary. Assign the static IP you reserved, set the hostname to match the DNS record you created, and complete the full appliance initial configuration wizard including the Security Officer account if you use one in your environment.

Install the Premium license on the primary node through the VBR Web UI: Settings (gear icon), License Information, License tab, Install. The secondary does not need a separate license install.

You can also use an existing backup server as the primary node - you do not have to start fresh. Just make sure you do not designate the same backup server as both primary and secondary. If you do, the secondary's configuration database gets replaced by the primary's during assembly.

Submit HA enable requests through Host Management

Before you can run the cluster wizard in the Windows console, you must submit an HA enable request through the Host Management console on each node separately. This is the step that catches people by surprise because it is not in the Windows console at all.

On the primary node

Submit the HA request

In a browser, go to https://<primary-node-ip-or-hostname>:10443. Log in with the Host Administrator account. In the navigation, click Applications. Under the High Availability section, click Submit Request.

If a Security Officer account is configured on this appliance, the Security Officer must log in separately and approve the request before it takes effect. If no Security Officer was configured during the initial setup wizard, the request is approved automatically.

On the secondary node

Repeat on the second node

Go to https://<secondary-node-ip-or-hostname>:10443. Log in with that node's Host Administrator account. Click Applications, then Submit Request under High Availability. Again, if a Security Officer is configured on that node, they must approve it.

The approval has a time limit

The HA enable approval is valid for a limited time window after it is granted. Complete the cluster assembly in the Windows console before it expires. If it expires before you finish, you need to submit and approve the request again on both nodes before the wizard will proceed.

Assemble the cluster in the Windows console

Connect the Windows console directly to the primary node - not the cluster IP, that does not exist yet. Use the primary node's hostname or IP address.

Step 1

Launch the New High Availability Cluster wizard

In the console, go to Backup Infrastructure > Managed Servers > Linux. Find your primary node in the list. Right-click it and select Create HA Cluster. This opens the New High Availability Cluster wizard.

Step 2

Specify cluster settings

Enter the virtual cluster IP address and the cluster endpoint DNS name. These must exactly match the IP and A record you created for the cluster endpoint. Double-check them here - if you enter the wrong cluster IP, you will need to disassemble and rebuild to correct it.

Step 3

Specify cluster node settings

The primary node IP is pre-populated from the machine you connected to. Confirm it is correct. For the secondary node, enter its IP address and specify the Host Administrator credentials for that appliance. Veeam uses these to connect to the secondary and initialize it as a standby replica.

Step 4

Review and finish

Review the summary and click Finish. Veeam assigns the virtual cluster IP to the primary node, connects to the secondary, initializes it as a read-only standby, deploys the HA service on both nodes, and starts the initial configuration data synchronization. This takes several minutes. When it completes, disconnect the console from the primary node IP and reconnect using the cluster endpoint hostname or virtual cluster IP. From this point forward, always connect through the cluster endpoint.

Self-signed certificate

If you want the cluster DNS name included in the self-signed certificate's alternative names, you must regenerate the certificate after assembling the cluster. The official docs call this out explicitly. It is not automatic.

Verify the cluster is healthy

After reconnecting through the cluster endpoint, go to Backup Infrastructure > Managed Servers. The HA cluster appears as a single entry with both nodes shown underneath, labeled Primary and Secondary. Both should show healthy status.

Right-click the cluster and select Rescan to force a fresh status check. This queries both nodes, confirms the HA service is running, and verifies replication is active and synchronized. If the secondary shows a warning, check TCP connectivity between nodes first, then confirm the Patroni service is running on both.

You can view session statistics by right-clicking the cluster and selecting Statistics. This shows the HA maintenance sessions that periodically sync configuration data. Healthy completed sessions confirm replication is working.

Enterprise Manager and HA clusters

If your VBR server is connected to Enterprise Manager, you must re-add it in EM using the cluster virtual IP address or cluster DNS name after assembling the cluster. EM does not automatically update the server entry. If you leave the original individual node IP in EM, EM will lose the ability to collect data from that server after a switchover or failover, because the individual node IP is no longer the active primary.

In the EM web console, go to Configuration > Backup Servers. Remove the existing entry for the VBR server, then add it back using the cluster virtual IP or cluster DNS name.

Switchover (planned)

Use switchover when both nodes are healthy, synchronized, and you need to take the primary down for planned maintenance. In the Backup Infrastructure view, right-click the HA cluster and select Switchover. Veeam confirms both nodes are in sync before proceeding, stops running jobs on the current primary, migrates the virtual cluster IP to the secondary, and promotes it to active. The original primary transitions to standby. The process takes around 10 minutes.

After your maintenance work on what is now the standby, you can leave the new node assignment in place or switchover again to restore the original layout. There is no requirement to switch back.

Test this before you need it

Run a switchover in a non-emergency situation before you actually need it. You want to confirm the virtual IP migration works in your network, the console reconnects cleanly afterward, and you know what the process looks like. Discovering a DNS or network issue during an actual outage is a much worse experience than discovering it during a planned test.

Failover (unplanned)

Use failover when the primary is down and not recovering in a reasonable timeframe. Veeam ONE includes dedicated HA cluster alarms for this. The Veeam Backup and Replication HA cluster primary node state alarm can be configured to trigger automated failover when the primary becomes unavailable - meaning Veeam ONE can initiate failover without manual intervention. If you have Veeam ONE and are deploying HA, configure that alarm and its automated action before you need it.

Without automated failover, trigger it manually: connect the console to the cluster endpoint (it connects to whichever node is reachable), go to Backup Infrastructure, right-click the cluster, and select Failover. Veeam promotes the secondary, assigns the virtual cluster IP to it, and resumes backup operations on the new primary.

After the failover and after the original primary is repaired and back online, it rejoins the cluster automatically as the new secondary. Veeam detects it, resyncs its configuration from the current primary, and places it in standby. You do not need to rebuild the cluster from scratch after a failover.

Limitations table

Limitation Detail
Two nodes only Exactly two nodes. No three-node or quorum configurations
Linux Software Appliance only Windows-based VBR installations cannot participate
Premium license required Veeam Data Platform Premium, applied to primary node only
VBR 13.0.1 minimum Not available in the 13.0.0 release. Both nodes must run the same VBR version
Same subnet Both nodes and virtual cluster IP must be on the same subnet
No local repositories The default local repository on the backup server cannot be used within an HA cluster. All repositories must be external
Console only for management HA cluster cannot be managed via the Web UI or the Host Management console. Windows console required
EM must be re-added after assembly Enterprise Manager must be updated to use the cluster virtual IP or DNS name rather than an individual node IP
Universal Storage API plug-ins Not automatically installed on HA nodes. Must be installed manually on both nodes
Syslog events HA cluster reconfiguration events and failover events are not written to or sent to the syslog server
Configuration database not auto-updated during upgrade VBR does not automatically update the configuration database during HA cluster upgrade. Must be configured for automatic updates or updated manually on the primary node
Manual failover by default Failover is manual unless automated failover is configured through Veeam ONE alarms
What you completed
  • Confirmed Premium license and VBR 13.0.1 requirement on both nodes
  • Allocated three static IPs and created forward and reverse DNS records for all three
  • Deployed the secondary as a fresh Software Appliance, removed local repositories
  • Submitted HA enable requests through Host Management on both nodes
  • Ran the New High Availability Cluster wizard from the Windows console connected to the primary
  • Reconnected through the cluster endpoint and verified cluster health
  • Re-added the VBR server in Enterprise Manager using the cluster IP or DNS name
  • Tested switchover to confirm virtual IP migration and console reconnect work correctly

Read more