vSphere Replication Management Overview
vSphere Replication Management Overview
Replication
vSphere Replication 6.5
Using VMware vSphere Replication
You can find the most up-to-date technical documentation on the VMware website at:
[Link]
If you have comments about this documentation, submit your feedback to
docfeedback@[Link]
VMware, Inc.
3401 Hillview Ave.
Palo Alto, CA 94304
[Link]
Copyright © 2012–2016 VMware, Inc. All rights reserved. Copyright and trademark information.
VMware, Inc. 2
Contents
VMware, Inc. 3
About Using VMware vSphere
Replication 1
Using vSphere Replication provides information about using VMware vSphere Replication.
Intended Audience
This information is intended for anyone who wants to protect the virtual machines in their virtual
infrastructure by using vSphere [Link] information is written for experienced Windows or Linux
system administrators who are familiar with virtual machine technology and datacenter operations.
VMware, Inc. 4
Replicating Virtual Machines 2
With vSphere Replication you can replicate virtual machines from a source site to a target site.
You can set a recovery point objective (RPO) to a certain time interval depending on your data protection
needs. vSphere Replication applies all changes made to virtual machines configured for replication at the
source site to their replicas at the target site. This process reoccurs periodically to ensure that the replicas
at the target site are not older than the RPO interval that you set. See How the Recovery Point Objective
Affects Replication Scheduling.
To replicate a virtual machine using vSphere Replication, you must deploy the vSphere Replication
appliance at the source and target sites. A vSphere Replication infrastructure requires one
vSphere Replication appliance at each site.
The source and target sites must be connected for you to be able to configure replications. You cannot
perform replications if one of the sites is in the Connection issue state. For more details, see Site
Connection States Displayed in the vSphere Web Client in the vSphere Replication Installation and
Configuration guide. If the sites appear in the Not authenticated state, scheduled replications continue
as normal, but you cannot manage replications. See vSphere Replication Sites Appear in the Not
Authenticated State.
vSphere Replication does not support the recovery of multiple virtual machines from the same workflow.
Each recovery workflow is for an individual virtual machine.
You can configure replications for powered-off virtual machines, but the data synchronization begins when
the virtual machine is powered on. While the source virtual machine is powered off, the replication
appears in Not active status.
VMware, Inc. 5
Using VMware vSphere Replication
n How vSphere Replication Synchronizes Data Between vCenter Server Sites During Initial
Configuration
n Reconfiguring Replications
The RPO value affects replication scheduling, but vSphere Replication does not adhere to a strict
replication schedule. For example, when you set the RPO to 15 minutes, you instruct vSphere Replication
that you can tolerate losing the data for up to 15 minutes. This does not mean that data is replicated
every 15 minutes.
If you set an RPO of x minutes, and the RPO is not violated, the latest available replication instance can
never reflect a state that is older than x minutes. A replication instance reflects the state of a virtual
machine at the time the synchronization starts.
Assume that during replication configuration you set the RPO to 15 minutes. If the synchronization starts
at 12:00 and it takes five minutes to transfer to the target site, the instance becomes available on the
target site at 12:05, but it reflects the state of the virtual machine at 12:00. The next synchronization can
start no later than 12:10. This replication instance is then available at 12:15 when the first replication
instance that started at 12:00 expires.
If you set the RPO to 15 minutes and the replication takes 7.5 minutes to transfer an instance,
vSphere Replication transfers an instance all the time. If the replication takes more than 7.5 minutes, the
replication encounters periodic RPO violations. For example, if the replication starts at 12:00 and takes 10
minutes to transfer an instance, the replication finishes at 12:10. You can start another replication
immediately, but it finishes at 12:20. During the time interval 12:15-12:20, an RPO violation occurs
because the latest available instance started at 12:00 and is too old.
The replication scheduler tries to satisfy these constraints by overlapping replications to optimize
bandwidth use and might start replications for some virtual machines earlier than expected.
To determine the replication transfer time, the replication scheduler uses the duration of the last few
instances to estimate the next one.
VMware, Inc. 6
Using VMware vSphere Replication
vSphere Replication 6.5 displays the 5 minute RPO setting when the target and the source site use
VMFS 6.0, VMFS 5.x, NFS 4.1, NFS 3, VVOL, or Virtual SAN 6.0 storage and later.
You can use the 5 minute RPO setting if you are using different datastore types between the source and
the target site.
The 5 minute RPO can be applied to a maximum of 100 VMs on VMFS 6.0, VMFS 5.x, NFS 4.1, NFS 3,
and Virtual SAN 6.0 storage and later. The maximum for VVOL datastore is 50 VMs.
Note If you select the OS quiescing option while configuring replication, you cannot use an RPO value
lower than 15 minutes.
For example, you can configure the retention of 3 instances per day for the last 5 days.
After you recover a replicated virtual machine, the retained replicas appear as snapshots of the virtual
machine in the vSphere Web Client. The list of snapshots includes the retained instances according to the
retention policy that you set, and the latest instance. By the example above, the list will contain 15
snapshots and the latest saved instance of the virtual machine, or a total of 16 snapshots. You can use
the snapshots to revert to an earlier state of the recovered virtual machine.
Administrators cannot configure the precise time when replica instances are created, because the
retention policy is not directly related to replication schedule and RPO. As a consequence, replications
with the same retention policy might not result in replicas retained at the same time instants.
When the age of the latest replication instance approaches the RPO interval, vSphere Replication starts a
sync operation to create a new instance on the target site. The replication instance reflects the state of
the virtual machine at the time the synchronisation starts. If no retention policy is configured, when the
new instance is created, the previous instance expires and the vSphere Replication Server deletes it.
VMware, Inc. 7
Using VMware vSphere Replication
When the age of the latest instance approaches the RPO interval, vSphere Replication starts creating a
new replica instance. The start time of the sync operation is the time of the new instance. When the sync
operation completes, vSphere Replication assesses the existing replica instances to determine which
ones to keep:
1 The granularity of the retention policy is determined based on the replication settings. For example, if
you configured vSphere Replication to keep 3 instances for the last 1 day, it means that you want to
keep 3 replica instances that are relatively evenly distributed over 24 hours. This equals
approximately 1 instance in a 8-hour interval, or the granularity of this retention policy is 8 hours.
2 The time of the last saved instance is rounded down to the nearest slot time. If the granularity is 8
hours, the slot times are 0:00, 8:00, and 16:00.
3 The instances that are between the nearest slot time and the last saved instance are traversed. Let
us assume that the time of the last saved instance is 10:55. Following our example, the nearest slot
time is 8:00 o'clock. Let us also assume that the RPO is 1 hour, and each sync operation takes 5
minutes to complete. Between 8:00 o'clock and 10:55, the slot contains an 8:55 instance, and a 9:55
instance.
4 The earliest instance that is newer than the nearest slot time is saved, and the rest of the instances in
this slot are deleted, except for the latest created instance that vSphere Replication always keeps.
Following our example, the 8:55 instance is saved, and the 9:55 instance is deleted. The 10:55
instance is the latest created instance, so it is also saved.
5 The slot time is decremented by the granularity of the retention policy and a check is performed for
the earliest instance between the beginning of the current slot and the beginning of the previous slot.
If the slot contains expiring instances, they are deleted.
6 The number of slots that contain saved instances is analyzed. If the number of slots with saved
instances is higher than the number of slots determined by the retention policy, the oldest saved
instance expires and is deleted. The last saved instance is not included in this count. In our example,
if we had an instance saved for the interval 8:00 - 16:00 o'clock of the previous day, that instance
would be deleted.
VMware, Inc. 8
Using VMware vSphere Replication
The number of replication instances that vSphere Replication keeps depends on the configured retention
policy, but also requires that the RPO period is short enough for these instances to be created. Because
vSphere Replication does not check whether the RPO settings will create enough instances to keep, and
does not display a warning message if the instances are not enough, you must ensure that you set
vSphere Replication to create the instances that you want to keep. For example, if you set
vSphere Replication to keep 6 replication instances per day, the RPO period should not exceed 4 hours,
so that vSphere Replication can create 6 instances in 24 hours.
When you configure replication of a virtual machine, you can enable multiple point in time (PIT) instances
in the recovery settings in the Configure Replication wizard. vSphere Replication retains a number of
snapshot instances of the virtual machine on the target site based on the retention policy that you specify.
vSphere Replication supports a maximum of 24 snapshot instances. After you recover a virtual machine,
you can revert it to a specific snapshot.
During replication, vSphere Replication replicates all aspects of the virtual machine to the target site,
including any potential viruses and corrupted applications. If a virtual machine suffers from a virus or
corruption and you have configured vSphere Replication to keep PIT snapshots, you can recover the
virtual machine and then revert it to a snapshot of the virtual machine in its uncorrupted state.
You can also use the PIT instances to recover the last known good state of a database.
Replication
VR Appliance VR Appliance
t0 t1 t2 t3
VM VM VM VM VM
Note VMware Virtual SAN is a fully supported feature of vSphere 5.5 Update 1 and later.
VMware, Inc. 9
Using VMware vSphere Replication
Because user-friendly names of Virtual SAN datastores might change and cause errors during replication
or recovery operations, vSphere Replication automatically replaces the user-friendly name of a datastore
with its UUID, which is constant. Therefore, the UUID is displayed everywhere in the vSphere Replication
user interface, though you selected a human-readable name during replication configuration.
Using vSphere Replication adds to the load on the storage. Every virtual machine generates regular read
and write operations. Configuring replications on those virtual machines adds another read operation to
the regular read and write operations, which increases the I/O latency on the storage. The precise
number of virtual machines that you can replicate to Virtual SAN storage by using vSphere Replication
depends on your infrastructure. If you notice slower response times when you configure replications for
virtual machines in Virtual SAN storage, monitor the I/O latency of the Virtual SAN infrastructure.
Potentially, reduce the number of virtual machines that you replicate in the Virtual SAN datastore.
Note When you stop a replication, vSphere Replication does not delete the replica directory at the target
datastore. As a result, stale directories remain on VMFS and NFS target datastores, and unused
namespaces remain on Virtual SAN and Virtual Volume target datastores. Because the maximum number
of directories and namespaces on a datastore is limited, you must manually clean them up to free
resources on the datastore. See Clean Up the Target Datastore After You Stop a Replication.
n If a virtual machine has one 256 GB disk and you use the default Virtual SAN storage policy, the disk
object will have 2 mirror components of 256 GB each and 1 witness, to make a total of 3 components.
n If a virtual machine has one 512 GB disk and you use the default Virtual SAN storage policy, the disk
object will have 4 mirror components of 256 GB each and 1 witness, to make a total of 5 components.
VMware, Inc. 10
Using VMware vSphere Replication
If you enable multiple point-in-time (PIT) snapshots, you must make allowances for the additional
components that each snapshot creates in the Virtual SAN storage, based on the number of disks per
virtual machine, the size of the disks, the number of PIT snapshots to retain, and the number of failures to
tolerate. When retaining PIT snapshots and using Virtual SAN storage, you must calculate the number of
extra components that you require for each virtual machine:
Number of disks x number of PIT snapshots x number of mirror and witness components
Examples of using this formula demonstrate that retaining PIT snapshots rapidly increases the number of
components in the Virtual SAN storage for every virtual machine that you configure for
vSphere Replication:
n You have a virtual machine with two 256 GB disks for which you retain 10 MPIT snapshots, and you
set the default Virtual SAN storage policy:
n You have a virtual machine with two 512 GB disks for which you retain 10 PIT snapshots, and you set
the default Virtual SAN storage policy:
The number of PIT snapshots that you retain can increase I/O latency on the Virtual SAN storage.
Storage DRS can detect the data that vSphere Replication copies on the target site and can move
replications without affecting the replication process.
The speed of data synchronization depends on the availability of information about block allocation of the
VMDK files. vSphere Replication uses this information to find empty regions of the disks and accelerate
the sync operations by skipping these regions. The speed of data synchronization also depends on the
site for which block allocation information is available.
n If the allocation information is available at both sites, data synchronization occurs at the highest
possible speed.
n If the allocation information is available only at the source or the target site, vSphere Replication skips
the empty regions on the VMDK disks at that site, but processes the entire disk at the site where
allocation information is not available. Therefore, data synchronization is slower.
VMware, Inc. 11
Using VMware vSphere Replication
n If the allocation information is not available at either site, data synchronization is done by comparing
all blocks between the source site and the target site, even if many of the blocks have not been
allocated on the disk by the guest OS. This is the slowest method for data synchronization.
Note The availability of block allocation information has little effect on the speed of data synchronization
for VMDK disks that are almost full.
Product Versions at the The acceleration of initial synchronization is supported only on ESXi hosts
Source and the Target 6.0.x or later.
Site If the ESXi and the vSphere Replication Server on the source site are 6.x or
later, but the vSphere Replication Server or the hosts at the target site are
not 6.x or later, the allocation information will be available only on the
source site.
If the vSphere Replication Management servers at the source and at the
target site are both 6.x, but one or more ESXi hosts at the target site are
not 6.0 or later, if the vSphere Replication Management server selects a
target host that is not 6.0 or later, there will be no allocation information
available on the target site.
The Type of the Disks on VMFS or VSAN datastores provide full allocation information.
Datastore NFS datastores cannot provide allocation information for the disks that are
located on them.
Note Replication disks on the source and the target site can be on
different datastore types. The acceleration of the initial synchronization
depends on whether both sites can provide allocation information, or only
one site. If none of the sites can provide allocation information, no
acceleration occurs.
The Type of Virtual Disk Lazy zeroed thick disks, thin disks, and vSAN sparse disks, Space-Efficient
sparse disks, and VMDK sparse snapshots provide allocation information.
VMware, Inc. 12
Using VMware vSphere Replication
vSphere Replication compares the differences on the source and target site, and replicates only the
changed blocks.
When, during replication configuration, you select a target datastore for the virtual machine,
vSphere Replication looks for a disk with the same filename in the target datastore. If a file with the same
name exists, vSphere Replication prompts you with a warning and offers you the option to use the
existing disk file as a seed for the replication. If you accept the option, after the virtual machine replication
is fully configured and enabled, vSphere Replication compares the differences and replicates only the
changed blocks. If you do not accept the prompt, you must change the target location for your replication.
Note If you plan to copy files from the source to the target datastore, the source virtual machine must be
powered off before downloading the vmdk files that will be used as seeds for the replication.
When you configure replication in a single vCenter Server instance, you can select the source site as the
target site for replication. You then configure replication in the same way as for an infrastructure with a
source and a target site. For example, you can replicate a virtual machine to a different datastore
attached to the same host or another host. vSphere Replication prevents you from using the source or
replicated virtual machine's vmdk files as the target of the replication.
The virtual machine name must be unique in the same folder in the vCenter Server inventory. In the
recovery wizard, vSphere Replication does not allow you to select a folder if there is already a virtual
machine with the same name registered to it. During recovery if there is a virtual machine with the same
name, you might see an error message. See Error Recovering Virtual Machine in a Single vCenter Server
Instance for more information.
VMware, Inc. 13
Using VMware vSphere Replication
Although vSphere Replication supports up to 24 recovery points, you must set the MPIT to the lowest
number of recovery points that meets your business requirements. For example, if the business
requirement is for 10 recovery points, you must set up vSphere Replication to save only 10 snapshots.
You can set up two recovery points per day for the last five days. As a result, the consumed storage and
the time needed to consolidate the snapshots after recovery are less than if you use the maximum
number of recovery points.
Configuring Quiescing
For VMs with high levels of storage I/O, quiescing of the file system and applications can take several
minutes and impact the performance of the VM . When quiescing a file system and applications for
Windows VMs, vSphere Replication requires a regular VM snapshot before replication. When you
estimate the RPO time, consider the time and resource consumption for the quiescing and for the
consolidation of the snapshots. For example, if you configure replication of a Windows VM with an RPO
of 15 minutes and quiescing is enabled, vSphere Replication generates a VM snapshot and consolidates
it every 15 minutes.
VMware, Inc. 14
Using VMware vSphere Replication
This procedure is for configuring replications to a target vCenter Server. To configure a replication to a
cloud provider, see vSphere Replication for Disaster Recovery to the Cloud.
When you configure replication, you set a recovery point objective (RPO) to determine the maximum data
loss that you can tolerate. For example, an RPO of 1 hour seeks to ensure that a virtual machine loses
the data for no more than 1 hour during the recovery. For smaller RPO values, less data is lost in a
recovery, but more network bandwidth is consumed keeping the replica up to date. The RPO value affects
replication scheduling, but vSphere Replication does not adhere to a strict replication schedule. See How
the Recovery Point Objective Affects Replication Scheduling and How the 5 Minute Recovery Point
Objective Works.
Every time that a virtual machine reaches its RPO target, vSphere Replication records approximately
3800 bytes of data in the vCenter Server events database. If you set a low RPO period, this can quickly
create a large volume of data in the database. To reduce the volume of data that is kept in the
vCenter Server events database, limit the number of days that vCenter Server retains event data. See
Configure Database Retention Policy in the vCenter Server and Host Management Guide. Alternatively,
set a higher RPO value.
vSphere Replication guarantees crash consistency amongst all the disks that belong to a virtual machine.
If you use quiescing, you might obtain a higher level of consistency. The available quiescing types are
determined by the operating system of the virtual machine. See Interoperability Pages for vSphere
Replication 6.5 for quiescing support for Windows and Linux virtual machines.
You can configure virtual machines to replicate from and to Virtual SAN datastores. See Using vSphere
Replication with Virtual SAN Storage for the limitations when using vSphere Replication with Virtual SAN.
Note VMware Virtual SAN is a fully supported feature of vSphere 5.5 Update 1 and later.
VMware, Inc. 15
Using VMware vSphere Replication
Prerequisites
n Verify that the vSphere Replication appliance is deployed at the source and the target sites.
n To enable the quiescing of virtual machines that run Linux guest OS, install the latest version of
VMware Tools on each Linux machine that you plan to replicate.
Procedure
3 Right-click the virtual machine and select All vSphere Replication Actions > Configure replication.
n If you want to configure a replication to a local target site, select the target site from the list and
click Next.
If the source and target sites are not connected, the connection between the sites is configured
when you click Next.
n If you want to configure a replication to a remote target site, and the source and target sites are
connected, select the target site from the list and click Next.
n If you want to configure a replication to a remote target site, and the source and target sites are
not connected, click Add Remote Site), enter the IP address or host name of the server where
the target PSC runs, and provide the credentials of a user that has the VRM [Link]
VRM privilege assigned. When the user is authenticated with the target site, all vCenter Server
instances on which the vSphere Replication Management Server is registered with the Lookup
Service appear in the list of target sites. Select the target site from the list and click OK and Next.
6 Accept the automatic assignment of a vSphere Replication server or select a particular server on the
target site and click Next.
7 On the Target Location page, click Edit to select or change the target location datastore.
8 (Optional) To configure the replication of individual disks, click the name of the source virtual
machine.
For each disk, you can select the virtual format, storage policy, and a datastore where it is replicated.
You can disable the replication of a disk by clicking Disable in its Replication Enabled row.
VMware, Inc. 16
Using VMware vSphere Replication
9 (Optional) On the Replication options page, select the quiescing method for the guest operating
system of the source virtual machine.
Note Quiescing options are available only for virtual machines that support quiescing.
vSphere Replication does not support VSS quiescing on Virtual Volumes.
Compressing the replication data that is transferred through the network saves network bandwidth
and might help reduce the amount of buffer memory used on the vSphere Replication server.
However, compressing and decompressing data requires more CPU resources on both the source
site and the server that manages the target datastore.
11 On the Recovery settings page, use the RPO slider or the time spinners to set the acceptable period
for which data can be lost in the case of a site failure.
The available RPO range is from 5 minutes to 24 hours for target and source sites.
12 (Optional) To save multiple replication instances that can be converted to snapshots of the source
virtual machine during recovery, select Enable in the Point in time instances pane, and adjust the
number of instances to keep.
Note You can keep up to 24 instances for a virtual machine. This means that if you configure
vSphere Replication to keep 6 replication instances per day, the maximum number of days you can
set is 4 days.
The number of replication instances that vSphere Replication keeps depends on the configured
retention policy, but also requires that the RPO period is short enough for these instances to be
created. Because vSphere Replication does not check whether the RPO settings will create enough
instances to keep, and does not display a warning message if the instances are not enough, you
must ensure that you set vSphere Replication to create the instances that you want to keep. For
example, if you set vSphere Replication to keep 6 replication instances per day, the RPO period
should not exceed 4 hours, so that vSphere Replication can create 6 instances in 24 hours.
13 Click Next.
14 On the Ready to complete page, review the replication settings, and click Finish.
vSphere Replication starts an initial full synchronization of the virtual machine files to the designated
datastore on the target site.
VMware, Inc. 17
Using VMware vSphere Replication
When you configure replication, you set a recovery point objective (RPO) to determine the maximum data
loss that you can tolerate. For example, an RPO of 1 hour seeks to ensure that a virtual machine loses
the data for no more than 1 hour during the recovery. For smaller RPO values, less data is lost in a
recovery, but more network bandwidth is consumed keeping the replica up to date. The RPO value affects
replication scheduling, but vSphere Replication does not adhere to a strict replication schedule. See How
the Recovery Point Objective Affects Replication Scheduling and How the 5 Minute Recovery Point
Objective Works.
Every time that a virtual machine reaches its RPO target, vSphere Replication records approximately
3800 bytes of data in the vCenter Server events database. If you set a low RPO period, this can quickly
create a large volume of data in the database. To reduce the volume of data that is kept in the
vCenter Server events database, limit the number of days that vCenter Server retains event data. See
Configure Database Retention Policy in the vCenter Server and Host Management Guide. Alternatively,
set a higher RPO value.
vSphere Replication guarantees crash consistency amongst all the disks that belong to a virtual machine.
If you use quiescing, you might obtain a higher level of consistency. The available quiescing types are
determined by the operating system of the virtual machine. See Interoperability Pages for vSphere
Replication 6.5 for quiescing support for Windows and Linux virtual machines.
You can configure virtual machines to replicate from and to Virtual SAN datastores. See Using vSphere
Replication with Virtual SAN Storage for the limitations when using vSphere Replication with Virtual SAN.
Note VMware Virtual SAN is a fully supported feature of vSphere 5.5 Update 1 and later.
Configuring vSphere Replication on a large number of virtual machines simultaneously when using Virtual
SAN storage can cause the initial full synchronization of the virtual machine files to run very slowly. Initial
full synchronization operations generate heavy I/O traffic, and configuring too many replications at the
same time can overload the Virtual SAN storage. Configure vSphere Replication in batches of a
maximum of 30 virtual machines at a time.
Prerequisites
n Verify that the vSphere Replication appliance is deployed at the source and the target sites.
n To enable the quiescing of virtual machines that run Linux guest OS, install the latest version of
VMware Tools on each Linux machine that you plan to replicate.
Procedure
VMware, Inc. 18
Using VMware vSphere Replication
3 Select the virtual machines to replicate by using the Ctrl or Shift keys.
4 Right-click the virtual machines and select All vSphere Replication Actions > Configure
replication.
The virtual machines pass a validation check before they can be configured for replication.
5 Click Next.
n If you want to configure a replication to a local target site, select the target site from the list and
click Next.
If the source and target sites are not connected, the connection between the sites is configured
when you click Next.
n If you want to configure a replication to a remote target site, and the source and target sites are
connected, select the target site from the list and click Next.
n If you want to configure a replication to a remote target site, and the source and target sites are
not connected, click Add Remote Site), enter the IP address or host name of the server where
the target PSC runs, and provide the credentials of a user that has the VRM [Link]
VRM privilege assigned. When the user is authenticated with the target site, all vCenter Server
instances on which the vSphere Replication Management Server is registered with the Lookup
Service appear in the list of target sites. Select the target site from the list and click OK and Next.
8 Accept the automatic assignment of a vSphere Replication server or select a particular server on the
target site and click Next.
9 On the Target Location page, click Edit to select or change the target location datastore.
10 (Optional) On the Replication options page, select the quiescing method for the guest operating
system of the source virtual machine.
Note Quiescing options are available only for virtual machines that support quiescing.
vSphere Replication does not support VSS quiescing on Virtual Volumes.
Compressing the replication data that is transferred through the network saves network bandwidth
and might help reduce the amount of buffer memory used on the vSphere Replication server.
However, compressing and decompressing data requires more CPU resources on both the source
site and the server that manages the target datastore.
VMware, Inc. 19
Using VMware vSphere Replication
12 On the Recovery settings page, use the RPO slider or the time spinners to set the acceptable period
for which data can be lost in the case of a site failure.
The available RPO range is from 5 minutes to 24 hours for target and source sites.
13 (Optional) To save multiple replication instances that can be converted to snapshots of the source
virtual machine during recovery, select Enable in the Point in time instances pane, and adjust the
number of instances to keep.
Note You can keep up to 24 instances for a virtual machine. This means that if you configure
vSphere Replication to keep 6 replication instances per day, the maximum number of days you can
set is 4 days.
The number of replication instances that vSphere Replication keeps depends on the configured
retention policy, but also requires that the RPO period is short enough for these instances to be
created. Because vSphere Replication does not check whether the RPO settings will create enough
instances to keep, and does not display a warning message if the instances are not enough, you
must ensure that you set vSphere Replication to create the instances that you want to keep. For
example, if you set vSphere Replication to keep 6 replication instances per day, the RPO period
should not exceed 4 hours, so that vSphere Replication can create 6 instances in 24 hours.
14 Click Next.
This option searches the selected target datastore for replication seeds. If candidate files are found,
confirm whether to use the files as seeds.
16 On the Ready to complete page, review the replication settings, and click Finish.
vSphere Replication starts an initial full synchronization of the virtual machine files to the designated
datastore on the target site.
If a replication source virtual machine is powered off, the replication remains in Not Active state until
you power on the virtual machine.
Prerequisites
Other than the embedded vSphere Replication Server, you must have an additional vSphere Replication
Server deployed and registered.
Procedure
VMware, Inc. 20
Using VMware vSphere Replication
3 Select a vSphere Replication Server from the list, and click OK.
The newly assigned server is updated in the vSphere Replication Server column.
Take a note of the target datastore and the name of the replication that you are about to stop. You need
this information to clean up your environment after you stop the replication.
Prerequisites
Verify that you are logged in the vSphere Web Client as a VRM virtual machine replication user or a VRM
administration user. For more details, see vSphere Replication Roles Reference in the
vSphere Replication Installation and Configuration guide.
Procedure
1 In the vSphere Replication Home page, go to the Monitor tab and click Incoming Replications or
Outgoing Replications.
vSphere Replication asks you if you want to permanently stop the replication for the selected virtual
machine.
Note The hosts and vSphere Replication server used by the replication must be accessible to stop a
replication on both sites. If a host or the server is not accessible, you can force stop the replication on
the accessible site by selecting Force stop replication. If you force stop the replication from
Incoming Replications, you must also force stop the corresponding replication from Outgoing
Replications if the source site is available. If you force stop the replication from Outgoing
Replications, you can only recover or force stop the corresponding replication from Incoming
Replications.
3 Click Yes to confirm that you want to stop replicating this virtual machine.
VMware, Inc. 21
Using VMware vSphere Replication
When you stop a replication, the following operations are performed at the replication target site.
n VMDK files are deleted from the target site datastore if the VMDK files were created when the
replication was first configured.
Note When you stop a replication, vSphere Replication does not delete the replica directory at the target
datastore. As a result, stale directories remain on VMFS and NFS target datastores, and unused
namespaces remain on Virtual SAN and Virtual Volume target datastores. Because the maximum number
of directories and namespaces on a datastore is limited, you must manually clean them up to free
resources on the datastore. See Clean Up the Target Datastore After You Stop a Replication.
n VMDK files are not deleted and remain on the target datastore if you configured the replication to use
existing disks at the target site as seeds.
As a result, stale directories remain on VMFS and NFS target datastores, and unused namespaces
remain on Virtual SAN and Virtual Volume target datastores. Because the maximum number of directories
and namespaces on a datastore is limited, you must manually clean them up to free resources on the
datastore.
Prerequisites
Verify that you know the name of the replication that was stopped and its target datastore.
Procedure
1 Log in to the vSphere Web Client as and administrator user and navigate to the datastore that was
the target for the stopped replication.
2 Enter the name of the stopped replication in the search text box and locate the folder that
corresponds to this name.
Reconfiguring Replications
You can reconfigure a replication to modify its settings.
For example, you can reconfigure the replication to enable or disable a virtual machine disk file for
replication, modify replication options, such as RPO, MPIT retention policy, or quiescing method. You can
also specify a different target datastore for replica configuration and disk files.
To reconfigure replication parameters, select the replication from Outgoing Replications or Incoming
Replications, and select Reconfigure.
VMware, Inc. 22
Using VMware vSphere Replication
Procedure
You might be prompted to provide login credentials for the target site.
Procedure
3 On the target site, resize the disk that is left over after you stopped the replication.
4 Configure the replication on the source virtual machine and use the resized disk on the target site as
seed.
To resize a virtual machine disk if you did not initially use replication seeds, you must perform a recovery,
resize the disk on source and target site manually, and use the target disk as a replication seed to
configure a new replication.
Procedure
VMware, Inc. 23
Using VMware vSphere Replication
4 Resize the disk of the recovered virtual machine on the target site.
5 Unregister the recovered virtual machine on the target site, but do not delete the disks.
6 Configure replication by using the disks of the recovered virtual machine as seeds.
vSphere Replication can save replication instances that can be used as snapshots after recovery or
planned migration operations. You can save up to 24 point in time instances per VM.
Procedure
1 On the vSphere Replication Home page, click the Monitor tab, and click Outgoing Replications.
2 Select the replication that you want to reconfigure and click the Reconfigure replication icon , or
right-click the replication source virtual machine and select All vSphere Replication Actions >
Reconfigure.
The reconfiguration wizard opens. You might be prompted to provide login credentials for the target
site.
3 Click Next until you reach the Recovery settings page of the wizard.
4 In the Point in time instances pane, make the changes that you want to apply and click Next.
Action Procedure
Enable the saving of point in time Select the Enable check box.
instances
Disable the saving of point in time Deselect the Enable check box.
instances
Adjust the number of instances to keep Use the spin-boxes to adjust the number of instances to keep per day and the
number of past days for which you want to keep replication instances.
Note You cannot keep more than 24 replication instances per virtual machine.
If you selected to disable the saving of point in time instances, the instances that exist on the target site
are deleted when the next replication instance appears on the target site. The moment when a new
replication instance is saved on the target site depends on the RPO setting.
VMware, Inc. 24
Using VMware vSphere Replication
Note The old target datastore from which you want to move the replication data must be online. If the
old datastore is inaccessible, the reconfiguration task fails. To change the target datastore when the old
datastore is inaccessible, you must stop the replication to the old datastore and configure another
replication to the new datastore.
Procedure
1 In the vSphere Replication Home page, click the Monitor tab and click Outgoing Replications or
Incoming Replications.
2 Right-click the replication for which you want to change the target datastore and select Reconfigure.
The reconfiguration wizard open. You might be prompted to provide login credentials for the target
site.
5 Click Next until you reach the Ready to complete page and click Finish to save your settings.
vSphere Replication moves all replicated instances and configuration files to the new target datastore
according to your settings.
VMware, Inc. 25
Monitoring and Managing
Replications in
vSphere Replication 3
vSphere Replication provides a management interface where you can monitor and manage virtual
machine replication and connectivity states for local and remote sites.
The Home tab in vSphere Replication lists all vCenter Servers that are joined in the same SSO server
and the status of each vSphere Replication appliance with the total number of replications.
When you select a vCenter Server and go to the Summary tab, you can see vSphere Replication portlet
with a summary of the target sites, and outgoing and incoming replications for the vSphere Replication
appliance associated with that vCenter Server.
n A graphical representation of all incoming and outgoing replications with color-coded states of the
replicated virtual machines.
Prerequisites
Procedure
VMware, Inc. 26
Using VMware vSphere Replication
Warning Paused, OK(RPO violation), Not Active, Not Active(RPO violation), FullSync(RPO violation), Sync(RPO violation)
Recovered Recovered
Note If a replication is in the Not Active replication state, you might have connected the source and
target sites using network address translation (NAT). vSphere Replication does not support NAT. Use
credential-based authentication and network routing without NAT when connecting the sites. Another
cause for a Not Active replication state might be that the source virtual machine is powered off. Automatic
replication works only on virtual machines that are powered on.
For more information about how to identify replication errors, see Identifying Replication Problems in the
Issues Tab.
Prerequisites
Procedure
2 Select the vCenter Server to which the vSphere Replication appliance is registered.
4 Select Outgoing Replications to see details of the virtual machines replicated from this site.
5 Select Incoming Replications to see details of the virtual machines replicated to this site.
According to the status of a selected replication, you can perform different actions on the replication.
VMware, Inc. 27
Using VMware vSphere Replication
Prerequisites
Procedure
3 Click Reports.
The Reports page displays two types of data, current and historic. Current data appears in the graphs on
the left, and historic data is displayed on the right of the Reports page.
Note Data is collected in 5 minute intervals and the graphs represent aggregated data for each interval.
Therefore, you cannot see the exact moment when a peak value occurred.
What to do next
n You can use the drop-down menu above the historic data reports to limit the time range of the reports.
n When you maximize the report for transferred bytes, you can use the drop-down menu to filter data
by virtual machine. This helps you identify virtual machines that generate the most intense replication
traffic in your environment.
VMware, Inc. 28
Using VMware vSphere Replication
Server and site connectivity, number of RPO violations, and other metrics give you, as an administrator,
the information you need to diagnose replication issues.
The following sections contain examples of interpreting the data displayed under Reports on the
vSphere Replication tab under Monitor.
RPO Violations
The large number of RPO violations can be caused by various problems in the environment, on both the
source and the target site. With more details on historical replication jobs, you can make educated
decisions on how to manage the replication environment.
n The network bandwidth cannot accommodate all n Disable the replication on some virtual machines with high
replications. change rate to allow lower change rate virtual machines to
n The replication traffic might have increased. meet their RPO objectives.
n The initial full sync for a large virtual machine is taking n Increase the network bandwidth for the selected host.
longer than the configured RPO for the virtual machine. n Check if the replication traffic has increased. If the traffic has
increased, investigate possible causes, for example the
usage of an application might have changed without you
being informed.
n Check the historical data for average of transferred bytes for
a notable and sustained increase. If an increase exists,
contact application owners to identify recent events that
could be related to this increase.
n Adjust to a less aggressive RPO or look at other ways to
increase bandwidth to accommodate the current RPO
requirements.
n A connectivity problem exists between the source and the n Check the site connectivity data to verify the connection
target site. between the source and target site.
n An infrastructure change might have occurred on the target n Check if the infrastructure on the target site has changed or
site. is experiencing problems that prevent vSphere Replication
from writing on the target datastores. For example, storage
bandwidth management changes made to target hosts
might result in storage delays during the replication process.
n Check on the vSphere Replication Management Server
appliance and the vSphere Replication Server appliance.
Someone might have shut down the appliance or it might
have lost connection.
VMware, Inc. 29
Using VMware vSphere Replication
Transferred Bytes
Corelating the total number of transferred bytes and the number of RPO violations can help you make
decisions on how much bandwidth might be required to meet RPO objectives.
Table 3‑3. Analysing the Rate of Transferred Bytes and RPO Violations
Graph Values Probable Cause Solution
n High rate of transferred bytes and high number of RPO The network bandwidth might n Maximize the transferred
violations be insufficient to bytes graph and use the
n Low rate of transferred bytes and high number of RPO accommodate all replications. drop-down menu to filter
violations the data by virtual
machine. Disable the
replication on some virtual
machines with high change
rate to allow lower change
rate virtual machines to
meet their RPO objectives.
n Increase the network
bandwidth for the selected
host.
n High rate of transferred bytes and a few or no RPO The environment operates as N/A
violations expected.
n Low rate of transferred bytes and a few or no RPO violations
Not The replication is not active because the virtual machine is Power on the virtual machine to resume
Active powered off and a warning icon appears. Replication is not running replication.
for that virtual machine.
Paused If you paused the replication, a warning icon appears. Resume the paused replication from the Issues
tab.
Error If you added a disk on a virtual machine which is already Reconfigure the replication and enable or disable
configured for replication, the replication pauses and goes to an the newly added disk.
error state.
VMware, Inc. 30
Using VMware vSphere Replication
Error While configuring replication, the replication fails with the incorrect Reconfigure the replication.
UUID. For example, the replication seed found and intended for
use has a different UUID from the original hard disk.
Error You do not use replication seeds during configuration, but a disk Reconfigure the replication.
with the same name is found during configuration.
RPO A replication contains an RPO violation. See Reconfigure Recovery Point Objectives
Violation (RPO) in Replications.
To create a connection to a new target site, see Configure vSphere Replication Connections in the
vSphere Replication Installation and Configuration guide.
Prerequisites
Procedure
You can view, configure, reconnect and unregister vSphere Replication Server instances that are
registered in your environment.
Prerequisites
Procedure
2 Select the vCenter Server to which the vSphere Replication appliance is registered.
VMware, Inc. 31
Using VMware vSphere Replication
4 In the Replication Servers section, click the icons to manage vSphere Replication servers.
Option Description
Deploy new vSphere Replication Click to deploy an additional vSphere Replication Server. For more details, see
Server from an OVF template Deploying Additional vSphere ReplicationServers in the vSphere Replication
Installation and Configuration guide.
Register a virtual machine as vSphere Click to register a virtual machine as a vSphere Replication Server. For more
Replication Server details, see Register an Additional vSphere ReplicationServer in the
vSphere Replication Installation and Configuration guide.
Configure the selected vSphere Click to access the VAMI of the vSphere Replication Server that you selected
Replication Server from the list.
Reconnect Click if the status of the vSphere Replication Server that you selected from the list
is Disconnected.
Unregister the selected vSphere Click to unregister the vSphere Replication Server that you selected from the list.
Replication Server For more details, see Unregister and Remove a vSphere ReplicationServer in the
vSphere Replication Installation and Configuration guide.
VMware, Inc. 32
Performing a Recovery with
vSphere Replication 4
With vSphere Replication, you can recover virtual machines that were successfully replicated at the target
site.
n If you skip synchronization and recover with the latest data available, for example, if the source
site is not available, vSphere Replication uses the latest available data at the target site.
n vSphere Replication reconfigures the newly replicated virtual machine with the correct disk paths.
n vSphere Replication registers the virtual machine with vCenter Server at the target site.
You can recover one virtual machine at a time on the Incoming Replications tab at the target site.
Optionally, you can power on the recovered virtual machine. The network devices of the recovered virtual
machine are disconnected. You might need to configure the recovered virtual machine to render it fully
operational.
If you had enabled the saving of point in time instances, those instances are converted to snapshots of
the recovered virtual machine. You can use the vSphere Web Client to revert to a snapshot from the list.
VMware, Inc. 33
Using VMware vSphere Replication
Prerequisites
Verify that the virtual machine at the source site is powered off. If the virtual machine is powered on, an
error message reminds you to power it off.
Procedure
2 On the Incoming Replications tab, right-click the virtual machine to recover and select Recovery.
3 Select whether to recover the virtual machine with all the latest data, or to recover the virtual machine
with the most recent data available on the target site.
Option Description
Synchronize recent changes Performs a full synchronization of the virtual machine from the source site to the
target site before recovering the virtual machine. Selecting this option avoids data
loss, but it is only available if the data of the source virtual machine is accessible.
You can only select this option if the virtual machine is powered off.
Use latest available data Recovers the virtual machine by using the data from the most recent replication
on the target site, without performing synchronization. Selecting this option results
in the loss of any data that has changed since the most recent replication. Select
this option if the source virtual machine is inaccessible or if its disks are
corrupted.
6 If the virtual machine contains hard disks for which you have not enabled replication, select a target
destination to attach an existing disk or detach the disk, and click Next.
This page only appears if the virtual machine contains hard disks for which you have not enabled
replication.
n To select a target destination, click Browse and navigate to a folder on a datastore in which disk
file is placed.
n To detach the disk and exclude disk files from the recovery, click Detach.
8 Click Finish.
vSphere Replication validates the provided input and recovers the virtual machine. If successful, the
virtual machine status changes to Recovered. The virtual machine appears in the inventory of the target
site.
VMware, Inc. 34
Using VMware vSphere Replication
If you enabled multiple point in time instances when you configured replication for the virtual machine,
vSphere Replication presents the retained instances as standard snapshots after a successful recovery.
You can select one of these snapshots to revert the virtual machine. vSphere Replication does not
preserve the memory state when you revert to a snapshot.
If the recovery fails, the replication of the virtual machines reverts to the replication state before the
attempted recovery. For more information about the failed recovery attempt, check the last recovery error
message in the replication details pane or check vCenter Server tasks.
The recovery might also fail if you use the same name for the virtual machine in a scenario where you
use vSphere Replication to replicate a virtual machine in a single vCenter Server and the vCenter Server
instance has only one host in its inventory. See Error Recovering Virtual Machine in a Single vCenter
Server Instance for more information.
After a successful recovery, vSphere Replication disables the virtual machine for replication if the source
site is still available. When the virtual machine is powered on again it does not send replication data to the
recovery site. To unconfigure the replication, select Stop replication.
When the source virtual machine is no longer in the vCenter Server inventory, the replication is
unconfigured. Unconfigured replications do not appear in the Summary tab nor in the Incoming
Replications or Outgoing Replications tabs. Check vCenter Server task history for information on
performed recoveries.
If a replicated virtual machine is attached to a distributed virtual switch and you attempt to perform a
recovery in an automated DRS cluster, the recovery operation succeeds but the resulting virtual machine
cannot be powered on. Edit the recovered virtual machine settings to attach it to the correct network.
vSphere Replication disconnects virtual machine network adapters to prevent damage in the production
network. After recovery, you must connect the virtual network adapters to the correct network. If target
host or cluster has no access to the DVS the virtual machine was configured with at the source site,
manually connect the virtual machine to a network or other DVS to successfully power on the virtual
machine.
After performing a successful recovery on the target vCenter Server site, you can perform failback. You
log in to the target site and manually configure a new replication in the reverse direction, from the target
site to the source site. The disks on the source site are used as replication seeds, so that
vSphere Replication only synchronizes the changes made to the disk files on the target site. For more
information on replication seeds, see Replicating Virtual Machines Using Replication Seeds.
Before you configure a reverse replication, you must unregister the virtual machine from the inventory on
the source site.
VMware, Inc. 35
Troubleshooting
vSphere Replication 5
Known troubleshooting information can help you diagnose and correct problems that occur while
replicating and recovering virtual machines with vSphere Replication.
If you have problems with deploying vSphere Replication, replicating or recovering virtual machines, or
connecting to databases, you can troubleshoot them. To help identify the problem, you might need to
collect and review vSphere Replication logs and send them to VMware Support.
See Chapter 3 Monitoring and Managing Replications in vSphere Replication to learn about replication
states and how to identify replication issues.
You can also search for solutions to problems in the VMware knowledge base at [Link]
To access and download the vSphere Replication logs, you need access to the vSphere Replication
VAMI. vSphere Replication rotates its logs when the log file reaches 50MB and keeps at most 12
compressed log files.
Prerequisites
n Verify that you have administrator privileges to configure the vSphere Replication appliance.
Procedure
VMware, Inc. 36
Using VMware vSphere Replication
3 Click Generate to generate a .zip package of the current vSphere Replication logs.
A link to the package containing the replication and system logs appears. Log files from the
vSphere Replication appliance and all connected Additional vSphere Replication Servers are included
in the same package.
5 (Optional) Click Delete next to existing log packages to delete them individually.
Use SCP or Win SCP to copy log folders and files from the vSphere Replication appliance and all
Additional vSphere Replication Servers.
n /opt/vmware/hms/logs/
n /opt/vmware/var/log/lighttpd/
n /var/log/vmware/
n /var/log/[Link]
You can create an alarm that triggers when a specific event occurs, such as after you configure a virtual
machine for replication. See View and Edit Alarm Settings in the vSphere Web Client in the
vSphere Web Client documentation.
VMware, Inc. 37
Using VMware vSphere Replication
Host configured for Host is configured for [Link].h Info Host System
vSphere Replication vSphere Replication ostConfiguredForHbr
Event
VMware, Inc. 38
Using VMware vSphere Replication
VMware, Inc. 39
Using VMware vSphere Replication
Crash consistent sync Crash consistent sync [Link] Info Virtual Machine
completed completed pletedEvent
Sync failed to start Sync failed to start [Link] Error Virtual Machine
tartDeltaEvent
VMware, Inc. 40
Using VMware vSphere Replication
Problem
When you deploy the vSphere Replication, an error appears at vService bindings in the Deploy OVF
Template wizard.
Cause
This error is typically the result of the vCenter Management Web service being paused or stopped.
Solution
Attempt to start the vCenter Management Web service. If vCenter Server is running as a Linux virtual
appliance, reboot the appliance.
VMware, Inc. 41
Using VMware vSphere Replication
Problem
The error OVF package is invalid and cannot be deployed might appear while you attempt to
deploy the vSphere Replication appliance.
Cause
This problem is due to the vCenter Server port being changed from the default of 80.
Solution
Problem
vSphere Replication might not be able to connect to SQL Server, and you have insufficient information to
solve this problem.
Cause
Several issues can cause this problem, and initially available information about the problem is insufficient
to affect a resolution.
Solution
For example, you might use SCP or WinSCP. Connect using the root account, which is the same
account used to connect to the VAMI.
3 Connect to the VAMI and attempt to save the vSphere Replication configuration.
4 Connect to the vSphere Replication appliance again and find the [Link] file which is
in /opt/vmware/hms/logs.
This log file contains information about the error that just occurred. Use this information to
troubleshoot the connection issue, or provide the information to VMware for further assistance. For
more details, seeReconfigure vSphere Replication to Use an External Database in the
vSphere Replication Installation and Configuration guide.
VMware, Inc. 42
Using VMware vSphere Replication
Problem
The ESXi 5.1 or newer host is in a cluster with hosts from older versions and you use vMotion to move
the replicated virtual machine to an older host. vSphere Replication then creates a file system quiesced
replica.
Cause
A mix of ESXi 5.1 (or newer) and older hosts in the cluster creates a file system quiesced replica during
vMotion to an older host. The process should instead create an application quiesced replica.
Solution
Make sure that all hosts in the cluster are running ESXi 5.1 or newer before you use vMotion to move a
Windows Server 2008 and Windows 8 virtual machine with application quiescing.
Problem
Multiple source disks with device keys device_keys point to the same destination datastore and file
path disk_path.
Cause
This problem occurs because vSphere Replication does not generate a unique datastore path or file
name for the destination virtual disk.
Solution
If you select different datastores for the VMDK files on the protected site, you must also select different
datastores for the target VMDK files on the secondary site.
Alternatively, you can create a unique datastore path by placing the VMDK files in separate folders on a
single target datastore on the secondary site.
VMware, Inc. 43
Using VMware vSphere Replication
Problem
The vSphere Replication service stops running or does not start after a reboot. The error unable to
resolve host: non-fully-qualified-name appears in the vSphere Replication logs.
Solution
1 In the vSphere Web Client, select the vCenter Server instance and click Manage > Settings >
Advanced Settings to check that the [Link] key is set to either a fully qualified
domain name or to a literal address.
3 Review and confirm the browser security exception, if applicable, to proceed to the login page.
4 Type the root user name and password for the appliance.
You configured the root password during the OVF deployment of the vSphere Replication appliance.
5 Enter the same FQDN or literal address for vCenter Server as you set for the [Link]
key.
Problem
This problem occurs when running ESXi Server 5.0 on the recovery site. It can result in missed RPO
targets.
The number of virtual machines that can successfully replicate to a single, shared VMFS datastore
increases if the RPO targets are longer.
Follow the guidelines when calculating the number of virtual machines that you should replicate to a
single VMFS volume on the recovery site.
n If all your virtual machines have an RPO of 15 minutes, performance is affected when replicating 50
to 100 virtual machines to the same VMFS datastore.
VMware, Inc. 44
Using VMware vSphere Replication
n If all your virtual machines have an RPO of 30 minutes, performance is affected when replicating 100
to 200 virtual machines to the same VMFS datastore.
If you have heterogeneous RPO targets in a protection group, calculate the harmonic mean of the RPO
targets when calculating the number of virtual machines that you can replicate to a single VMFS volume.
For example, if you have 100 virtual machines with an RPO of 20 minutes and 50 virtual machines with
an RPO of 600 minutes, you calculate the harmonic mean of the RPO as follows:
In this example, the configuration is similar to a setup with 150 virtual machines, each having an RPO of
approximately 30 minutes. In this case, performance is affected if these 150 virtual machines replicate to
a single VMFS volume.
Cause
This problem affects only VMFS datastores that are shared by multiple hosts. It does not occur on
datastores that are local to one host or on other datastore types, such as NFS. This problem affects only
installations that are running ESXi Server 5.0.
The number of vSphere Replication servers is not relevant. These limits apply to the number of virtual
machines that you can replicate to a single VMFS datastore.
Solution
2 If you cannot upgrade ESXi Server to version 5.1 or later, redistribute the replicated virtual machines
or adjust their RPO.
n Reduce the number of virtual machines with a short RPO that replicate to a single VMFS volume,
for example by using a larger number of smaller datastores.
n Increase the RPO of the virtual machines replicating to a single VMFS volume to create a longer
harmonic mean RPO.
Problem
In configurations with two vSphere Replication sites where each includes a vCenter Server instance and
a vSphere Replication appliance, the vSphere Replication sites can appear in the Not authenticated
state, even if you have successfully connected the sites.
VMware, Inc. 45
Using VMware vSphere Replication
Cause
Sites that you have successfully connected can appear in the Not authenticated state when you
establish a new login session to the vSphere Web Client and the previous login session has timed out. In
this case, the Not authenticated state reflects the connection to the remote site from the
vSphere Web Client and not the state of the connection between the sites. If the two sites are running,
vSphere Replication still performs replications at the schedules that you have configured. To restore the
Connected state in the vSphere Web Client, you must provide the login credentials for the remote site.
Solution
1 In the vSphere Web Client, select the vCenter Server to which the vSphere Replication is registered.
3 In Target Sites, right-click the remote site, select Reconnect site, and click Yes.
4 Enter the login credentials for the Platform Service Controller (PSC) on the remote site, and click OK.
The address of the PSC is pre-populated, but you can enter a new address if necessary. For
example, if the target vCenter Server was moved for load balancing, it might be managed by a new
PSC.
Problem
Unable to register the recovered virtual machine VM_name with configuration file
<path_to_vmx_config_file>.
Cause
You cannot recover virtual machines with the same name in the same source and destination folder in the
vCenter inventory.
Solution
Recover the virtual machine in a different VMs and Templates folder in the same datacenter. Optionally,
after successful recovery, you can remove the old virtual machine from the vCenter inventory and drag
the recovered virtual machine to the required virtual machine folder.
VMware, Inc. 46
Using VMware vSphere Replication
Problem
When you replicate virtual machines, you might encounter RPO violations.
Cause
n Network connectivity problems between source hosts and vSphere Replication servers at the target
site.
n As a result of changing the IP address, the vSphere Replication server has a different IP address.
n Slow bandwidth between the source hosts and the vSphere Replication servers.
Solution
n Search the [Link] at the source host for the vSphere Replication server IP address to see
any network connectivity problems.
n Verify that the vSphere Replication server IP address is the same. If it is different, reconfigure all the
replications, so that the source hosts use the new IP address.
n Check /var/log/vmware/*hbrsrv* at the vSphere Replication appliance at the target site for
problems with the server accessing a target datastore.
Problem
Deleting the vSphere Replication appliance does not remove the vSphere Replication extension from
vCenter Server.
Solution
1 Use the Managed Object Browser (MOB) to delete the vSphere Replication extension manually.
For more details, see Unregister vSphere Replication from vCenter Server if the Appliance Was
Deleted in the vSphere Replication Installation and Configuration guide.
VMware, Inc. 47
Using VMware vSphere Replication
Problem
If the ESXi Server instance on which vSphere Replication runs is disconnected from vCenter Server and
is connected to another vCenter Server instance, you cannot access vSphere Replication functions. If you
try to restart vSphere Replication, the service does not start.
Cause
The OVF environment for the vSphere Replication appliance is stored in the vCenter Server database.
When the ESXi host is removed from the vCenter Server inventory, the OVF environment for the
vSphere Replication appliance is lost. This action disables the mechanisms that the vSphere Replication
appliance uses to authenticate with vCenter Server.
Solution
1 (Optional) If possible, redeploy the vSphere Replication appliance and configure all replications and if
possible, reuse the existing .vmdk files as initial copies.
b Remove any temporary hbr* files from the target datastore folders.
d Configure all replications, reusing the existing replica .vmdk files as initial copies.
2 (Optional) If you cannot redeploy the vSphere Replication appliance, use the VAMI to connect
vSphere Replication to the original vCenter Server instance.
e Type the correct managed object id of the appliance VM in Appliance VM MO value. Use the
vCenter Server MOB to obtain the appliance id.
VMware, Inc. 48
Using VMware vSphere Replication
Problem
In addition to the generic error, the message provides more detailed information about the problem,
similar to the following examples.
n Error - Unable to reverse replication for the virtual machine 'virtual machine
name'. VRM Server generic error. Please check the documentation for any
troubleshooting information. Exception details:
'[Link]: Transaction (Process ID 57)
was deadlocked on lock resources with another process and has been chosen as the
deadlock victim. Rerun the transaction. This problem relates to a deadlock in Microsoft SQL
Server.
Cause
vSphere Replication sends this message when it encounters configuration or infrastructure errors. For
example, network issues, database connection issues, or host overload.
Solution
Check the Exception details message for information about the problem. Depending on the details of
the message, you can choose to retry the failed operation, restart vSphere Replication, or correct the
infrastructure.
VMware, Inc. 49
Using VMware vSphere Replication
Problem
If you have two connected sites, and the vCenter Server address of either site changes, the connection
status Connection issue appears and you cannot reconnect the sites.
Solution
1 Open the VAMI for the vSphere Replication appliance that is registered to the vCenter Server whose
address has changed.
2 Reconfigure the vSphere Replication appliance with the new vCenter Server address.
4 In the vSphere Web Client, verify that the status of the connection between the two sites is
Connection issue.
6 Enter the IP address or host name of the server where the Platform Service Controller (PSC) runs,
and provide the credentials of a user that has the VRM [Link] VRM privilege assigned.
You can check the PSC address in the vSphere Replication VAMI on the target site, on the SSO tab
under VR, in the LookupService Address text box.
7 In the list of available vCenter Server instances, select the vCenter Server with the changed IP
address and click OK.
8 Verify that the connection between the two sites is successfully restored and the status is Connected.
Problem
When you use the virtual appliance management interface (VAMI) in Internet Explorer to upload
certificates to the vSphere Replication appliance, you see a certificate error:
The certificate installed with warnings. Remote VRM systems with the 'Accept only SSL certificate
signed by a
trusted CA' option enabled may be unable to connect to this site for the following reason: The
certificate was not issued
for use with the given hostname: vr_appliance_hostname.
VMware, Inc. 50
Using VMware vSphere Replication
Solution
Ignore this error, or connect to the VAMI by using a supported browser other than Internet Explorer.
Problem
If the vCenter Server inventory contains a few hundred or more hosts, the Register VR Server task takes
more than a few minutes to complete.
Cause
vSphere Replication updates each host's SSL thumbprint registry. The vCenter Server Events pane
displays Host is configured for vSphere Replication for each host as the vSphere Replication
server registration task progresses.
Solution
After it finishes, you can use vSphere Replication for incoming replication traffic.
Problem
In heavily loaded environments, generating log bundles can cause vSphere Replication connection
problems during recovery operations. Recovery fails with the error
A generic error occurred in the vSphere Replication Management Server. Exception details: 'Failed
write-locking object: object_ID'.
Cause
vSphere Replication server is blocked when the log bundle is generated. This situation occurs if the
storage for the vSphere Replication virtual machine is overloaded.
VMware, Inc. 51
Using VMware vSphere Replication
Solution
Rerun the recovery. If the recovery still fails, reevaluate the storage bandwidth requirements of the cluster
on which vSphere Replication is running, and the network bandwidth if the storage is NAS.
Problem
Operations such as recovering virtual machines fail with the following error:
Object object_GUID is locked by another ongoing operation in vSphere Replication Management Server.
Try again later.
Cause
When running under heavy load, some vSphere Replication operations might take a longer time to
complete and other operations can fail with this error because a background update operation on the
replication group is slow and holds a lock on the replication for a long time.
Solution
Problem
If two sites are paired, and, while the vSphere Web Client is open on the source site, you restart the
vCenter Server and the vSphere Replication Management Server on the target site, when you try to
configure a replication from the source to the target site, the configuration task fails with the following
error message:
The following error message appears in the HMS log file on the restarted target site:
The following error message appears in the HMS log file on the source site:
VMware, Inc. 52
Using VMware vSphere Replication
Cause
When you establish a connection between two sites, the connection is cached in the user session on both
sites. When you restart the vCenter Server and the vSphere Replication Management Server on the
target site, the information about user sessions is discarded. Because the vSphere Web Client is open
and connected to the source site, the login data remains cached in the vSphere Replication Management
Server. When you configure a replication, the source site tries to connect to the target site using the
cached login data. The target site interprets that data as stale and stops the reconnecting thread.
Solution
n Log out the vSphere Web Client and log back in.
Problem
When you refresh the incoming replications list on a remote site soon after the connection to the source
site has become unavailable, the replications do not display due to a communication error between the
two sites.
Solution
Refresh the vSphere Web Client. Alternatively, log out and log in again.
Problem
vSphere Replication uses certificate-based authentication to connect to vCenter Server. If you change the
vCenter Server certificate, vSphere Replication is inaccessible.
Cause
The vSphere Replication database contains the old vCenter Server certificate.
Solution
u Log into the virtual appliance management interface (VAMI) of the vSphere Replication appliance and
click Configuration > Save and Restart Service.
Do not change any configuration information before clicking Save and Restart Service.
VMware, Inc. 53
Using VMware vSphere Replication
Problem
vSphere Replication needs access to port 80. You might see forbidden HTTP connections in the
vSphere Replication logs.
Solution
Make sure the vSphere Replication appliance has access to port 80 on the storage hosts.
For a list of ports that must be open for vSphere Replication, see vSphere Replication Network Ports in
the vSphere Replication Installation and Configuration guide.
Problem
When you reconfigure the replication and start a full sync, the replication stops in the same data block
with the virus information in it unless the virus data has moved on the disk. Clones of the disk fail, but
other virtual machines of the same size and configuration from the same host replicating to the same
destination datastore replicate successfully.
Solution
Remove the virus information from the replicated guest to avoid replicating virus information.
Make an exception in the anti-virus rules in the firewall to allow the replication to proceed.
Problem
Configuring vSphere Replication on a large number of virtual machines simultaneously when using
vSphere Replication with Virtual SAN storage causes the initial full synchronization of the virtual machine
files to run very slowly.
Cause
Initial full synchronization operations generate heavy I/O traffic. Configuring too many replications at the
same time can overload the Virtual SAN storage.
VMware, Inc. 54
Using VMware vSphere Replication
Solution
Problem
Unable to configure replication for virtual machine VM_name because group group_name cannot be created.
Another virtual machine configured_VM_name}' that has the same instance UUID instance_UUID already
exists on protection site source_site_name.
Cause
n If, due to a connectivity issue or some other problem, an orphaned replication remains on one of the
sites while it is deleted from the other site, the orphaned replication prevents you from configuring a
new replication for the same virtual machine.
n If you have paired two sites and reinstall the vSphere Replication Management server appliance or
reset its database on one of the sites, the other site contains information about the old appliance and
database, and prevents you from configuring new replications.
Solution
n If you have not reinstalled the vSphere Replication Management server, an orphaned replication
exists in your environment, and you know the GID value of that replication, use the Managed Object
Browser (MOB) of the vSphere Replication Management server to delete the replication.
a Navigate to [Link]
b Invoke the destroy method to remove the replication from the site on which the
vSphere Replication Management server runs.
n If you have not reinstalled the vSphere Replication Management server and orphaned replication
exists in your environment, but you do not know the GID value of that replication, the value can be
retrieved from the log files or the vSphere Replication Management server database. Contact
VMware Global Support Services for assistance.
n If the vSphere Replication Management server on one of the sites was reinstalled or otherwise reset:
a Reinstall the vSphere Replication Management server at the other site or reset its database.
b Connect the sites and register any additional vSphere Replication server appliances.
VMware, Inc. 55
Using VMware vSphere Replication
c Remove any temporary hbr* files left over from the target datastore folders.
d Configure all replications, reusing the existing replica .vmdk files as replication seeds.
Problem
You use a vSphere Replication Server on the target site to manage replications, and the replication status
for the virtual machines that this vSphere Replication Server manages is Not active though there is no
obvious reason for this status.
Cause
The vSphere Replication appliance does not check the connectivity between vSphere Replication Server
instances that you register and the ESXi host on the primary site. If you deploy vSphere Replication
servers on the target site, but these servers cannot access the ESXi host on the primary site, the
vSphere Replication servers register successfully with the vSphere Replication appliance, but cannot
operate as expected.
Solution
u If the replication status of a virtual machine is Not active, check the network connectivity between
the host on which the replicated virtual machine is running and the target vSphere Replication Server.
Problem
Response times for vSphere Replication operations can increase as you replicate more virtual machines.
You possibly experience recovery operation timeouts or failures for a few virtual machines, and RPO
violations.
Cause
Every virtual machine in a datastore generates regular read and write operations. Configuring
vSphere Replication on those virtual machines adds another read operation to the regular read and write
operations, which increases the I/O load on the storage. The performance of vSphere Replication
depends on the I/O load of the virtual machines that you replicate and on the capabilities of the storage
hardware. If the load generated by the virtual machines, combined with the extra I/O operations that
vSphere Replication introduces, exceeds the capabilities of your storage hardware, you might experience
slow response times.
VMware, Inc. 56
Using VMware vSphere Replication
Solution
When running vSphere Replication, if response times are greater than 30 ms, reduce the number of
virtual machines that you replicate to the datastore. Alternatively, increase the capabilities of your
hardware. If you suspect that the I/O load on the storage is an issue and you are using VMware Virtual
SAN storage, monitor the I/O latency by using the monitoring tool in the Virtual SAN interface.
Problem
If you use the virtual appliance management interface (VAMI) to install a new SSL certificate, to change
the IP address of the VRM host, or apply another setting on the VR tab, and you click Save and Restart,
the following error message appears:
Cause
The OVF environment of the vSphere Replication appliance might be missing or corrupted.
Solution
a If the [Link] file is not empty, search for the vServiceEnvironmentSection element. If the
vServiceEnvironmentSection element is missing, there might be a problem with the vCenter
Management Web Services process on the vCenter Server machine. Verify that the vCenter
Management Web Services is running on the vCenter Server machine and then try powering the
vSphere Replication appliance off and on. To power the appliance off and on, use the
vSphere Web Client while you are connected to the vCenter Server, and not directly to the ESXi
host.
b If the [Link] file is empty, try to power off and on the vSphere Replication appliance by
using the vSphere Web Client while you are connected to the vCenter Server, and not directly to
the ESXi host.
3 If powering the vSphere Replication appliance does not resolve the issue, most certainly the
appliance has been temporarily removed and re-added in the vCenter Server. There is no solution for
restoring the OVF environment in that case. You must re-deploy the vSphere Replication appliance by
using an empty database, and configure all replications from scratch.
VMware, Inc. 57
Using VMware vSphere Replication
Problem
To apply custom settings to vSphere Replication, you need to establish an SSH connection to the
vSphere Replication appliance, and modify certain configuration files.
To transfer files from and to the vSphere Replication appliance, you use SCP or SFTP protocol.
Because the SSH connections are disabled, you cannot apply the changes that you need, and you
cannot transfer files.
Cause
By default, SSH connections to the vSphere Replication appliance are disabled to strengthen the security
in your environment.
Solution
Prerequisites
Verify that you have the root user credentials to log in to the vSphere Replication appliance.
Procedure
1 In the vSphere Web Client, right-click the vSphere Replication Management (HMS) virtual
machine, and select Open Console.
/usr/bin/[Link]
Procedure
The script configures the vSphere Replication appliance to enable SSH connections.
The Replication Pauses When You Add a New Disk To the Source
VM
You added a new disk to the source VM, which made the replication pause.
Problem
When you add a new disk to the source VM, the replication pauses.
Cause
vSphere Replication detects the addition of a disk to a VM and generates an event such as vSphere
Replication handled a disk addition on a virtual machine.
VMware, Inc. 58
Using VMware vSphere Replication
Solution
You can set up and view an alarm for the event by using the vSphere Web Client. See the vSphere
Administration with the vSphere Client documentation for details.
Problem
vSphere Replication server cannot update its database and becomes unresponsive. Login through
vSphere Replication virtual appliance management interface (VAMI) UI, ssh, or console fails. Attempts to
use the appliance console to log in result in the following error message:
Cause
To prevent data corruption the vSphere Replication appliance is configured to put its root file system in
read-only mode when it detects a problem with the underlying storage.
Solution
1 Resolve the storage problem or use Storage vMotion to migrate the vSphere Replication appliance to
another storage.
3 Verify that you can log in by using the VAMI UI and the appliance console.
VMware, Inc. 59
To ensure continuous replication after a vCenter Server IP change, access the VAMI for the vSphere Replication appliance registered with the changed server, update its configuration with the new server address, and restart the appliance. In the vSphere Web Client, verify the connection status and reconnect the sites by providing the new IP or hostname of the relevant server. This will restore the connection and replication services between the sites .
Redeployment of the vSphere Replication appliance is critical when there are authentication failures with the vCenter Server, lack of proper functionality, or infrastructure changes that cannot be resolved through simpler fixes. Steps for redeployment include powering off the old appliances, removing temporary files from the datastore, deploying new appliances, connecting sites, and reusing existing .vmdk files as initial copies when configuring replications .
To manage the potentially large data volume in the vCenter Server events database due to low RPO settings, administrators can limit the number of days that the database retains event data or increase the RPO value to reduce the frequency of records. Both strategies help manage the data volume and database performance effectively without sacrificing disaster recovery capabilities .
Network compression in vSphere Replication reduces network bandwidth usage, which is beneficial in environments with limited bandwidth. However, it increases CPU resource usage on the source site and target datastore management server due to the added compression and decompression processes. Compression should be enabled when network bandwidth is a constraint and additional CPU capacity is available to handle the increased load .
Enabling quiescing in vSphere Replication can improve data consistency by pausing the virtual machine's processes briefly during replication to achieve a higher consistency level across its disks. However, the level of consistency available through quiescing is determined by the virtual machine's operating system. Additionally, vSphere Replication does not support VSS quiescing on Virtual Volumes, which limits its utility for certain storage configurations .
Setting a very short RPO can lead to more frequent replication cycles, resulting in higher network and CPU load, and rapid accumulation of event data in the vCenter Server database. Additionally, it may not allow enough time to create the desired number of replication instances per day, potentially undermining the intended snapshot strategy for disaster recovery .
A generic error in the vSphere Replication Management Server can be caused by network connectivity issues, database connection problems, or host overload. To address these issues, users should check the detailed error message for clues, retry the failed operation, restart vSphere Replication services, or make necessary infrastructure adjustments. If database lock issues occur, reevaluating and managing SQL Server settings might be necessary .
The Recovery Point Objective (RPO) setting determines the maximum data loss that can be tolerated during the recovery process. A lower RPO value results in less data loss but increases network bandwidth consumption as the replica is kept more up-to-date. This setting also leads to frequent records in the vCenter Server events database due to replication targets being hit more often, which may require adjustments in database retention policies to manage data volume effectively .
Enabling SSH connectivity on a vSphere Replication appliance is necessary for tasks requiring direct file transfer or configuration changes via SCP or SFTP. It involves accessing the vSphere Web Client, opening the console for the vSphere Replication Management VM, logging in as the root user, and running specific commands to enable SSH. This step ensures secure remote management while balancing security risks posed by enabling SSH access .
To configure replication for a virtual machine to a remote target site that is not connected, you need to add the remote site by entering the IP address or hostname where the target PSC runs and provide the necessary user credentials. Once authenticated, the vSphere Replication Management Server will display available vCenter Server instances. Select the target site and continue with the replication configuration. You should ensure that the appropriate permissions are in place and that the required vSphere Replication server instance is available .