1 / 42

Mornay Van Der Walt Managing Architect VMware

Introducing Site Recovery Manager (SRM). Mornay Van Der Walt Managing Architect VMware. Agenda. Datacenter Automation DR and SRM Introduction and Concepts SRM 1.0 Prerequisites and SAN Integration SRM Workflows (Protected and Recovery Site) SRM Roles and Privileges

gzifa
Download Presentation

Mornay Van Der Walt Managing Architect VMware

An Image/Link below is provided (as is) to download presentation Download Policy: Content on the Website is provided to you AS IS for your information and personal use and may not be sold / licensed / shared on other websites without getting consent from its author. Content is provided to you AS IS for your information and personal use only. Download presentation by click this link. While downloading, if for some reason you are not able to download a presentation, the publisher may have deleted the file from their server. During download, if you can't get a presentation, the file might be deleted by the publisher.

E N D

Presentation Transcript


  1. Introducing Site Recovery Manager (SRM) Mornay Van Der Walt Managing Architect VMware

  2. Agenda • Datacenter Automation • DR and SRM Introduction and Concepts • SRM 1.0 Prerequisites and SAN Integration • SRM Workflows (Protected and Recovery Site) • SRM Roles and Privileges • SRM Alarms and Site Status Monitoring • SRM Core benefits and Summary

  3. VMware Leads the Way into the Automated Datacenter Automation = Business Agility • Automate IT processes • Create resource pools • Capacity on-demand OpEx Savings • Production Consolidation • Business Continuity • Workload Balancing Strategic Business Value Management & Automation CapEx Savings • Partitioning • Small Scale Consolidation Virtual Infrastructure Virtual Infrastructure Hypervisor Hypervisor Hypervisor Standardize 3rd generation2006- 2008 Explore 1st generation1998 – 2002 Expand 2nd generation2003 - 2005 3

  4. The Virtual Infrastructure Stack Œ  Virtualization Platform Management & Automation  Virtual Infrastructure Datacenter Automation Desktop Management Business Continuity IT Service Delivery Resource Mgt Availability Mobility Security

  5. Datacenter Automation Products  Business Continuity IT Service Delivery Management & Automation Lab Manager Stage Manager Site RecoveryManager Lifecycle Manager VirtualCenter Server Datacenter Automation

  6. What is a Disaster? • Complete loss of a data center for an extended period of time • Declaration of a disaster usually requires consensus from multiple parts of the organization (at the CxO level) • What is not a disaster? • Failure of an individual host • A temporary service interruption

  7. The Current State of (Physical) DR • DR services tiered according to business needs • Physical DR is challenging • Maintain identical hardware at both locations • Apply upgrades and patches in parallel • Little automation • Error-prone and difficult to test * RPO – Recovery Point Objective: Amount of data lost measured in time * RTO – Recovery Time Objective: The duration of time to restore a services after a disaster

  8. Press In Case of Disaster Advantages of Virtual Disaster Recovery • VMware is a true enabler for Disaster Recovery • Virtual machines are portable • Virtual hardware can be automatically configured • Test and failover can be automated (minimizes human error) • The need for idle hardware is reduced • Costs are lowered, and the quality of service is raised

  9. Introducing Site Recovery Manager (SRM) Site Recovery Manager leverages VMware Infrastructure to transform disaster recovery • What it is: • Site Recovery Manager is a new VMware product for disaster recovery • What it does: • Simplifies and automates disaster recovery processes • Setup Failover • Testing Failback • Site Recovery Manager works with VMware Infrastructure to enable faster, more reliable, affordable disaster recovery

  10. Array Replication VMware SRM at a Glance Site A Site B X ProtectedSite RecoverySite ProtectedSite RecoverySite SRM Supports bi-directional Site protection Site RecoveryManager Site RecoveryManager VirtualCenter VirtualCenter Protected VMs powered on offline Protected VMs online in Protected Site become unavailable Datastore Groups Datastore Groups

  11. SRM Server Side Components * Site 1 Site 2 VC Server 1 VC Server 2 VCMS 1 DB VCMS 2 DB SRM Server 1 SRM Server 2 SRM 2 DB SRM 1 DB Storage Replication Adapter Storage Replication Adapter Array 1 Array 2 Block Replication SW Block Replication SW * Note: Conceptual drawing only. SRM Server may run on another system than VCMS

  12. SRM Concept Relationship “Cheat Sheet”

  13. SRM Concepts And Their Relationships Recovery Plan 1 (Whole Site) Protection Groups: Datastore Group 1 Protection Group 1 VMFS 1 LUN 1 Protection Group 1 Datastore Group 2 Protection Group 2 LUN 2 Protection Group 2 VMFS 2 Protection Group 3 LUN 3 Recovery Plan 2(Subset) Protection Groups: Datastore Group 3 Protection Group 3 LUN 4 VMFS 3 Protection Group 1 LUN 5 VMFS 4 Protected Site Recovery Site

  14. Array Integration with SRM • Vendor-specific scripts support: • Array discovery • Replicated LUN discovery • SRM Test initiation (simulated failover in an isolated environment) • SRM Failover initiation (actual failover of services to the recovery site) In cooperation with VMware and with the full support of VMware the Storage Vendors create the SRAs for their respective storage arrays

  15. SRM Licensing Site 1 Site 2 RecoverySite ProtectedSite Site RecoveryManager Site RecoveryManager VirtualCenter VirtualCenter SRM Protected VMs SRM licensed per CPU socket on the ESX server that hosts the protected virtual machines in the Protected Site VMs not protected by SRM

  16. Safety Tip: DNS Validation – The Rule of ‘Four’ • Validate DNS is working as expected by performing the following DNS lookups for the VC,SRM and ESX servers • Short name • Long name • Reverse • Forward

  17. SRM 1.0 Prerequisites • ESX Server 3.0.2, ESX Server 3.5 or ESX Server 3i • VirtualCenter (VC) server version 2.5 installed at theprotected siteand at therecovery site • SRM server installed at the protected and at the recovery site • SRM plug-in installed on the VI Clients that will access the protected and recovery site • Network configuration that allows TCP connectivity between VC servers and SRM servers • An Oracle or SQL Server database that uses ODBC for connectivity in the protected site and in the recovery site • A SRM license file installed on the VC license server at the protected site and at the recovery site • Pre-configured array-based replication between the protected site and the recovery site

  18. SRM Installation Workflow • At the protected site the following activities are completed: • Installation of the SRM server • Installation of the SRM Plugin into the VI Client • Installation of the Storage Replication Adapter (SRA) • At the recovery site the following activities are completed: • Installation of the SRM server • Installation of the SRM Plugin into the VI Client * • Installation of the Storage Replication Adapter (SRA) • It is important to complete the SRM workflows in the order detailed in this presentation * Note: Optional step, only required if a different instance of the VI Client is used to access the recovery site

  19. SRM Protected and Recovery Site Datacenters SRM PROTECTED SITE SRM RECOVERY SITE

  20. SRM User Interface SRM UI Access Local and Paired Site Protection Setup Recovery Setup

  21. SRM Setup Workflow – Protection Site • At the protection site the following setup activities are completed: • The user pairs the SRM servers at the protected and recovery sites • Security certificates are established between the SRM servers and the VC servers

  22. SRM Setup Workflow – Protection Site - continued • Array Managers Configuration • Select the correct Manager Type from the Manager type drop down box • Storage Partner Participation • VMware provides the SRA specification • Storage Partners create the SRA • Storage Partners test the SRA • VMware review the SRA test results • SRA support with SRM granted if all test are passed

  23. SRM Setup Workflow – Protection Site - continued • SRM identifies available arrays in the Protection and Recovery Side and the replicated datastores and determines the datastore groups Protection Side Array Discovery Recovery Side Array Discovery Replicated Datastores and Datastore Groups

  24. VMware Site Recovery Manager with EMC Replication Integrates VI3 with EMC Replication for simplified DR SRDF and SRDF/A for Symmetrix Provides both synchronous and asynchronous remote replication within Symmetrix storage system MirrorView/S for CLARiiON Provides synchronous remote replication within CLARiiON storage system Celerra Replicator for Celerra Provides asynchronous remote replication within Celerra storage system SRM adapter supports iSCSI LUNs RecoverPoint Continuous Remote Replication (CRR) Provides asynchronous remote replication via RecoverPoint appliances Heterogeneous storage system support Features/Benefits Automates storage management activities with EMC Replication for reduced RTO Automates set-up with local replication for non-disruptive DR testing Improves VM restart with EMC Replication Provides central management of DR from within VirtualCenter Simplifies disaster restart of VMware Virtual Infrastructure environments

  25. SRM Setup Workflow – Protection Site - continued • Using the Inventory Preferences Mapper, the user maps resources in the protected site to their counterparts in the recovery site.

  26. SRM Setup Workflow – Protection Site - continued • A protection group is a group of VMs that will be failed over together to the recovery site • Working through the Protection Group wizard you will need to select a temporary location for placeholder VM configuration files for the protected VMs at the recovery site.

  27. SRM Setup Workflow – Protection Site - continued • Working through the Protection Group wizard a user selects which VMs need to be protected and assigns them to a protection group • The creation of a protection group results in VC inventory updates in the recovery site

  28. SRM Setup Workflow – Recovery Site • At the recovery site the following setup activity is completed: • The user creates a recovery plan which is associated to a single or multiple protection groups

  29. SRM Recovery Plan VM Shutdown High Priority VM Shutdown Prepare Storage High Priority VM Recovery Normal Priority VM Recovery

  30. SRM Recovery Plan - continued • SRM Recovery Plan Benefits: • turn manual BC/DR run books into an automated process • specify the steps of the recovery process in VirtualCenter • Provide a way to test your BC/DR plan in an isolated environment at the recovery site without impacting the protected VMs in the protected site Low Priority VM Recovery Post Test Cleanup Storage Reset

  31. Testing a SRM Recovery Plan • SRM enables you to ‘Test’ a recovery plan by simulating a failover with zero downtime to the protected VMs in the protected site

  32. Testing a SRM Recovery Plan - continued Recovery Only Status Success Errors Success Waiting for Input Test Only

  33. Executing an actual failover in SRM WARNING - Executing an actual failover with SRM will permanently alter virtual machines and infrastructure of both the protected and recovery sites

  34. Executing an actual failover in SRM - continued WARNING - Executing an actual failover with SRM will permanently alter virtual machines and infrastructure of both the protected and recovery sites WARNING - Failback to the protected site is a not an automated process in SRM 1.0

  35. SRM performs a Datastore re-signature • SRM will automatically perform a re-signature on the Datastores in the Recovery Site that were replicated from the SRM Protected Site • LVM.EnableResignature=1 • With a typical re-signature - Datastore names will change to snapxxxx_datastorename, for example • snap-00000002-shared-san-1 • snap-00000002-shared-san-2 • With a SRM initiated re-signature -Datastore will maintain theoriginal datastore name • shared-san-1 • shared-san-2 WARNING - The re-signature of the target datastore has implications during a failback (resync) of data back to the SRM Protected Site

  36. SRM 1.0 Failback Options • SRM 1.0 does not support an automated failback process via the SRM UI • Failback Options • Without SRM (no Recovery Plan, no Testing capabilities, no audit trail) • Unregister the protected virtual machines in the Protected Site VC • Work with your storage team, reverse data replication • VM re-inventory in Protected Site VC, restart and re-ip (manual or scripted) • With SRM (Recovery Plan, Test before Recovery, built-in audit trail) • Delete the protection groups in the Protected Site VC • Unregister the protected virtual machines in the Protected Site VC • Work with your storage team, reverse data replication • Leverage SRM, complete SRM workflows in the reverse direction from Recovery Site back to the Protected Site • Repeat the above steps from the Protected Site back to the Recovery Site to complete the re-protection of the virtual machines in the Protected Site

  37. SRM Default Roles and Privileges

  38. SRM Alarms and Site Status Monitoring • SRM will support the following alarm notification actions: • Send e-mail to specified address • Send SNMP trap to VC trap receivers • Execute specified command on VC host • We recommend you complete setup of alarm notifications for: • Remote Site Down • Remote Site Ping Failed • Replication Group Removed • Recovery Plan Destroyed • License Server Unreachable

  39. SRM Server Monitoring • SRM will raise VC events for the following conditions: • Disk Space Low • CPU use exceeded limit • Memory low • Remote Site not responding • Remote Site heartbeat failed • Recovery Plan Test started, ended, succeeded, failed, or cancelled • Virtual Machine Recovery started, ended, succeeded, failed, or reports a warning

  40. Site Recovery Manager Summary • Site Recovery Manager Leverages VMware Infrastructure to Make Disaster Recovery • Rapid • Automate disaster recovery process • Eliminate complexities of traditional recovery • Reliable • Ensure proper execution of recovery plan • Enable easier, more frequent tests • Manageable • Centrally manage recovery plans • Make plans dynamic to match environment • Affordable • Utilize recovery site infrastructure • Reduce management costs

  41. QUESTIONS Mornay Van Der Walt Managing Architect VMware

  42. Introducing Site Recovery Manager (SRM) Mornay Van Der Walt Managing Architect VMware

More Related