Skype for Business: Execute Unplanned DR Failover and Failback

This article outlines the unplanned failover and failback procedure for Skype for Business. The DR setup must already be provisioned before DR can be activated.

The deployment must have the following setup:

  1) The Skype for Business HQ Front End, HQ SQL and HQ OOS servers will be part of the HQ Active Directory site.
  2) The HQ site will have its own dedicated Edge server.
  3) The Skype for Business DR Front End, DR SQL and DR OOS servers will be part of the DR Active Directory site.
  4) The DR site will have its own dedicated Edge server.
  5) The DR Front End and Edge servers will be in the same Skype for Business site, since the site is a standby site.
  6) Synchronous data replication will be enabled between the HQ FE pool and the DR FE pool.
  7) The DR SQL store information must be published in Topology Builder.
  8) The DR Skype for Business FE pool must be specified as the associated backup pool in Topology Builder. The DR file stores must also be published in Topology Builder.
  9) The DNS namespaces of the HQ and DR site Edge servers can be load balanced. The DR Edge must be made unavailable during normal operation, and connections to it must be allowed only during a DR scenario.
  10) The required network communication from HQ to the DR FE and SQL servers must be in place for pool replication to happen.
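
While HQ is still healthy, the pool pairing from the list above can be sanity-checked from the Skype for Business Server Management Shell. This is only a minimal sketch: `hqpool.contoso.com` is a placeholder FQDN, not a value from this deployment.

```powershell
# Placeholder (assumption): replace with the FQDN of the HQ Front End pool.
$hqPool = "hqpool.contoso.com"

# Show which pool is published in the topology as the backup for the HQ pool.
Get-CsPoolBackupRelationship -PoolFqdn $hqPool

# Show the state of the backup service that replicates data between the paired pools.
Get-CsBackupServiceStatus -PoolFqdn $hqPool
```

If the first command does not return the DR pool, the pairing has not been published and the failover steps below will not work as described.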

Example of DR setup with main site:

 

https://exchangequery.files.wordpress.com/2017/11/sfb112.png?w=600

Procedure to activate unplanned DR failover:

An unplanned failover is a total disaster scenario in which the main site is completely unavailable.

The CMS (Central Management Store), the HQ FE pool and the HQ Edge services will therefore not be accessible during this scenario.

The following steps can be used:

1) Configure the DNS load balancer so that the Edge server DNS namespaces are ready to accept connections on the DR site Edge server. There are multiple ways to achieve this, depending on the network setup. As a last resort, we can simply add two entries (HQ and DR) for each DNS namespace and keep the DR Edge services stopped, starting them only during a DR scenario.
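
One possible way to confirm that the Edge namespaces now resolve to the DR Edge is sketched below. All host names and IP addresses are hypothetical placeholders, and the check assumes it is run from a machine that uses the same DNS view as external clients.

```powershell
# Placeholders (assumptions): external Edge names and DR Edge public IPs.
$edgeNames = "sip.contoso.com", "webconf.contoso.com", "av.contoso.com"
$drEdgeIps = "203.0.113.10", "203.0.113.11", "203.0.113.12"

foreach ($name in $edgeNames) {
    # Collect the A records currently returned for this name.
    $ips = @(Resolve-DnsName -Name $name -Type A -ErrorAction SilentlyContinue |
        Where-Object { $_.Type -eq 'A' } |
        Select-Object -ExpandProperty IPAddress)

    # True if at least one returned address belongs to the DR Edge.
    $pointsToDr = @($ips | Where-Object { $drEdgeIps -contains $_ }).Count -gt 0

    "{0} -> {1} (resolves to DR: {2})" -f $name, ($ips -join ", "), $pointsToDr
}
```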

2) Activate the CMS

We can run the command below to check the CMS status:

Invoke-CsManagementServerFailover -WhatIf

This command will throw an error because the CMS is not available: it was hosted in the main site, and the main site is completely inaccessible.

https://exchangequery.files.wordpress.com/2017/11/sfbdr7.png?w=600

In a normal state, when the main site is available (as in a planned failover), the command returns the following:

https://exchangequery.files.wordpress.com/2017/11/sfbdr8.png?w=600

It shows the current state of the CMS and its proposed state after the failover.

https://exchangequery.files.wordpress.com/2017/11/sfbdr9.png?w=600

3) In this scenario, the CMS must be activated forcefully with the command below:

Invoke-CsManagementServerFailover -BackupSqlServerFqdn "DRSQLFQDN" -BackupSqlInstanceName "BACKUPDRSQLINSTANCE" -Force:$true

https://exchangequery.files.wordpress.com/2017/11/untitled11.png?w=600

https://exchangequery.files.wordpress.com/2017/11/sfbdr1.png?w=600

4) Wait for replication to complete:

We can check the replication status with the command below:

Get-CsManagementStoreReplicationStatus | ft
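
Instead of re-running the command by hand, the wait can be scripted. The sketch below is one possible approach, not part of the original procedure: the DR server names, the 15-minute timeout and the 30-second polling interval are all assumptions. It checks only the DR replicas, because the HQ replicas will stay out of date while the main site is down.

```powershell
# Placeholders (assumptions): FQDNs of the DR servers that hold a CMS replica.
$drReplicas = "drfe01.contoso.com", "dredge01.contoso.com"
$deadline   = (Get-Date).AddMinutes(15)   # arbitrary timeout

do {
    # Keep only the DR replicas that have not yet caught up.
    $pending = @(Get-CsManagementStoreReplicationStatus |
        Where-Object { $drReplicas -contains $_.ReplicaFqdn -and -not $_.UpToDate })

    if ($pending.Count -eq 0) { break }

    $pending | Format-Table ReplicaFqdn, UpToDate, LastStatusReport
    Start-Sleep -Seconds 30
} while ((Get-Date) -lt $deadline)

if ($pending.Count -gt 0) {
    Write-Warning "Some DR replicas are still not up to date."
} else {
    "All listed DR replicas report UpToDate."
}
```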

5) Reconfigure the Edge federation route via the DR Edge, publish the topology, and run setup on all Edge servers.

Enable federation on the DR Edge and change the federation route so that it goes via the DR Edge.

https://exchangequery.files.wordpress.com/2017/11/untitled21.png?w=600

https://exchangequery.files.wordpress.com/2017/11/untitled12.png?w=600

6) Fail over the pool using the disaster mode switch:

Invoke-CsPoolFailOver -PoolFqdn "poolfqdn" -Force -DisasterMode

https://exchangequery.files.wordpress.com/2017/11/untitled14.png?w=600
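
As a cautious variation on this step, the pool failover can be previewed before it is run, and a test user can be inspected afterwards. This is a sketch under assumptions: the pool FQDN and the SIP address are placeholders.

```powershell
# Placeholders (assumptions): HQ pool FQDN and a test user homed on that pool.
$hqPool   = "hqpool.contoso.com"
$testUser = "sip:testuser@contoso.com"

# Preview what the disaster-mode failover would do, without changing anything.
Invoke-CsPoolFailOver -PoolFqdn $hqPool -DisasterMode -WhatIf

# After the real failover has been run, show the primary and backup pool details for the test user.
Get-CsUserPoolInfo -Identity $testUser | Format-List
```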

Failback to HQ site:

Once the main site is back, make sure the DNS namespaces are available in the main site.

1) Fail over the CMS:

Invoke-CsManagementServerFailover

Wait for the CMS replication to complete in the main site.
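
A minimal way to confirm that the CMS has moved back and that the replicas have caught up is sketched below. It uses only the replication status cmdlet already shown in this article.

```powershell
# Show which pool currently hosts the active Central Management Store.
Get-CsManagementStoreReplicationStatus -CentralManagementStoreStatus

# List any replicas that have not yet caught up; no output means all are up to date.
Get-CsManagementStoreReplicationStatus |
    Where-Object { -not $_.UpToDate } |
    Format-Table ReplicaFqdn, LastStatusReport
```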

2) Fail back the FE pool to the main site:

Invoke-CsPoolFailBack -PoolFqdn "poolfqdn"

https://exchangequery.files.wordpress.com/2017/11/sfbdr2.png?w=600

3) Reconfigure the Edge federation route back via the HQ Edge, publish the topology, and run setup on all Edge servers.

Note: DNS routing and the VoIP components (SIP/PSTN integration) vary from deployment to deployment. The DR setup and failover procedure need to be planned according to these configurations.