Question 1

What is a Hyperscale database?

Accepted Answer

A Hyperscale database is a database in Azure SQL Database that is backed by the Hyperscale scale-out storage technology. A Hyperscale database supports up to 128 TB of data and provides high throughput and performance, as well as rapid scaling to adapt to workload requirements. Connectivity, query processing, database engine features, and so on, work as they do in any other database in Azure SQL Database.

Question 2

What resource types and purchasing models support Hyperscale?

Accepted Answer

The Hyperscale service tier is available for single databases and elastic pools using the vCore-based purchasing model in Azure SQL Database. It is not available in the DTU-based purchasing model.

Question 3

How does the Hyperscale service tier differ from the General Purpose and Business Critical service tiers?

Accepted Answer

The vCore-based service tiers are differentiated based on database availability and storage type, performance, and maximum storage size as described in resource limit comparison.

Question 4

Who should use the Hyperscale service tier?

Accepted Answer

The Hyperscale service tier is for all customers looking for higher performance and availability, fast backup and restores, fast storage, and compute scalability. This includes customers who are starting out small and growing, those running large mission-critical databases, those who are moving to the cloud to modernize their applications and customers who are already using other service tiers in Azure SQL Database. With Hyperscale, you get:

- Database size that can grow from 10 GB up to 128 TB.
- Compute vCore resources from 2 vCores up to 128 vCores.
- Fast database backups regardless of database size (backups are based on storage snapshots).
- Fast database restores regardless of database size (restores are from storage snapshots).
- Higher log throughput regardless of database size and the number of vCores.
- Read Scale-out using one or more read-only replicas, used for offloading read-only workloads or as hot standby databases.
- Rapid compute scaling in constant time: scale up to accommodate a heavy workload, then scale back down. Scaling operations take single-digit minutes for provisioned compute, and less than a second for serverless compute, regardless of database size.
- The option to pay for what you use with serverless compute, where compute is billed based on usage.

Question 5

What regions currently support Hyperscale?

Accepted Answer

The Hyperscale service tier is available in all regions where Azure SQL Database is available.

Question 6

Can I create multiple Hyperscale databases per server?

Accepted Answer

Yes. For more information and limits on the number of databases per server, see SQL Database resource limits for single and pooled databases on a server.

Question 7

What are the performance characteristics of a Hyperscale database?

Accepted Answer

The Hyperscale architecture provides high performance and throughput while supporting large database sizes.

Question 8

What is the scalability of a Hyperscale database?

Accepted Answer

Hyperscale provides rapid scalability based on your workload demand.

Scaling Up/Down

With Hyperscale, you can scale up the primary compute size in terms of resources like CPU and memory, and then scale down, in constant time. Because the storage is remote, scaling up and scaling down isn't a size-of-data operation.

Support for serverless compute provides automatic scale-up and scale-down, and compute is billed based on usage.

Scaling In/Out

With Hyperscale, you can use three kinds of secondary replicas to cater to read scale-out, high availability, and geo-replication requirements. This includes:
- Up to four high-availability replicas having the same compute size as primary. These serve as hot standby replicas to quickly fail over from the primary. You can also use them to offload read workloads from the primary.
- Up to 30 named replicas having the same or different compute size than the primary, to cater to various read scale-out scenarios.
- A geo-replica in a different Azure region to protect against regional outages and to enable geographic read scale-out.
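
As a rough illustration of the replica limits listed above, the following Python sketch checks a planned replica layout against them. The `ReplicaPlan` class and its field names are hypothetical and exist only for this example; they are not part of any Azure SDK.

```python
from dataclasses import dataclass

# Limits quoted in the answer above.
MAX_HA_REPLICAS = 4       # "up to four high-availability replicas"
MAX_NAMED_REPLICAS = 30   # "up to 30 named replicas"


@dataclass(frozen=True)
class ReplicaPlan:
    """Hypothetical description of a planned Hyperscale replica layout."""
    ha_replicas: int = 0
    named_replicas: int = 0
    geo_replica: bool = False

    def problems(self) -> list[str]:
        """Return the limit violations in this plan (empty if it fits)."""
        issues = []
        if not 0 <= self.ha_replicas <= MAX_HA_REPLICAS:
            issues.append(
                f"HA replicas must be between 0 and {MAX_HA_REPLICAS}, "
                f"got {self.ha_replicas}"
            )
        if not 0 <= self.named_replicas <= MAX_NAMED_REPLICAS:
            issues.append(
                f"named replicas must be between 0 and {MAX_NAMED_REPLICAS}, "
                f"got {self.named_replicas}"
            )
        return issues


plan = ReplicaPlan(ha_replicas=2, named_replicas=5, geo_replica=True)
print(plan.problems() or "plan fits the documented limits")
```

A check like this only validates counts; the service remains the authority on which combinations are actually allowed.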

Question 9

Can I mix Hyperscale and single databases in a single server?

Accepted Answer

Yes, you can.

Question 10

Does Hyperscale require my application programming model to change?

Accepted Answer

No, your application programming model stays the same as for any other MSSQL database. You use your connection string as usual and the other regular ways to interact with your Hyperscale database. Once your application is using the Hyperscale database, your application can take advantage of features such as secondary replicas.

Question 11

What transaction isolation level is the default in a Hyperscale database?

Accepted Answer

On the primary replica, the default transaction isolation level is RCSI (Read Committed Snapshot Isolation). On the Read Scale-out secondary replicas, the default isolation level is Snapshot. This is the same as in any other Azure SQL database.

Question 12

Can I bring my on-premises or IaaS SQL Server license to Hyperscale?

Accepted Answer

With the new, simplified pricing in effect since December 15, 2023, the price of compute has been reduced for newly created Hyperscale databases, all serverless Hyperscale databases, and all Hyperscale elastic pools. With the new, simplified pricing, there is no need to apply Azure Hybrid Benefit (AHB) to obtain equivalent savings. Azure Hybrid Benefit (AHB) can only be applied to older (created before December 15, 2023) Hyperscale single databases with provisioned compute. For those older databases, AHB is only applicable until December 2026, after which those databases will also be billed as per the new, simplified pricing. For more information, see Hyperscale pricing blog and Azure SQL Database Hyperscale – lower, simplified pricing!.

Question 13

What kind of workloads is Hyperscale designed for?

Accepted Answer

Hyperscale works well for all workload types, including OLTP, Hybrid (HTAP), and Analytical (data mart) workloads.

Question 14

How can I choose between Azure Synapse Analytics and Azure SQL Database Hyperscale?

Accepted Answer

If you're currently running interactive analytics queries using SQL Server as a data warehouse, Hyperscale is a great option because you can host small and mid-size data warehouses (such as a few TB up to 128 TB) at a lower cost, and you can migrate your SQL Server data warehouse workloads to Hyperscale with minimal T-SQL code changes.

If you're running data analytics on a large scale with complex queries and sustained ingestion rates higher than 100 MB/s or using Parallel Data Warehouse (PDW), Teradata, or other Massively Parallel Processing (MPP) data warehouses such as Azure Synapse Analytics, then Microsoft Fabric could be the best choice.

Ingestion or log generation rate of 150 MB/s is available as an opt-in preview feature. For more information and to opt-in to 150 MB/s, see Blog: November 2024 Hyperscale enhancements.

Question 15

Can I pause my compute at any time?

Accepted Answer

Not at this time. However, you can scale your compute and the number of replicas down to reduce cost during nonpeak times, or use serverless to automatically scale compute based on usage.

Question 16

Can I provision a compute replica with extra RAM for my memory-intensive workload?

Accepted Answer

For read workloads, you can create a named replica with a higher compute size (more cores and memory) than the primary. For more information on available compute sizes, see Hyperscale storage and compute sizes.

Question 17

Can I provision multiple compute replicas of different sizes?

Accepted Answer

For read workloads, this can be achieved using named replicas.

Question 18

How many Read Scale-out replicas are supported?

Accepted Answer

You can scale the number of HA secondary replicas between 0 and 4 using Azure portal or REST API. Additionally, you can create up to 30 named replicas for many read scale-out scenarios.

Question 19

For high availability, do I need to provision additional compute replicas?

Accepted Answer

In Hyperscale databases, data resiliency is provided at the storage level. You only need one replica (the primary) to provide resiliency. When the compute replica is down, a new replica is created automatically with no data loss.

However, if there's only the primary replica, it can take a minute or two to create a new replica after failover, vs. seconds in the case when an HA secondary replica is available. The new replica will have cold caches initially, which can result in higher storage latency and reduced query performance immediately after failover.

For mission-critical apps that require high availability with minimal failover impact, you should provision at least one HA secondary replica to ensure a hot standby replica is available to serve as a failover target.

Question 20

What is the maximum database size supported with Hyperscale?

Accepted Answer

The maximum size of a single Hyperscale database is currently 128 TB. The maximum size of a database in a Hyperscale elastic pool is currently 100 TB.

Question 21

What is the size of the transaction log with Hyperscale?

Accepted Answer

In Hyperscale, the transaction log is practically infinite, with the restriction that the active portion of the log cannot exceed 1 TB. The active portion of the log can grow because of long-running transactions, or because Change Data Capture processing is not keeping up with the rate of data change. Avoid unnecessarily long and large transactions to stay below this limit. Other than this restriction, you don't need to worry about running out of log space on a system that has high log throughput. However, the log generation rate might be throttled for workloads that write continuously and aggressively. The peak sustained log generation rate is 100 MB/s.

Log generation rate of 150 MB/s is available as an opt-in preview feature. For more information and to opt-in to 150 MB/s, see Blog: November 2024 Hyperscale enhancements.

Question 22

Does my tempdb scale as my database grows?

Accepted Answer

Your tempdb database is located on local SSD storage and is sized proportionally to the compute size (the number of cores) that you provision. The size of tempdb is not configurable and is managed for you. To determine maximum tempdb size for your database, see Hyperscale storage and compute sizes.

Question 23

Does my database size automatically grow, or do I have to manage the size of data files?

Accepted Answer

Your database size automatically grows as you insert/ingest more data.

Question 24

What is the smallest database size that Hyperscale supports?

Accepted Answer

10 GB. A Hyperscale database is created with a starting size of 10 GB and grows as needed in 10-GB chunks.

Question 25

In what increments does my database size grow?

Accepted Answer

Each data file grows by 10 GB. Multiple data files can grow at the same time.

Question 26

Is the storage in Hyperscale local or remote?

Accepted Answer

In Hyperscale, data files are stored in Azure standard storage. Data is fully cached on local SSD storage, on page servers that are remote to compute replicas. In addition, compute replicas have data caches on local SSD and in memory, to reduce the frequency of fetching data from remote page servers.

Question 27

Can I manage or define files or filegroups with Hyperscale?

Accepted Answer

No. Data files are added automatically to the PRIMARY filegroup. The common reasons for creating additional filegroups do not apply in the Hyperscale storage architecture, or in Azure SQL Database more broadly.

Question 28

Can I provision a hard cap on the data growth for my database?

Accepted Answer

No.

Question 29

Is database shrink supported?

Accepted Answer

Yes, database and file shrink operations are currently in preview. For more information on the preview, see Shrink for Azure SQL Database Hyperscale.

Question 30

Is data compression supported?

Accepted Answer

Yes, just like in any other Azure SQL DB database. This includes row, page, and columnstore compression.

Question 31

If I have a huge table, is table data spread out across multiple data files?

Accepted Answer

Yes. The data pages associated with a given table can end up in multiple data files, which are all part of the same filegroup. The MSSQL database engine uses proportional fill strategy to distribute data over data files.

Question 32

Can I move my existing databases in Azure SQL Database to the Hyperscale service tier?

Accepted Answer

Yes. You can move your existing databases in Azure SQL Database to Hyperscale. For proofs of concept (POCs), we recommend you make a copy of your database and migrate the copy to Hyperscale.

The time required to move an existing database to Hyperscale consists of the time to copy data, and the time to replay the changes made in the source database while copying data. The data copy time is proportional to data size. The time to replay changes is shorter if the move is done during a period of low write activity.

Get sample code to migrate existing Azure SQL Databases to Hyperscale in the Azure portal, Azure CLI, PowerShell, and Transact-SQL in Migrate an existing database to Hyperscale.

Reverse migration to the General Purpose service tier allows customers who have recently migrated an existing database in Azure SQL Database to the Hyperscale service tier to move back, should Hyperscale not meet their needs. While reverse migration is initiated by a service tier change, it's essentially a size-of-data operation between different architectures. Similarly to migration to Hyperscale, reverse migration is faster if done during a period of low write activity. Learn the limitations for reverse migration.

Question 33

Can I move my Hyperscale databases to other service tiers?

Accepted Answer

If you have previously migrated an existing database in Azure SQL Database to the Hyperscale service tier, you can reverse migrate it to the General Purpose service tier within 45 days of the original migration to Hyperscale. If you wish to migrate the database to another service tier, such as Business Critical, first reverse migrate to the General Purpose service tier, then modify the service tier. Reverse migration is a size-of-data operation.

Databases created in the Hyperscale service tier can't be moved to other service tiers.

Learn how to reverse migrate from Hyperscale, including the limitations for reverse migration and impacted backup policies.

Question 34

Do I lose any functionality or capabilities after migration to the Hyperscale service tier?

Accepted Answer

Yes. Some Azure SQL Database features are not supported in Hyperscale yet. If some of these features are enabled for your database, migration to Hyperscale could be blocked, or these features will stop working after migration. We expect these limitations to be temporary. For details, see Known limitations.

Question 35

Can I move my on-premises SQL Server database, or my SQL Server database in a cloud virtual machine, to Hyperscale?

Accepted Answer

Yes. You can use many existing migration technologies to migrate to Hyperscale, including transactional replication, and any other data movement technologies (Bulk Copy, Azure Data Factory, Azure Databricks, SSIS). See also the Azure Database Migration Service, which supports many migration scenarios.

Question 36

What is my downtime during migration from an on-premises or virtual machine environment to Hyperscale, and how can I minimize it?

Accepted Answer

Downtime for migration to Hyperscale is the same as the downtime when you migrate your databases to other Azure SQL Database service tiers. You can use transactional replication to minimize downtime during migration for databases up to a few TB in size. For very large databases (10+ TB), you can consider implementing the migration process using ADF, Spark, or other bulk data movement technologies.

Question 37

How much time would it take to bring in X amount of data to Hyperscale?

Accepted Answer

Hyperscale is capable of consuming 100 MB/s of new/changed data, but the time needed to move data into databases in Azure SQL Database is also affected by available network throughput, source read speed and the target database service level objective. Log generation rate of 150 MB/s is available as an opt-in preview feature. For more information and to opt-in to 150 MB/s, see Blog: November 2024 Hyperscale enhancements.
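
To make the arithmetic concrete, here is a minimal Python sketch that turns the quoted rates into a best-case load time. It assumes the log generation rate is the only bottleneck and that the log volume roughly equals the data volume, which ignores index maintenance, compression, network throughput, and source read speed. The function name and the use of decimal units (1 GB = 1,000 MB) are assumptions made for this example.

```python
def min_ingest_hours(data_gb: float, log_rate_mb_per_s: float = 100.0) -> float:
    """Best-case load time in hours if the log rate is the only limit.

    Assumes decimal units (1 GB = 1,000 MB) and that every byte loaded
    produces about one byte of log; real loads are usually slower.
    """
    if data_gb < 0 or log_rate_mb_per_s <= 0:
        raise ValueError("data size must be >= 0 and rate must be > 0")
    seconds = (data_gb * 1000) / log_rate_mb_per_s
    return seconds / 3600


for rate in (100.0, 150.0):  # default cap and the opt-in preview rate
    hours = min_ingest_hours(1000, rate)  # 1,000 GB, about 1 TB
    print(f"1 TB at {rate:.0f} MB/s: at least {hours:.1f} hours")
```

Treat the result as a lower bound for planning purposes, not a prediction.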

Question 38

Can I read data from blob storage and do a fast load (like Polybase in Azure Synapse Analytics)?

Accepted Answer

You can have a client application read data from Azure Storage and load it into a Hyperscale database (just like you can with any other database in Azure SQL Database). Polybase is currently not supported in Azure SQL Database. As an alternative for fast loads, you can use Azure Data Factory, or use a Spark job in Azure Databricks with the Spark connector for SQL. The Spark connector for SQL supports bulk insert.

It is also possible to bulk read data from Azure Blob Storage using BULK INSERT or OPENROWSET: Examples of Bulk Access to Data in Azure Blob Storage.

The simple and bulk-logged recovery models are not supported in Hyperscale. The full recovery model is required to provide high availability and point-in-time recovery. However, the Hyperscale log architecture provides a better data ingest rate compared to other Azure SQL Database service tiers.

Question 39

Does Hyperscale allow provisioning multiple nodes for parallel ingesting of large amounts of data?

Accepted Answer

No. Hyperscale is a symmetric multi-processing (SMP) architecture and is not a massively parallel processing (MPP) or a multi-master architecture. You can only create multiple replicas to scale out read-only workloads.

Question 40

Does Hyperscale support migration from other data sources such as Amazon Aurora, MySQL, PostgreSQL, Oracle, DB2, and other database platforms?

Accepted Answer

Yes. Azure Database Migration Service supports many migration scenarios.

Question 41

What SLAs are provided for a Hyperscale database?

Accepted Answer

See SLA for Azure SQL Database. We recommend adding HA secondary replicas for critical workloads. This provides faster failover, and reduces potential performance impact immediately after failover.

Question 42

Are the database backups managed for me by Azure SQL Database?

Accepted Answer

Yes.

Question 43

Does Hyperscale support Availability Zones?

Accepted Answer

Yes, Hyperscale supports zone redundant configuration. At least one HA secondary replica and the use of zone-redundant or geo-zone-redundant storage is required for enabling the zone redundant configuration for Hyperscale.

Question 44

Does Hyperscale support elastic pools?

Accepted Answer

Yes. For more information, see Hyperscale elastic pools and Blog: Hyperscale Elastic Pools are now generally available.

Question 45

How often are database backups taken?

Accepted Answer

There are no traditional full, differential, and transaction log backups for Hyperscale databases. Instead, there are regular storage snapshots of data files, with a separate snapshot cadence for each file. The generated transaction log is retained as-is for the configured retention period. At restore time, relevant transaction log records are applied to restored storage snapshots. Regardless of snapshot cadence, this results in a transactionally consistent database without any data loss as of the specified point in time within the retention period. In effect, database backup in Hyperscale is continuous.
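
The restore idea described above (start from a storage snapshot, then apply the retained log up to the requested time) can be sketched with a toy model in Python. This is only an illustration of the principle, not how the service is implemented: the data structures, the key/value "pages", and the function name are all hypothetical.

```python
from bisect import bisect_right


def restore_file(snapshots, log, restore_time):
    """Toy point-in-time restore for one data file.

    snapshots: list of (time, state_dict), sorted by time.
    log: list of (time, key, value) change records, sorted by time.
    Picks the latest snapshot taken at or before restore_time, then
    replays log records made after that snapshot up to restore_time.
    """
    times = [t for t, _ in snapshots]
    i = bisect_right(times, restore_time) - 1
    if i < 0:
        raise ValueError("no snapshot at or before the restore point")
    snap_time, state = snapshots[i]
    restored = dict(state)  # copy so the snapshot itself is not modified
    for t, key, value in log:
        if snap_time < t <= restore_time:
            restored[key] = value
    return restored


log = [(5, "a", 1), (12, "b", 2), (15, "a", 3), (20, "b", 9)]
frequent = [(0, {}), (10, {"a": 1})]   # two snapshots
sparse = [(0, {})]                     # a single, older snapshot

# Both snapshot cadences give the same state as of time 15.
print(restore_file(frequent, log, 15))
print(restore_file(sparse, log, 15))
```

In this model a sparser snapshot cadence only means more log to replay; the restored state as of the requested time is the same.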

Question 46

Does Hyperscale support point-in-time restore?

Accepted Answer

Yes.

Question 47

What is the Recovery Point Objective (RPO)/Recovery Time Objective (RTO) for database restore in Hyperscale?

Accepted Answer

The RPO for point-in-time restore is 0 min. Most point-in-time restore operations complete within 60 minutes regardless of database size. Restore time can be longer for larger databases, and if the database experienced significant write activity before and up to the restore point in time. Changing the storage redundancy when issuing a restore can result in longer restore times, because the restore then becomes a size-of-data operation and the time is proportional to the database size.

Question 48

Does database backup affect compute performance on my primary or secondary replicas?

Accepted Answer

No. Backups are managed by the storage subsystem, and use storage snapshots. They do not impact user workloads.

Question 49

Can I perform geo-restore with a Hyperscale database?

Accepted Answer

Yes. Geo-restore is fully supported if geo-redundant storage is used. This is the default for new databases. Unlike point-in-time restore, geo-restore requires a size-of-data operation. Data files are copied in parallel, so the duration of this operation depends primarily on the size of the largest file in the database, rather than on total database size. Geo-restore time will be significantly shorter if the database is restored in the Azure region that is paired with the region of the source database.

Question 50

Can I set up geo-replication with a Hyperscale database?

Accepted Answer

Yes. Geo-replication can be set up for Hyperscale databases.

Question 51

Can I take a Hyperscale database backup and restore it to my on-premises server, or on SQL Server in a VM?

Accepted Answer

No. The storage format for Hyperscale databases is different from any released version of SQL Server, and you don't control backups or have access to them. To take your data out of a Hyperscale database, you can extract it using any data movement technology, such as Azure Data Factory, Azure Databricks, or SSIS.

Question 52

Will I be charged for backup storage costs in Hyperscale?

Accepted Answer

Yes. Effective May 4, 2022, backups for all new databases are charged based on the backup storage consumed and selected storage redundancy at rates captured in Azure SQL Database pricing page. For Hyperscale databases created before May 4, 2022, backups will be charged only if backup retention is set to be greater than seven days. To learn more, see Hyperscale backups and storage redundancy.

Question 53

How can I measure backup storage size in my Hyperscale database?

Accepted Answer

Details on how to measure backup storage size are captured in Automated Backups.

Question 54

How do I know what my backup bill will be?

Accepted Answer

To determine your backup storage bill, backup storage size is calculated periodically, and multiplied by the backup storage rate and the number of hours since the last calculation. To estimate your backup bill for a time period, multiply the billable backup storage size for every hour of the period by the backup storage rate, and add up all hourly amounts. To query relevant Azure Monitor metrics for multiple hourly intervals programmatically, use Azure Monitor REST API. Backup billing in the serverless compute tier is the same as in the provisioned compute tier.
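
The estimate described above can be sketched in a few lines of Python. The function name, the sample sizes, and the price are made up for illustration. The sketch also assumes the price is quoted per GB per month and converts it to an hourly rate using 730 hours per month; check the pricing page for the actual rate and unit.

```python
def estimate_backup_bill(hourly_billable_gb, rate_per_gb_month,
                         hours_per_month=730):
    """Sum hourly backup charges over a period.

    hourly_billable_gb: one billable backup storage size (GB) per hour.
    rate_per_gb_month: price per GB per month (placeholder value below).
    hours_per_month: assumed conversion factor to an hourly rate.
    """
    hourly_rate = rate_per_gb_month / hours_per_month
    return sum(size_gb * hourly_rate for size_gb in hourly_billable_gb)


# Example: billable size grows from 100 GB to 123 GB over one day.
sizes = [100 + hour for hour in range(24)]
print(f"Estimated charge: {estimate_backup_bill(sizes, 0.10):.4f}")
```

The hourly sizes would typically come from the Azure Monitor metrics mentioned above; here they are hard-coded.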

Question 55

How will my workload influence my backup storage costs?

Accepted Answer

Backup costs will be higher for workloads that add, modify, or delete large volumes of data in the database. Conversely, workloads that are mostly read-only might have smaller backup costs.

Question 56

How can I minimize backup storage costs?

Accepted Answer

Details on how to minimize the backup storage costs are captured in Automated Backups.

Question 57

Can I geo-restore my Hyperscale database to another service tier, or vice-versa?

Accepted Answer

Currently, non-Hyperscale service tiers (Standard/Premium/General Purpose/Business Critical) backups cannot be geo-restored into a Hyperscale service tier and vice-versa. To convert a non-Hyperscale database to a Hyperscale database, change the service tier after a restore.

Question 58

How much write throughput can I push in a Hyperscale database?

Accepted Answer

Transaction log throughput cap is set to 100 MB/s for any Hyperscale compute size. The ability to achieve this rate depends on multiple factors, including but not limited to workload type, client configuration and performance, and having sufficient compute capacity on the primary compute replica to produce log records at this rate. Log generation rate of 150 MB/s is available as an opt-in preview feature. For more information and to opt-in to 150 MB/s, see Blog: November 2024 Hyperscale enhancements.

Question 59

How many IOPS do I get on the largest compute?

Accepted Answer

IOPS and IO latency will vary depending on the workload patterns. If the data being accessed is cached in RBPEX on the compute replica, you will see similar IO performance as in Business Critical or Premium service tiers.

Question 60

Does my throughput get affected by backups?

Accepted Answer

No. Compute is decoupled from the storage layer. This eliminates the performance impact of backup.

Question 61

Does my throughput get affected as I provision additional compute replicas?

Accepted Answer

Because the storage is shared and there is no direct physical replication happening between primary and secondary compute replicas, the throughput on the primary replica will not be directly affected by adding secondary replicas. However, continuous and aggressive write workloads might be throttled on the primary to allow log apply on secondary replicas and page servers to catch up. This avoids poor read performance on secondary replicas and long recovery after failover to an HA secondary replica.

Question 62

Is Hyperscale well suited for resource-intensive, long-running queries, and transactions?

Accepted Answer

Yes. However, just like in other Azure SQL DB databases, connections might be terminated by very infrequent transient errors, which can abort long-running queries and roll back transactions. One cause of transient errors is when the system quickly shifts the database to a different compute node to ensure continued compute and storage resource availability, or to perform planned maintenance. Most of these reconfiguration events finish in less than 10 seconds. Applications that connect to your database should be built to expect and tolerate these infrequent transient errors by implementing retry logic. Additionally, consider configuring a maintenance window that matches your workload schedule to avoid transient errors due to planned maintenance.
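
A minimal Python sketch of the retry logic mentioned above follows, using exponential backoff with jitter. `TransientError` is a hypothetical stand-in for whatever exception your database driver raises for transient faults, and the delay values are illustrative assumptions rather than recommended settings.

```python
import random
import time


class TransientError(Exception):
    """Stand-in for a driver error that signals a transient fault."""


def run_with_retry(operation, max_attempts=5, base_delay=1.0,
                   max_delay=30.0, sleep=time.sleep):
    """Call operation(), retrying on TransientError with backoff.

    operation should be a whole unit of work (open a connection, run
    the full transaction), because a transient error rolls back any
    transaction that was in flight.
    """
    for attempt in range(1, max_attempts + 1):
        try:
            return operation()
        except TransientError:
            if attempt == max_attempts:
                raise  # give up and surface the error to the caller
            delay = min(max_delay, base_delay * 2 ** (attempt - 1))
            sleep(delay + random.uniform(0, delay / 2))  # add jitter


# Simulated operation that fails twice, then succeeds.
attempts = {"count": 0}

def flaky_operation():
    attempts["count"] += 1
    if attempts["count"] < 3:
        raise TransientError("simulated reconfiguration")
    return "ok"

print(run_with_retry(flaky_operation, base_delay=0.01))
```

Retrying the whole unit of work, rather than a single statement, matters because the interrupted transaction has already been rolled back.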

Question 63

How do I diagnose and troubleshoot performance problems in a Hyperscale database?

Accepted Answer

For most performance problems, particularly those not rooted in storage performance, common SQL diagnostic and troubleshooting steps apply. For Hyperscale-specific storage diagnostics, see SQL Hyperscale performance troubleshooting diagnostics.

Question 64

How does the maximum memory limit in serverless compare to provisioned compute?

Accepted Answer

The maximum amount of memory that a serverless database can scale up to is 3 GB/vCore times the maximum number of vCores configured, compared to more than 5 GB/vCore times the same number of vCores in provisioned compute. Review serverless Hyperscale resource limits for details.
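
As a quick worked example of the per-vCore figures above, the Python sketch below compares the two for a given vCore count. The function names are made up for this example, and the provisioned figure is only a lower bound, because the answer says "more than 5 GB/vCore"; exact values are in the resource limits documentation.

```python
SERVERLESS_GB_PER_VCORE = 3.0        # from the answer above
PROVISIONED_MIN_GB_PER_VCORE = 5.0   # "more than 5 GB/vCore": lower bound


def serverless_max_memory_gb(max_vcores: int) -> float:
    """Maximum memory a serverless database can scale up to."""
    return SERVERLESS_GB_PER_VCORE * max_vcores


def provisioned_memory_floor_gb(vcores: int) -> float:
    """Lower bound on memory for the same vCore count when provisioned."""
    return PROVISIONED_MIN_GB_PER_VCORE * vcores


vcores = 8
print(f"serverless, max {vcores} vCores: {serverless_max_memory_gb(vcores)} GB")
print(f"provisioned, {vcores} vCores: more than {provisioned_memory_floor_gb(vcores)} GB")
```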

Azure SQL Database Hyperscale FAQ