Measuring Disk Latency with Windows Performance Monitor (Perfmon)

My name is Flavio Muratore and I am a Senior Support Escalation Engineer with the Windows Core Team. One subject we haven’t written much about in the Core team blog is “disk performance”.

Today I would like to talk a little bit about measuring Physical Disk IO Latency with Windows Performance Monitor (perfmon). Most likely you have some experience with Perfmon, it’s been around since the NT days.
You have probably heard general statements about what are acceptable disk latency measurements: “Less than 10 milliseconds is good and more than 20 milliseconds is bad”. Although these rules of thumb are used to simplify analysis, they do not apply in all cases and may lead to incorrect conclusions. Let’s check how this really works so we can understand these numbers.

Summary: The IO latency measured in perfmon includes all the time spent in the hardware layers as well as the time spent in the Microsoft Port Driver queue (Storport.sys for SCSI). If the running processes generate a large storport queue, the measured latency increases, as IO has to wait before getting dispatched to the hardware layers.

What is disk IO latency?
We can define disk IO latency as: A measure of the time delay from the time a disk IO request is created, until the time the disk IO request is completed.

What counters in Windows Performance Monitor show the physical disk latency?
“Physical disk performance object -> Avg. Disk sec/Read counter” - Shows the average read latency.
“Physical disk performance object -> Avg. Disk sec/Write counter” - Shows the average write latency.
“Physical disk performance object -> Avg. Disk sec/Transfer counter” - Shows the combined averages for both read and writes.
The “_Total” instance is an average of the latencies for all physical disks in the computer.
Each other instance represents an individual Physical Disk.

Note: Do no confuse with Avg. Disk Transfers/sec, which is a completely different counter.

Where does the performance data comes from?
For the “physical disk performance object”, the data is captured at the “Partition Manager” level in the storage stack.
Keep in mind Perfmon does not create any performance data per se; it only consumes data provided by other subsystems within Windows.

Where is the partition Manager in the Storage Stack?

A simplified explanation on the Windows Storage Stack follows.
When an application creates an IO request, it sends it to the Windows IO Subsystem (at the top of the stack). The IO will then make its way all the way down the stack (to the Hardware Disk Subsystem) and then come all the way back up. During this process, each layer will perform its function and then hand over the IO to the next layer.

image

So what are we really measuring with the Physical disk performance object -> Avg. Disk sec/Transfer (or /Read, or /Write) counter?
We are measuring all the time spent below the partition manager level.
When the IO request is sent by the Partition Manager down the stack we time stamp it, when it arrives back we time stamp it again and calculate the time difference. The time difference is the latency.

This means we are accounting for the time spent in the following components:

  1. Class Driver – manages the device type, such as disks, tapes, etc.
  2. Port Driver – manages the transport protocol, such as SCSI, FC, SATA, etc.
  3. Device Miniport Driver – This is the device driver for the Storage Adapter. It is supplied by the vendor of the device (Raid Controller, and FC HBA).
  4. Disk Subsystem – This includes everything below the Device Miniport Driver – This could be as simple as a cable connected to a single physical hard disk, or as complex as a Storage Area Network.

How disk queuing affects the measured latency in Perfmon?
There is only a limited number of IO a disk subsystem can accept at a given time. The excess IO gets queued until the disk can accept IO again. The time IO spends in the queues below the Partition Manager is accounted in the Perfmon physical disk latency measurements. As queues grow larger and IO has to wait longer, the measured latency also grows.

There are a multiple queues below the Partition Manager level:

  • Microsoft Port Driver Queue -SCSIport or Storport queue.
  • Vendor Supplied Device Driver Queue- OEM Device driver.
  • Hardware Queues - such as disk controller queue, SAN switches queue, array controller queue, hard disk queue.
  • Although not a queue, we also account for the time the hard disk spends actively servicing the IO and the travel time all the way back to the partition manager level to be marked as completed.

Finally, special attention to the Port Driver Queue (for SCSI Storport.sys).

The Port Driver is the last Microsoft component to touch an IO before we hand it off to the vendor supplied Device Miniport Driver.
If the Device Miniport Driver can’t accept any more IO because its queue and/or the hardware queues below are saturated, we will start accumulating IO on the Port Driver Queue. The size of the Microsoft Port Driver queue is limited only by the available system memory (RAM) and can grow very large, causing large measured latency.

In Conclusion:
The time the IO spent in queue is added to the disk latency in perfmon.

To keep the queue under control you have to tune your applications to limit the maximum number of outstanding I/O operations they generate. That’s a subject for another blog post.

Reference:
For SCSI Disks (FC/RAID) you can enable Storport tracing to measure the latency below the Port Driver level. This does not account for the time spent in the storport queue or anything above. Essentially this is the lowest level we can possibly monitor the latency inside Windows before the IO is handed over to third party components. Check this excellent blog from NTdebug team for details.
“Storport ETW Logging to Measure Requests Made to a Disk Unit”
https://blogs.msdn.com/b/ntdebugging/archive/2010/04/22/etw-storport.aspx

 

Flavio Muratore
Senior Support Escalation Engineer
Microsoft Enterprise Platforms Support

Comments

  • Anonymous
    January 01, 2003
    @Ah_Chao,The unit is seconds but have millisecond precision.Counter description:Avg. Disk sec/Transfer (Avg. Disk sec/Read, Avg. Disk sec/Write)Displays the average time the disk transfers took to complete, in seconds. Although the scale is seconds, the counter has millisecond precision, meaning a value of 0.004 indicates the average time for disk transfers to complete was 4 milliseconds.This is the counter in Perfmon used to measure IO latency.For other counters see: Windows Performance Monitor Disk Counters Explained - blogs.technet.com/.../windows-performance-monitor-disk-counters-explained.aspx
  • Anonymous
    January 01, 2003
    Very nice article. In this article you mentioned about [The size of the Microsoft Port Driver queue is limited only by the available system memory (RAM) and can grow very large, causing large measured latency. ] Can you please elaborate more on that. From SQL Server standpoint, Log Block is size of 64KB and cannot go or grow beyond that as per design. So I want to know how RAM limitation comes into play in regards to your article and statement.
  • Anonymous
    January 01, 2003
    Hello Guys.This blog post does apply both to Windows VMs and for the for the Hyper-V host as well.The special consideration for virtual machines has to do with how VMs calculate time. Because the VMs do not have a hardware real time clock, their time may drift slightly and affect the numbers in the counters. Although this is true, perfmon deals with averages and the results should be useful for a disk performance analysis.The concept described here will be complemented by a blog I am finishing up on how disk queue works. Stay tuned!Thank you all for the comments.Flavio
  • Anonymous
    January 01, 2003
    It would be nice if we could limit disk i/o per process and/or per virtual machine like in vmware..
  • Anonymous
    February 07, 2012
    Interesting post. Is this something that works with Cluster Shared Volumes? For both owner and non-owner node?
  • Anonymous
    February 07, 2012
    Gran bel post! (Great post!)
  • Anonymous
    February 13, 2012
    Nice post!It was short and well written, haven't seen this simple explanation of these perfmon counters. Would love to see similar short post on other important performance counters. The picture added great value, Great work, txs.
  • Anonymous
    February 16, 2012
    The comment has been removed
  • Anonymous
    February 16, 2012
    @Chang,read vpivot.com/.../inaccuracy-of-in-guest-performance-countersSome counters are fine, but time based ones need to be treated with Suspicion.  I was sure there was a document floating around which listed which ones where safe.
  • Anonymous
    April 17, 2012
    what is the latency unit of the counter listed in this article? ms??
  • Anonymous
    May 16, 2012
    Excellent, just what I was looking for thank you.
  • Anonymous
    January 27, 2013
    Good Stuff, Flavio!!  Glad you got learnt something in ELS and sharing it out! :)
  • Anonymous
    April 13, 2013
    The comment has been removed
  • Anonymous
    November 25, 2013
    Nice explanation, short & conclusive.
  • Anonymous
    January 07, 2014
    Pingback from Solving disk latency: Disk or memory based caching? ??? My Virtual Vision
  • Anonymous
    January 07, 2014
    Pingback from Solving disk latency: Disk or memory based caching? ??? My Virtual Vision
  • Anonymous
    January 07, 2014
    Pingback from Solving disk latency: Disk or memory based caching? ??? My Virtual Vision
  • Anonymous
    January 07, 2014
    Pingback from Solving disk latency: Disk or memory based caching? ??? My Virtual Vision
  • Anonymous
    June 29, 2014
    is it OK when maximum value of Avg. Disk sec/Transfer gets over 1s during standard Windows Performamce 1 minute test ??
  • Anonymous
    July 04, 2014
    I am a student and found this interesting. I tried to read the stats but unable to find them on my machine when perfmon is opened. Can you guide me?
  • Anonymous
    November 10, 2014
    Nice explanation about physical disk latency
    I would like to get the total time spend for IO request. In this blog it's clearly explained about time spent below the partition manager level and how we can calculate the total time. In conclusion of blog it was mentioned as "The time the IO spent in queue is added to the disk latency in perfmon" which means it's considering the time above partition manager level. Can we use Avg. Disk sec/Transfer counter which can include all the time spend for IO request?
  • Anonymous
    January 28, 2015
    Hi Flavio,

    You mentioned a few queues (Microsoft, vendor, Hardware queues....). Which queue is related to [Avg. Dsik Queue Length] counter? Microsoft?
  • Anonymous
    February 09, 2015
    may i know what is the amount can be considered has I/O latency then?can i assume we still have to apply the rules any I/O less than 10 miliseconds is considered fine?
  • Anonymous
    December 17, 2015
    Second, MS, Micro Seconds? This is so ridiculous I have to ask.
  • Anonymous
    April 06, 2016
    Thank you Jeff for this ultra clear explanation. The article is really well written and makes you understand how to go about understanding the metrics measured by Perfmon.
  • Anonymous
    September 01, 2016
    Have a query pertaining to Avg. Disk sec/Read counterWhy is it an average.Let us say I am measuring it at 1 second preiodicity, isnt the value absolute. If at all this is called average, is there a max and min as well for the same?Regards,Venkat K
  • Anonymous
    December 01, 2016
    hi,please tell me if i am getting unknown error for service "Average Logical Disk Seconds Per Transfer". what can i do to correct it. and the error is only for 2003 R2 servers. i do monitoring for nagios. please tell me ways to correct in it. is there anything i can change in script or in NSClient ++ folder.thanksneha