Create a GPU Accelerated Virtual Machine

Completed

The Microsoft Azure cloud offers many Virtual Machine (VM) sizes with various features. GPU optimized VM sizes are specialized virtual machines available with single, multiple, or fractional GPUs. These sizes are designed for compute-intensive, graphics-intensive, and visualization workloads. In this section, we deploy a GPU Accelerated Virtual Machine to host an installation of the NVIDIA Triton Inference Server.

Create a GPU accelerated virtual machine

Note

Depending on your Azure Subscription configuration, you may not have the capability to deploy a GPU accelerated Virtual Machine. If that's the case, you may still proceed with this module by deploying any CPU based Virtual Machine.

  1. Sign in to the Azure portal and select Create a resource from the menu drop-down on the left.

    The create a resource item is shown selected.

  2. In the resulting screen, look for the virtual machine section, then select create.

    The create a virtual machine item is shown selected.

  3. In the Basics section:

    1. Create a new resource group.

    2. Supply a Virtual machine name that is globally unique.

    3. Choose an appropriate Region.

    4. Leave Availability options at the default setting. No infrastructure redundancy required.

    5. Leave Security type at the default setting, Standard.

    6. For Image select Configure VM generation, then select Generation 1, then Apply. The virtual machine generation options are shown.

      1. Next, select Ubuntu Server 18.04 LTS - Gen1 in the Image section.
    7. Leave Azure Spot instance unchecked.

    8. For Size, if you wish to use a CPU instance then leave this option as-is, to select a GPU accelerated offering select the See all sizes option.

      1. To view available GPU offerings in your subscription, select Add filter then select Family, then select only the GPU option and the list updates. The GPU filter is shown selected.

      2. Select an appropriate N-Series option under Non-premium storage VM sizes. We suggest NC6. Then choose Select. The NC6 virtual machine option is shown selected.

    9. In the Administrator Account section, select Password.

    10. At this point, the selected options should look like the following image.

      The basic options up to this point are shown.

    11. Supply a Username and Password for the Administrator Account.

    12. In the Inbound port rules section, select Allow selected ports, then verify that Select inbound ports shows SSH (22) is enabled.

  4. The remaining options should look like the following image. When you verify that your options look correct, select Review + create.

    The additional basic options up to this point are shown.

    1. The resulting screen should show a Validation passed message, and displays a summary of your virtual machine configuration. If everything looks good, select Create to begin the virtual machine deployment.

      The virtual machine summary and create button are shown.

  5. The deployment creates several resources: the virtual machine, a network security group, and a public ip address. When provisioning is complete, select the Go to resource button or navigate to your virtual machine using the Azure portal.

    The completed deployment options are shown.

  6. On the Overview screen for your virtual machine, copy the Public IP address, and save this value to your text editor of choice. We use this value to access the virtual machine via a terminal emulator such as TeraTerm or Windows Terminal.

    The virtual machine overview screen is shown with the public ip address highlighted.