Warning: Not all InfiniBand HCAs have a PSID
If you read the documents Mellanox posted about their new 2.0.5 build 4453 InfiniBand drivers, you may have noticed the advice to update your firmware. To do that, you will need to discover your PSID. This should be pretty straightforward: install the drivers, then use the HPC Management console's Run a Command feature to run vstat on the node you wish to update. If you are lucky, you’ll see something like this:
NODE-08 -> Finished
-------------------------------------------------------------------------------------------------
hca_idx=0
uplink={BUS=PCI_E, SPEED=2.5 Gbps, WIDTH=x8, CAPS=2.5*x8}
vendor_id=0x08f1
vendor_part_id=0x6278
hw_ver=0xa0
fw_ver=4.08.0200
PSID=VLT0040010001
node_guid=0008:f104:0399:2054
num_phys_ports=2
port=1
port_state=PORT_ACTIVE (4)
link_speed=5.0 Gbps (2)
link_width=4x (2)
rate=20 Gbps
port_phys_state=LINK_UP (5)
active_speed=5.0 Gbps (2)
sm_lid=0x0001
port_lid=0x0009
port_lmc=0x0
max_mtu=2048 (4)
port=2
port_state=PORT_DOWN (1)
link_speed=NA
link_width=NA
rate=NA
port_phys_state=POLLING (2)
active_speed=2.5 Gbps (1)
sm_lid=0x0000
port_lid=0x0000
port_lmc=0x0
max_mtu=2048 (4)
If, like me, you are unlucky, the output will have no PSID line at all, like this:
NODE-07 -> Finished
--------------------------------------------------------------------------------------
hca_idx=0
uplink={BUS=PCI_E, SPEED=2.5 Gbps, WIDTH=x8, CAPS=2.5*x8}
vendor_id=0x066a
vendor_part_id=0x6274
hw_ver=0xa0
fw_ver=0x100020000
node_guid=0006:6a00:9800:f356
num_phys_ports=1
port=1
port_state=PORT_ACTIVE (4)
link_speed=5.0 Gbps (2)
link_width=4x (2)
rate=20 Gbps
port_phys_state=LINK_UP (5)
active_speed=5.0 Gbps (2)
sm_lid=0x0001
port_lid=0x000a
port_lmc=0x0
max_mtu=2048 (4)
If this happens to you, don’t waste time trying to find the PSID of your HCA. As soon as I find a way to pick the right firmware upgrade for my own HCAs without PSIDs, I’ll post how to do it. Until then, you and I must run on whatever firmware we already have.
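If you have a lot of nodes to check, you don't have to eyeball every vstat dump. Here is a minimal sketch in Python that scans saved vstat output and reports which HCAs have no PSID line. It assumes the output looks like the two samples above (one key=value pair per line, each HCA starting with hca_idx). The function names parse_vstat and hcas_without_psid are my own inventions, not part of any Mellanox or HPC tool, and I have only tried the idea against output shaped like mine.

```python
def parse_vstat(text):
    """Return one dict of HCA-level key=value fields per HCA found in vstat output.

    Assumption: each HCA block starts with an 'hca_idx=' line, and the
    per-port fields start at the first 'port=' line. Port fields are skipped.
    """
    hcas = []
    current = None
    in_ports = False
    for raw in text.splitlines():
        key, sep, value = raw.strip().partition("=")
        if not sep:
            # Lines such as 'NODE-07 -> Finished' or the dashed rule have no '='.
            continue
        key = key.strip()
        value = value.strip()
        if key == "hca_idx":
            current = {}
            hcas.append(current)
            in_ports = False
        elif key == "port":
            in_ports = True
        if current is not None and not in_ports:
            current[key] = value
    return hcas


def hcas_without_psid(text):
    """Return the parsed HCAs that have no PSID field."""
    return [hca for hca in parse_vstat(text) if "PSID" not in hca]


if __name__ == "__main__":
    # Trimmed-down copy of the NODE-07 output shown above.
    sample = """NODE-07 -> Finished
hca_idx=0
vendor_id=0x066a
vendor_part_id=0x6274
fw_ver=0x100020000
node_guid=0006:6a00:9800:f356
num_phys_ports=1
port=1
port_state=PORT_ACTIVE (4)
"""
    for hca in hcas_without_psid(sample):
        print("No PSID on HCA", hca["hca_idx"], "node_guid", hca["node_guid"])
```

Paste each node's vstat output into a string (or read it from a file) and pass it to hcas_without_psid. Anything it returns is an HCA in the same boat as mine.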
Sorry,
Frankie