3PAR 7400 replacement node - Emulex HBA issue

Posted: Wed Feb 15, 2023 7:31 am
by admkumar
Hello guys,

We are trying to replace a dead 7400 node with a used spare, but the spare gets stuck at boot with this error:
> FC HBA Emulex e200 at 3.0.0 (PCI slot 2) [No Rev Info]
> *** Fatal error: Code 15, sub-code 0x0 (2). hw_node_hba: Slot 2
> Disabled FSBC boot watchdog.

This Emulex adapter card was moved from the dead controller to the spare.
The array is running 3.2.2.
PCI card info reported by the live node:
--------------------PCI Cards--------------------
Node Slot Type -Manufacturer- -Model-- --Serial--
   0    0 SAS  LSI            9205-8e  Onboard
   0    1 FC   EMULEX         LPe12002 Onboard
   0    2 FC   EMULEX         LPE16002 5CE53205RW
   0    3 Eth  Intel          e1000e   Onboard
Revision 30, Firmware version 10.6.248.8

Can you please advise how to solve this so that the new node boots and joins the cluster?

Thanks

Re: 3PAR 7400 replacement node - Emulex HBA issue

Posted: Wed Feb 15, 2023 2:34 pm
by MammaGutt
Did you try removing the HBA in slot 2? Could that be dead?

Re: 3PAR 7400 replacement node - Emulex HBA issue

Posted: Thu Feb 16, 2023 11:21 am
by admkumar
Hi MammaGutt,

Yes, the spare node boots without this HBA, but it does not join the cluster as-is because it has a different s/n and the same nodeID=0.
The live node is 1618669-0 on 3.2.2 and the spare is 1631924-0 on 3.1.3.
I am not 100% sure that the HBA is alive, but it was working a couple of days ago when seated in the old controller, and there is no replacement adapter to test with.
Could this be related to old FW/microcode on the controller?
> Vendor Emulex device 0xe200 in slot 2 not yet qualified.
> FC HBA Emulex e200 at 3.0.0 (PCI slot 2) [No Rev Info]
> *** Fatal error: Code 15, sub-code 0x0 (2). hw_node_hba: Slot 2
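
For anyone collecting console captures from several boot attempts, here is a minimal sketch of pulling the slot and device ID out of lines like the ones above. The patterns are copied only from the messages quoted in this thread, and `scan_console` is a hypothetical helper name, not a 3PAR tool; other firmware releases may word these messages differently.

```python
import re

# Patterns based only on the console lines quoted in this thread (assumption:
# other releases may format these messages differently).
NOT_QUALIFIED = re.compile(
    r"Vendor (?P<vendor>\S+) device (?P<device>0x[0-9a-fA-F]+) "
    r"in slot (?P<slot>\d+) not yet qualified"
)
FATAL = re.compile(
    r"Fatal error: Code (?P<code>\d+), sub-code (?P<sub>0x[0-9a-fA-F]+)"
    r".*?(?P<module>\w+): Slot (?P<slot>\d+)"
)


def scan_console(lines):
    """Return (kind, slot, detail) tuples for lines matching either pattern."""
    findings = []
    for line in lines:
        m = NOT_QUALIFIED.search(line)
        if m:
            findings.append(("not_qualified", int(m["slot"]),
                             f"{m['vendor']} {m['device']}"))
            continue
        m = FATAL.search(line)
        if m:
            findings.append(("fatal", int(m["slot"]),
                             f"code {m['code']} sub-code {m['sub']} in {m['module']}"))
    return findings


console = [
    "> Vendor Emulex device 0xe200 in slot 2 not yet qualified.",
    "> FC HBA Emulex e200 at 3.0.0 (PCI slot 2) [No Rev Info]",
    "> *** Fatal error: Code 15, sub-code 0x0 (2). hw_node_hba: Slot 2",
]
for finding in scan_console(console):
    print(finding)
```

Both matches point at slot 2, which ties the "not yet qualified" warning to the fatal HBA error that follows it.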

Re: 3PAR 7400 replacement node - Emulex HBA issue

Posted: Thu Feb 16, 2023 2:16 pm
by MammaGutt
So you need to fix the other problem first (the spare not joining the cluster).

Did the old node die, or did the old node's disk die?

If the old node died, move its node disk into the spare and see if that helps. You might need to do more ( https://www.3parug.com/viewtopic.php?f=18&t=3173 )

If the old node's disk died, take the node disk from your spare, insert it into the old node, and do a node rescue.

Also, make sure that the new controller is exactly the same type as the failed one.

Re: 3PAR 7400 replacement node - Emulex HBA issue

Posted: Fri Feb 17, 2023 5:27 am
by admkumar
So yes, in the end it was related to old FW/BIOS microcode on the spare.
Booted the spare with the old node's disk and without the HBA, updated the nodeid and s/n, rebooted, and it joined the cluster.
Then pulled out the new node, put the HBA back in, reinserted the node, and it booted successfully and joined the cluster.
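
The mismatch that mattered here was the spare being on 3.1.3 while the live node was on 3.2.2. A minimal sketch of that comparison, assuming plain dotted version strings; `parse_version` and `spare_is_older` are hypothetical helper names, and nothing here queries the array itself.

```python
def parse_version(text):
    """Turn a dotted version such as '3.2.2' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def spare_is_older(live, spare):
    """True when the spare reports an older release than the live node."""
    return parse_version(spare) < parse_version(live)


# Versions quoted earlier in this thread.
live_node, spare_node = "3.2.2", "3.1.3"
if spare_is_older(live_node, spare_node):
    print(f"spare ({spare_node}) is older than live node ({live_node})")
```

Comparing integer tuples rather than raw strings avoids ordering mistakes such as "3.10" sorting before "3.9".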