O350 Voltage Fault:ATTN: 1.8V low fault limit reached @ 1.396V.
#1
O350 Voltage Fault:ATTN: 1.8V low fault limit reached @ 1.396V.
Hi SGI'ers,

I de-racked my O350 machines and swapped out all the Snaphat Oscillator/Battery packs over the weekend.

Of the 5 Snaphat 3 where dead and 2 had very low voltage so they have all expired in sync.

Having gone to the trouble of pulling out the IO9 boards to swap the batteries (see my blog for pics), I did some shifting of PCI board so all the PCIX boards (Neterion 10GbE, LSI SAS3442X-R SAS/SATA & LSI 4Gbit/sec Fibre Channel) were in the top two PCI slots only to get PCIX benefits).

As a result I pulled an Adapter Firewire 4300 (DM10) out of one machine and put into into another with free slots.

On power up the O350 with additional DM10 board is now reporting a low voltage failure:

>> MXXXXXXX6-001-L2>power up
>> 001c04
>> 001c04 ATTN: 1.8V low fault limit reached @  1.396V.
>> 001c04
>> 001c04 ATTN: brick auto power down in 30 seconds
>> 001c04
>> 001c04 ATTN: brick auto power down in 25 seconds
>> MXXXXXXX6-001-L2>env001c04
>> 001c04 ATTN: brick auto power down in 20 seconds


A check of the environment (via L2) shows the problem on the 1.8v line:

>> MXXXXXXX6-001-L2>env
>> 001c04 ATTN: brick auto power down in 20 seconds
>>
>> 001c04:
>> Environmental monitoring is enabled and running.
>>
>> Description    State      Warning Limits    Fault Limits      Current
>> -------------- ----------  -----------------  -----------------  -------
>>          1.8V      Fault  10%  1.62/  1.98  20%  1.44/  2.16    1.382.      <<===== This one
>>            12V    Enabled  10%  10.80/ 13.20  20%  9.60/ 14.40  12.125
>>        12V #2    Enabled  10%  10.80/ 13.20  20%  9.60/ 14.40  12.063
>>          3.3V    Enabled  10%  2.97/  3.63  20%  2.64/  3.96    3.337
>>        12V IO    Enabled  10%  10.80/ 13.20  20%  9.60/ 14.40  12.125
>>        5V AUX    Enabled  10%  4.50/  5.50  20%  4.00/  6.00    5.070
>>      3.3V AUX    Enabled  10%  2.97/  3.63  20%  2.64/  3.96    3.302
>>    PCI 5V AUX    Enabled  10%  4.50/  5.50  20%  4.00/  6.00    5.096
>>      PCI 3.3V    Enabled  10%  2.97/  3.63  20%  2.64/  3.96    3.337
>>      PCI 2.5V    Enabled  10%  2.25/  2.75  20%  2.00/  3.00    2.509
>>        PCI 5V    Enabled  10%  4.50/  5.50  20%  4.00/  6.00    4.966
>>  XIO 12V BIAS <not present>
>>        XIO 5V <not present>
>>      XIO 2.5V <not present>
>>  XIO 3.3V AUX <not present>
>>  IP59 3.3V AUX    Enabled  10%  2.97/  3.63  20%  2.64/  3.96    3.302
>>    IP59 5V AUX    Enabled  10%  4.50/  5.50  20%  4.00/  6.00    5.070
>>      IP59 12V    Enabled  10%  10.80/ 13.20  20%  9.60/ 14.40  12.063
>>      IP59 VCPU    Enabled  10%  1.14/  1.40  20%  1.02/  1.52    1.297
>>      IP59 SRAM    Enabled  10%  2.25/  2.75  20%  2.00/  3.00    2.483
>>      IP59 1.5V    Enabled  10%  1.35/  1.65  20%  1.20/  1.80    1.495
>>

As this machine has dual Power Supplies I am wondering if the problem is with the Voltage regulator module (VRM) on IP59_4CPU processor board.

Has anyone had any experience with faulty VRM ?

Does this require replacement of VRM or can these be fixed by replacing capacitors or such ?

Thank you for any tips.

Cheers from Oz,


John.
(This post was last modified: 11-16-2020, 05:27 AM by jwhat.)
jwhat
Octane/O350/Fuel User

Trade Count: (0)
Posts: 513
Threads: 29
Joined: Jul 2018
Location: Australia
Find Reply
11-16-2020, 05:18 AM


Messages In This Thread

Forum Jump:


Users browsing this thread: 1 Guest(s)