Onyx IR issues -
mosiniak - 06-10-2020
Hi,
I have some issues with my IR graphics. At least one problem is certain: there is no output signal. I ran irsaudit and got errors, but I'm not sure where the problem lies or what it is related to.
First I ran gfxinfo and irsaudit without any options: gfxinfo produced strange output and irsaudit reported an error on the DG. Then I ran irsaudit with the -ge, -rm and -dg options and got an error each time. During irsaudit -rm there was an error and the Onyx restarted. What do you think: are all these errors connected, with a single cause such as the DG board?
I attached all the results from irsaudit, gfxinfo and hinv.
Code:
/usr/gfx/gfxinfo
Graphics board 0 is "KONAS" graphics.
Unmanaged 104x38555
Display has 2623476 channels
88 GEs (of 803405443), occmask = 0x7ec
GEs BEF ram, 16bit path
205583996 RM7 boards (of 2024) 68/7ec/2ce29759/68
Texture Memory: ///
boards pixel depth
0K cmap
GPI interface device detected
Xvc info not available for unmanaged boards
Code:
hinv
2 196 MHZ IP25 Processors
CPU: MIPS R10000 Processor Chip Revision: 2.5
FPU: MIPS R10010 Floating Point Chip Revision: 2.5
Main memory size: 512 Mbytes, 2-way interleaved
Instruction cache size: 32 Kbytes
Data cache size: 32 Kbytes
Secondary unified instruction/data cache size: 1 Mbyte
Integral SCSI controller 0: Version WD33C95A, single ended, revision 0
Integral SCSI controller 1: Version WD33C95A, differential, revision 0
Disk drive: unit 1 on SCSI controller 1
Integral SCSI controller 90: Version WD33C95A, differential, revision 0
Integral SCSI controller 91: Version WD33C95A, differential, revision 0
Integral EPC serial ports: 8
Integral EPC parallel port: Ebus slot 11
Integral EPC parallel port: Ebus slot 9
Graphics board: InfiniteReality
Integral Ethernet controller: et0, Ebus slot 11
I/O board, Ebus slot 11: IO4 revision 1
I/O board, Ebus slot 9: IO4 revision 1
VME bus: adapter 45
VME bus: adapter 0 mapped to adapter 45
EPC external interrupts
RE: Onyx IR issues -
jan-jaap - 06-10-2020
You have some truly insane numbers in your 'gfxinfo' output. 88 GE's, 205583996 RM's, 2623476 display channels ...
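A quick way to make that sanity check repeatable is to pull the counts out of the gfxinfo text and compare them with what one pipe can physically hold. This is only a sketch: the function names and the limits (8 channels, 4 GEs, 4 RMs) are assumptions for illustration, read off the irsaudit trace later in this thread rather than taken from SGI documentation.

```python
import re

# Assumed upper bounds for one InfiniteReality pipe (illustrative only):
# 8 display channels, 4 GEs, 4 RM boards.
LIMITS = {"channels": 8, "ges": 4, "rms": 4}

# Patterns for the three count lines that gfxinfo prints.
PATTERNS = {
    "channels": re.compile(r"Display has (\d+) channels?"),
    "ges": re.compile(r"^\s*(\d+) GEs? \(of"),
    "rms": re.compile(r"^\s*(\d+) RM\d+ boards? \(of"),
}


def parse_gfxinfo(text):
    """Extract channel, GE and RM counts from gfxinfo output."""
    counts = {}
    for line in text.splitlines():
        for key, pattern in PATTERNS.items():
            match = pattern.search(line)
            if match:
                counts[key] = int(match.group(1))
    return counts


def implausible(counts, limits=LIMITS):
    """Return the sorted keys whose count is zero or above the assumed limit."""
    return sorted(k for k, v in counts.items() if v == 0 or v > limits[k])
```

Fed the gfxinfo output from the first post, `implausible(parse_gfxinfo(text))` flags all three fields, which matches the impression that the driver is reading garbage rather than real board data.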
IR stores some config data in an EEPROM on the GE board. I'd check its sanity first with ireeprom:
https://wiki.preterhuman.net/InfiniteReality:_The_Graphics_Engine_EEPROM_Commands
Also, IR requires the very last revision VCAM on the IO4. It must be part # 030-0500-206
RE: Onyx IR issues -
mosiniak - 06-10-2020
(06-10-2020, 02:54 PM)jan-jaap Wrote: You have some truly insane numbers in your 'gfxinfo' output. 88 GE's, 205583996 RM's, 2623476 display channels ...
No one has such a powerful system
(06-10-2020, 02:54 PM)jan-jaap Wrote: IR stores some config data in an EEPROM on the GE board. I'd check its sanity first with ireeprom: https://wiki.preterhuman.net/InfiniteReality:_The_Graphics_Engine_EEPROM_Commands
I ran ireeprom -i and got a disappointing result (see below). On the page you mentioned, I read: "The graphics engine (GE) in Silicon Graphics SGI Onyx2 and SGI Onyx3 Infinite Reality (IR) stores the microcode in EEPROM". Do you know if this also applies to the Onyx1?
Code:
./ireeprom -i
==== Pipe 0 ====
graphics not responding... trying again
graphics not responding... trying again
graphics not responding...loading ucode
ucode load complete...
Cannot communicate with graphics board
(06-10-2020, 02:54 PM)jan-jaap Wrote: Also, IR requires the very last revision VCAM on the IO4. It must be part # 030-0500-206
My IO4 is 030-0815-00x and the VCAM is 030-0500-206, so that is not the cause.
RE: Onyx IR issues -
ghost180sx - 06-14-2020
I expect you have tried removing the DG board and re-seating it? Are there any cables or connector boards that need to be checked?
Was this system working the last time you started it and this happened while it was sitting?
RE: Onyx IR issues -
mosiniak - 06-15-2020
(06-14-2020, 05:21 AM)ghost180sx Wrote: I expect you have tried removing the DG board and re-seating it? Are there any cables or connector boards that need to be checked?
Yes, I removed all the boards and very carefully inspected them, their connectors and the connectors on the backplane (I was looking for bent pins). Everything was OK. I took photos and will upload them soon. I also checked the triangle bus connector and it looks OK.
(06-14-2020, 05:21 AM)ghost180sx Wrote: Was this system working the last time you started it and this happened while it was sitting?
No, I bought this system in December 2016. First, I kept it warm (at room temperature) for 6 months (to get rid of the stale smell), and then disassembled and cleaned the whole system (there were a lot of dead insects and spiders inside). After I turned on the power the system worked (there was serial console output), but I could not get video output (there was a problem with the monitor). Some time later I bought a SOG-capable monitor, but there was still no video output. There was also a hard disk failure, and later I bought a new IBM HVD disk. In May 2020, I installed IRIX and ran irsaudit to see whether something was wrong.
irsaudit can detect hardware:
Code:
TEST sysprobe JTAG System Probe
TRCE GE12 board has 4 GE11s.
INFO GE board is a GE12.
TRCE BEF has 4MB of memory.
TRCE GPI cable for Sirius video option not present (ok).
TRCE RM #0 is fully populated RM6.
TRCE RM #0 has 64MB (1Mx16 parts) of texture memory.
TRCE RM #1 is fully populated RM6.
TRCE RM #1 has 64MB (1Mx16 parts) of texture memory.
TRCE RM #2 not detected via JTAG.
TRCE RM #3 not detected via JTAG.
TRCE DG has 8 VOC configuration.
TRCE No DG option board claimed by board ID.
TRCE DG has 32K colormap RAMs.
TRCE DG has 64K external gamma/deep colormap RAM.
TRCE PAB2 or DVP option not detected.
RSLT sysprobe PASS
TEST nicprobe NIC Board Probe
TRCE Reading ARM ucode from: /usr/diags/IR/ucode/arm1_nic.u
TRCE GE NIC #: 0000.0018.8473 (family: 0b)
TRCE Serial #: GGJ232
TRCE Part #: 030-0681-003
TRCE rev_code: D
TRCE name: GE12-4
NIC# PNGE-030-0681-003 SN-GGJ232 (0000.0018.8473)
TRCE RM0 NIC #: 0000.000a.4cf8 (family: 0b)
TRCE Serial #: CLX422
TRCE Part #: 030-0684-004
TRCE rev_code: F
TRCE name: RM6-64
NIC# PNRM-030-0684-004 SN-CLX422 (0000.000a.4cf8)
TRCE RM1 NIC #: 0000.000e.a707 (family: 0b)
TRCE Serial #: FDS273
TRCE Part #: 030-0684-004
TRCE rev_code: F
TRCE name: RM6-64
NIC# PNRM-030-0684-004 SN-FDS273 (0000.000e.a707)
TRCE DG NIC #: 0000.0014.17df (family: 0b)
TRCE Serial #: EAA991
TRCE Part #: 030-0687-004
TRCE rev_code: E
TRCE name: DG4-8
NIC# PNDG-030-0687-004 SN-EAA991 (0000.0014.17df)
RSLT nicprobe PASS
but gfxinfo can't:
Code:
Graphics board 0 is "KONAS" graphics.
Unmanaged 0x0
Display has 0 channel
0 GE (of 0), occmask = 0x00
Internal BEF ram, 32bit path
0 RM6 board (of 0) 0/0/0/0
Texture Memory: -/-/-/-
Small pixel depth
32K cmap
Xvc info not available for unmanaged boards
Any ideas what I could be doing wrong?
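For comparing boards between machines, or against a parts list, the nicprobe summary lines in the trace above can be reduced to a small inventory. A minimal sketch, assuming the `NIC# PN<board>-<part> SN-<serial> (<nic>)` layout shown in that trace holds for every board; the function name and dictionary keys are made up for illustration.

```python
import re

# Matches the one-line summaries that irsaudit's nicprobe prints, e.g.
# "NIC# PNGE-030-0681-003 SN-GGJ232 (0000.0018.8473)".
NIC_LINE = re.compile(r"NIC# PN([A-Z]+)-([\d-]+) SN-(\w+) \(([0-9a-f.]+)\)")


def nic_inventory(trace):
    """Return one dict per board found in the nicprobe summary lines."""
    return [
        {"board": board, "part": part, "serial": serial, "nic": nic}
        for board, part, serial, nic in NIC_LINE.findall(trace)
    ]
```

Run on the trace above, this should yield four entries (one GE, two RM, one DG), which is the hardware that gfxinfo fails to report.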
RE: Onyx IR issues -
jan-jaap - 06-16-2020
It's possible that you need to run ireeprom with graphics running (/usr/gfx/startgfx). Can't try right now -- I'm using the screen elsewhere...
irsaudit definitely needs to be run with exclusive access to the graphics hardware (/usr/gfx/stopgfx).
Both ireeprom and irsaudit work with all IR revisions, not only IR2 / IR3. They have been in IRIX since IRIX 6.2.
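Since it is easy to forget to bring graphics back up after a diagnostic run, the stop/run/start sequence can be wrapped so that startgfx always runs afterwards. This is only a sketch: the two /usr/gfx paths come from this thread, while the function name and the `run` parameter are hypothetical, and the diagnostic command is whatever the caller passes in.

```python
import subprocess

STOPGFX = "/usr/gfx/stopgfx"    # path as used in this thread
STARTGFX = "/usr/gfx/startgfx"  # path as used in this thread


def run_with_gfx_stopped(diag_cmd, run=subprocess.call):
    """Stop graphics, run diag_cmd (an argument list), then restart graphics.

    Returns the exit status of diag_cmd. `run` is injectable so the
    sequence can be dry-run without touching the hardware.
    """
    run([STOPGFX])
    try:
        return run(diag_cmd)
    finally:
        # Restart graphics even if the diagnostic raises an exception.
        run([STARTGFX])
```

Passing a fake `run` that only records its argument shows the order of calls without executing anything; on the real machine the caller would pass the irsaudit command line of their choice.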
RE: Onyx IR issues -
mosiniak - 06-16-2020
(06-16-2020, 07:29 AM)jan-jaap Wrote: It's possible that you need to run ireeprom with graphics running (/usr/gfx/startgfx). Can't try right now -- I'm using the screen elsewhere...
I will check it. I didn't try running ireeprom with graphics running.
(06-16-2020, 07:29 AM)jan-jaap Wrote: irsaudit definitely needs to be run with exclusive access to the graphics hardware (/usr/gfx/stopgfx).
I already noticed that.
(06-16-2020, 07:29 AM)jan-jaap Wrote: Both ireeprom and irsaudit work with all IR revisions, not only IR2 / IR3. They have been in IRIX since IRIX 6.2.
Thanks for confirming that.
RE: Onyx IR issues -
mosiniak - 06-18-2020
(06-16-2020, 07:29 AM)jan-jaap Wrote: It's possible that you need to run ireeprom with graphics running...
You were right. With graphics running, the ireeprom -i output is better, but there is still a problem.
Code:
/usr/gfx/stopgfx
/usr/gfx/KONA/bin/ireeprom -i
==== Pipe 0 ====
graphics not responding... trying again
graphics not responding... trying again
graphics not responding...loading ucode
ucode load complete...
Cannot communicate with graphics board
/usr/gfx/startgfx
Sent SIGHUP to xdm
IRIS 8# IR0: ARM: Welcome to ARMLand - 0/0x0d00
IR0: ARM: running...(sherwood-root 0310062200)
IR0: ARM: ******************************************************
IR0: ARM: * InfiniteReality/Reality Software, IRIX 6.5 release *
IR0: ARM: ******************************************************
/usr/gfx/KONA/bin/ireeprom -i
==== Pipe 0 ====
diag-map: Resource busy
WARNING: IR0: mopup timeout.
WARNING: IR0: Fatal error. bdata = 0xa800000000970000 death_reason = 17
Graphics FIFO depth is 26 words.
I have an idea to move the IO4+VCAM from the rack to the deskside machine with working RE2 graphics to see whether the IO4+VCAM are OK. Does that make sense?
I also have another crazy idea: I have an Onyx deskside system with RE2 graphics and a KONA-ready backplane. I could move the IR set to this machine (IO4+VCAM+GE+RM+DG), but there is a problem with the power modules. IR graphics requires a 3.3V supply (RE2 requires 5V). The deskside system has room for narrow (303) power modules, but my rack system has wide (303x2) modules. What do you think: is it possible to insert a wide module into the deskside system and leave the cover open for a short time for testing?
What else could I check to find out what is going on?
RE: Onyx IR issues -
CB_HK - 06-18-2020
(06-18-2020, 10:26 AM)mosiniak Wrote: I also have another crazy idea: I have an Onyx deskside system with RE2 graphics and a KONA-ready backplane. I could move the IR set to this machine (IO4+VCAM+GE+RM+DG), but there is a problem with the power modules. IR graphics requires a 3.3V supply (RE2 requires 5V). The deskside system has room for narrow (303) power modules, but my rack system has wide (303x2) modules. What do you think: is it possible to insert a wide module into the deskside system and leave the cover open for a short time for testing?
Alright, take this with a grain of salt (and some caution). I have read that yes, you can in fact install the 303x2 modules in the deskside system, and that the back plate and cover have to be left off to do this. HOWEVER, I would wait until someone else either confirms this or says not to do it.
I'm looking for the post where I saw this written, but someone has tried this successfully before. If you're just doing it to test the boards, I don't see it being an issue (specifically the rear of the machine being exposed).