Nixdoc is gone
#21
RE: Nixdoc is gone
We've been working on it off and on. I can't afford anything until my locksmithing school tuition is over with tbh, then I'll start paying my devs more.

I'm the system admin of this site. Private security technician, licensed locksmith, hack of a c developer and vintage computer enthusiast. 

https://contrib.irixnet.org/raion/ -- contributions and pieces that I'm working on currently. 

https://codeberg.org/SolusRaion -- Code repos I control

Technical problems should be sent my way.
Raion
Chief IRIX Officer

Trade Count: (9)
Posts: 4,256
Threads: 535
Joined: Nov 2017
Location: Eastern Virginia
Website Find Reply
08-23-2023, 01:00 AM
#22
RE: Nixdoc is gone
Hi JPS & Raion,

yes I found out about the pre-rendered cat, which appears to have terminal control (ANSI ??) stuff in it.

There are quite a few "ansi to html" renderers out there, can you confirm that the SGI cat pages are ANSI ?

This seems like a pretty simple case of:

Code:
if (manX dir) {
  do nroff -> html stuff
} else if (catX dir) {
  do ansi -> html stuff
}

Cheers from Oz,


jwhat/John
(This post was last modified: 08-23-2023, 09:14 AM by jwhat.)
jwhat
Octane/O350/Fuel User

Trade Count: (0)
Posts: 513
Threads: 29
Joined: Jul 2018
Location: Australia
Find Reply
08-23-2023, 09:12 AM
#23
RE: Nixdoc is gone
They're not. It's most than just putting up HTML and I didn't find any out of the box solutions.

I'm the system admin of this site. Private security technician, licensed locksmith, hack of a c developer and vintage computer enthusiast. 

https://contrib.irixnet.org/raion/ -- contributions and pieces that I'm working on currently. 

https://codeberg.org/SolusRaion -- Code repos I control

Technical problems should be sent my way.
Raion
Chief IRIX Officer

Trade Count: (9)
Posts: 4,256
Threads: 535
Joined: Nov 2017
Location: Eastern Virginia
Website Find Reply
08-23-2023, 10:22 AM
#24
RE: Nixdoc is gone
HI Raion,

I can see from looking at the uncompressed file all the codes (using vi) and it is un-readable, but when you cat the same file in xterm window it renders fine, so it must be something pretty standard...

Doing some searchs on ANSI -> HTML engines now to see how its looks piped through one of these.

EDIT: Tried:
  -- aha - rubbish default rendering, but if you play with options it looks better, so maybe starting point (no hyperlinks)
  -- ansi2html (shell script + gawk + gsed) - renders page simply and faithfully, no hyperlinks
  -- lots of go, python, ruby, rust and other variants, but I don't want to download a GB of stuff just to do something simple, where did the C programmers go ;-)


Cheers from Oz,

jwhat/John
(This post was last modified: 09-01-2023, 11:23 PM by jwhat.)
jwhat
Octane/O350/Fuel User

Trade Count: (0)
Posts: 513
Threads: 29
Joined: Jul 2018
Location: Australia
Find Reply
08-23-2023, 10:48 AM
#25
RE: Nixdoc is gone
HI Raion,

I may have saved you some money/time on programming effort ;-)

I have been testing with "sh" man page and it is rendering quite nicely with "dummy" URL links (as I have not got driver programming going yet).
This is a good test case, as it long entry and has a lot of links which are presented in different ways.

To get "nice" output I used, pretty standard tools (on FreeBSD):
- ul - underline cleanup
- aha - ANSI HTML Adaptor
- xstlproc - to do a little bit of XML/HTML style sheet cleaning up
- sed - to add HTML links into the page.

The result is a page that renders on par with original SGI Infosearch: https://www.graphica.com.au/sh.ul.aha.xslt.sed.html

I have put code here: https://github.com/zebity/freebsd-doc/tree/main/IRIX

What needs to be done is:
1. Write a driver script that:

Code:
- Looks to see if it has pre-rendered HTML page and
    if so serve it up
  Otherwise
    Find requested page in IRIX cat/man directories
    Run through HTML rendering pipeline
    Save result to cached pages
    Go back to start ...


Now that all the futzing around with the IRIX formating is solved this should be pretty easy.
Would you be able to host pages ?

Cheers from Oz,


John.
(This post was last modified: 09-01-2023, 11:25 PM by jwhat.)
jwhat
Octane/O350/Fuel User

Trade Count: (0)
Posts: 513
Threads: 29
Joined: Jul 2018
Location: Australia
Find Reply
08-26-2023, 05:31 PM
#26
RE: Nixdoc is gone
We need to do it in PHP, and I want the pages indexed in a db which makes it more difficult. I'm probably going to just restart the code base when I have the money completely all over because the guy who was doing it didn't really follow what I asked for. What I ideally want is a PHP script and a cron that will collectively:

1. Take the awf formatted pages and put them into a database.

2. Index them into a comprehensive search.

3. Preserve all formatting and output it.

I'm not looking for an out of the box solution because nothing out there really is going to work for what I was going for. I don't really have any more time to mess with this. Locksmithing school ends this week and maybe I'll have more time after that but no guarantees.

I'm the system admin of this site. Private security technician, licensed locksmith, hack of a c developer and vintage computer enthusiast. 

https://contrib.irixnet.org/raion/ -- contributions and pieces that I'm working on currently. 

https://codeberg.org/SolusRaion -- Code repos I control

Technical problems should be sent my way.
Raion
Chief IRIX Officer

Trade Count: (9)
Posts: 4,256
Threads: 535
Joined: Nov 2017
Location: Eastern Virginia
Website Find Reply
08-26-2023, 06:09 PM
#27
RE: Nixdoc is gone
Apologies for the abruptness of my last one but what I'm trying to do here especially with my new disposable income that will be around is making sure that we can create tools that are reusable but yet aren't as basic as say what the bsds or others are using. With your method we could easily create an IRIX 6.5.x manpage set, but I want to go beyond that and cover all major versions.

To that end I have to be aware of and account for those things. This will cost money and time but ultimately I think the end result will be much better than trying to use an out-of-the-box tool that was not designed to work with what we are looking at here. Especially since the Cartman paths vary from version to version

I'm the system admin of this site. Private security technician, licensed locksmith, hack of a c developer and vintage computer enthusiast. 

https://contrib.irixnet.org/raion/ -- contributions and pieces that I'm working on currently. 

https://codeberg.org/SolusRaion -- Code repos I control

Technical problems should be sent my way.
Raion
Chief IRIX Officer

Trade Count: (9)
Posts: 4,256
Threads: 535
Joined: Nov 2017
Location: Eastern Virginia
Website Find Reply
08-28-2023, 09:05 PM
#28
RE: Nixdoc is gone
Yeah after looking over the codebase we're starting over. Sucks, but that's life for you. I'll advise when work begins and link the repo.

I'm the system admin of this site. Private security technician, licensed locksmith, hack of a c developer and vintage computer enthusiast. 

https://contrib.irixnet.org/raion/ -- contributions and pieces that I'm working on currently. 

https://codeberg.org/SolusRaion -- Code repos I control

Technical problems should be sent my way.
Raion
Chief IRIX Officer

Trade Count: (9)
Posts: 4,256
Threads: 535
Joined: Nov 2017
Location: Eastern Virginia
Website Find Reply
08-30-2023, 01:14 AM
#29
RE: Nixdoc is gone
Hi Raion,

np, I have been slowly getting HTML render cleaned up.

I believe to make this searchable simplest way is to feed in via web based HTML pages.

I have now got these cleaned up by removing extra headers/footers/empty lines & links would work if pages were on web server.

I stopped doing via sed/awk and just did the final cleanup via perl (awk’s regex does not cut it).

Here is link to new cleaned up sh man page (as example): https://www.graphica.com.au/sh.ul.aha.xslt.perl.html

Processing pipeline is now: cat | ul | aha | xslt | perl

Using what I have posted on github, you could just do "find" of all cat.z pages and feed them into the "irix-catman2html.sh" script to generate a full set of html cross-referenced pages...

Once the pages (via web server) are available you can point an indexing/search engine at them to get fully searchable result.

That is my simple view of how to get something ASAP.

BTW: What is awf ?

Cheers from Oz,

jwhat/john
(This post was last modified: 09-01-2023, 11:27 PM by jwhat.)
jwhat
Octane/O350/Fuel User

Trade Count: (0)
Posts: 513
Threads: 29
Joined: Jul 2018
Location: Australia
Find Reply
08-31-2023, 08:17 AM
#30
RE: Nixdoc is gone
The format of IRIX manpage is awf

I'm the system admin of this site. Private security technician, licensed locksmith, hack of a c developer and vintage computer enthusiast. 

https://contrib.irixnet.org/raion/ -- contributions and pieces that I'm working on currently. 

https://codeberg.org/SolusRaion -- Code repos I control

Technical problems should be sent my way.
Raion
Chief IRIX Officer

Trade Count: (9)
Posts: 4,256
Threads: 535
Joined: Nov 2017
Location: Eastern Virginia
Website Find Reply
08-31-2023, 11:38 AM


Forum Jump:


Users browsing this thread: 1 Guest(s)