Last updates: Mon Jan 18 09:27:08 2021
This directory contains releases of a prettyprinter for HTML that has received extensive use in cleaning up and standardizing HTML Web pages, making it much easier to apply other tools, such as awk, that can then do simple pattern matching to extract data of interest, and output that data in other formats.
Although it might appear from release dates that this code is obsolete or unmaintained, that is not the case. The program has been in extensive daily use for more than two decades, and has processed hundreds of millions of lines of HTML. The code stability testifies to its robustness. The code is also portable, and has been successfully built on hundreds of different operating systems in the Unix family, and should be acceptable to any C or C++ compiler.
These packages are available in several functionally-equivalent distribution formats with identical contents: .jar (Java archive), .tar.bz2 (bzip2-compressed UNIX tar archive), .tar.gz (gzip-compressed UNIX tar archive), .tar.lz (lzip-compressed UNIX tar archive), .tar.xz (xz-compressed UNIX tar archive), .tar.zstd (zstd-compressed UNIX tar archive), .zip (InfoZip), and .zoo (zoo), together with companion contents listings ending in -lst. Each unpacks into a subdirectory named by the basename of the distribution file, e.g., htmlpty-1.02.
The shortened name htmlpty is a legacy of inadequate filesystems on certain desktop operating systems: the installed program on Unix-family systems has always been called html-pretty.