=====================Start Part 4 of 4 parts==================== Michael Sperberg-McQueen University of Illinois at Chicago; David G. Durand (dgd@cs.bu.edu) Boston University Computer Science. The Exeter group and David Durand are keeping lists of bug reports, modifications, etc. If you are in the UK and have access to JANET, you may use blue book file transfer to obtain the files from uk.ac.exeter.sgml1; use the following address and protocol: uk.ac.exeter.sgml1 username:sgmlbox, password:sgmlbox. If you have any problems contact sgml@uk.ac.exeter, or phone (+44) 392 263946, or fax (+44) 392 211630. Exeter plans to have anonymous-FTP support by the end of calendar 1991. The summary above was adapted from a public posting by Paul Ellison to TEI-L, 18-July-1991. Fuller description may be found in the associated README files on the relevant file servers, and in "SGML Parser Materials Donated to [the] SGML Users' Group," SGML Users' Group Newsletter 20 (September 1991) 3-4. The ARC-SGML parser materials are available via anonymous FTP from several sites, typically under a directory ARC-SGML: Host sgml1.ex.ac.uk (144.173.6.61) Or uk.ac.exeter.sgml1 on JANET - see <113> above Location /sgmlbox Host mailer.cc.fsu.edu (128.186.6.103) Location: /pub/sgml/ARC-SGML Host ftp.ifi.uio.no (129.240.88.1) Location: /pub/SIGhyper/ARC-SGML/distrib Host src.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml DIRECTORY rwxr-xr-x 512 Dec 13 05:03 ARC-SGML <120> UNIX Port of ARC-SGML, by James Clark The sources included in ARC-SGML have been modified to facilitate compiling under UNIX (James Clark); the sources and patches are available on several anonymous-FTP hosts. The filename is typically: arcsgml-1.0jclark.tar.Z or something similar. Host mailer.cc.fsu.edu (128.186.6.103) Location: /pub/sgml/ARC-SGML.UNIX FILE rw-r--r-- 289727 Aug 5 11:19 arcsgml-1.0jclark.tar.Z Host src.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml/ARC-SGML.UNIX FILE rw-r--r-- 289727 Aug 5 11:19 arcsgml-1.0jclark.tar.Z Host nic.funet.fi (128.214.6.100) Location: /pub/languages/sgml FILE rw-rw-r-- 289727 Aug 4 16:17 arcsgml-1.0jclark.tar.Z Host ftp.ifi.uio.no (129.240.88.1) <121> 'SGMLS' Version of ARC-SGML, by James Clark "A beta-test version of sgmls, a SGML parser derived from the ARCSGML parser materials, is now available for anonymous ftp. . . Sgmls outputs a simple, line oriented, ASCII representation of a SGML document's Element Structure Information Set, which can easily be parsed by awk, perl, C or whatever. The idea is that sgmls can be used as the front end for a structure-controlled SGML application. It is primarily intended for Unix systems, but it works also on MS- DOS. I've tested it on the following architecture/OS/compiler combinations: sparc/SunOS 4.1.1/cc, sparc/SunOS 4.1.1/gcc, 386/SVR3.2/cc, 286/MS-DOS/Borland C++ 2.0. It should be straightforward to port to most systems that have 8-bit bytes and a character set consistent with ISO 646 IRV. (The beta version of sgmls (arguably) contains minor bugs which prevent it from being fully conforming, but the author "aims to make sgmls fully conforming to ISO 8879.") Adapted from a posting by James Clark: jjc@jclark.com (James Clark) to Usenet Newsgroup comp.text.sgml; Subject: sgmls available; Date: 28- Oct-91. Provisionally (December 1991) 'sgmls' is available on several anon-FTP hosts: Host ftp.ifi.uio.no (129.240.88.1) Location: /SIGhyper/ARC-SGML/jclark FILE r--r--r-- 757760 Oct 28 18:23 sgmls-0.3.tar Host mailer.cc.fsu.edu (128.186.6.103) Location: /pub/sgml/jclark FILE rw-r--r-- 249967 Oct 29 11:35 sgmls-0.3.tar.Z Host utsun.s.u-tokyo.ac.jp (133.11.11.11) Location: /misc/sgml FILE rw-r--r-- 249967 Oct 31 11:46 sgmls-0.3.tar.Z Host src.doc.ic.ac.uk (146.169.3.7) Location: /other3/text/sgml/jclark FILE r--r--r-- 249967 Oct 29 11:35 sgmls-0.3.tar.Z Host isfs.kuis.kyoto-u.ac.jp (130.54.20.1) Location: /ftpmail/utsun.s.u-tokyo.ac.jp/misc/sgml FILE rw-rw-r-- 249967 Dec 19 19:31 sgmls-0.3.tar.Z Host puffin.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml/jclark FILE r--r--r-- 249967 Oct 29 11:35 sgmls-0.3.tar.Z <122> Macintosh Port of ARC-SGML, by Wally Wedel According to a posting to Usenet Newsgroup comp.text.sgml by Wally Wedel (31-December-1991), a version of ARC-SGML ported to the Macintosh platform is available via anonymous-FTP from the site rascal.ics.utexas.edu. The path to 'rascal' is not clearly indicated, but the archive files are available on several other servers, as listed below. "I have recently finished porting ARC SGML 1.0 as released by the SGML Users' Group to the Apple Macintosh environment. This archive is being released to the InterNet community on an as-is, no-support basis. The Compactor archive has been submitted to rascal.ics.utexas.edu. . ." "ARC SGML 1.0 for the Macintosh: This archive is a port of an SGML parser called ARC SGML to the Apple Macintosh environment. Original sources of this parser are available from the SGML Users Group and various FTP sites around the world. Users should read the document LICENSE.DOC in the docs directory to determine the legal status of the original code. My modifications impose no further restrictions on use. This code is supplied on an as-is basis with no warranties of any kind. . ." "This code does contain errors. James Clark has caught and fixed some in his SGMLS code for UNIX. There are no doubt more. Moreover this code is not for the beginner or the timid. A copy of the SGML Handbook by your side when working with it is highly recommended. . .This port has been a 'spare time' project to take a close look at the implementation. I have no plans for further enhancements in spite of the appearance of the todo list in the docs directory." "This archive unpacks a folder which I maintain inside my MPW Folder hierarchy. I build the Think C version by copying the vm.9 project and the sources from sgmlc and sgmlh into a folder named ARC SGML in my Think C Development folder. . . A good place to start reading about this port is the document 'Macintosh port' in the docs directory." (Wally Wedel 12/30/91. Archive Name: ARC SGML 1.0.sea) Contact: Wally Wedel; U S WEST Advanced Technologies; 4001 Discovery Drive, Suite 390; Boulder, CO 80303 USA; Email (Internet): wwedel@uswest.com; AppleLink: D5100; TEL: (1 303) 541-6052 Anonymous-ftp locations for ARC-SGML.MAC are (or were -- use ARCHIE): Host mailer.cc.fsu.edu (128.186.6.103) Location: /pub/sgml/ARC-SGML.MAC FILE rw-r--r-- 461687 Jan 7 14:25 arc-sgml-10.hqx Host sumex-aim.stanford.edu (36.44.0.6) Location: /info-mac/app FILE rw-r--r-- 461687 Jan 3 12:27 arc-sgml-10.hqx Host nro.cs.athabascau.ca (131.232.1.1) Location: /wuarchive/mirrors/info-mac/app FILE r--r--r-- 461687 Jan 3 12:27 arc-sgml-10.hqx Host lth.se (130.235.16.3) Location: /pub/mac/info-mac/app FILE rw-r--r-- 461687 Jan 5 01:18 arc-sgml-10.hqx Host src.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml/ARC-SGML.MAC FILE r--r--r-- 404959 Jan 7 14:25 arc-sgml-10.hqx.Z Location: /micros/mac/sumex/app FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z Location: /mac/sumex/app FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z Location: /micros/mac/umich/etc/compsci FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z Location: /mac/umich/etc/compsci FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z Host puffin.doc.ic.ac.uk (146.169.3.7) Location: /text/sgml/ARC-SGML.MAC FILE r--r--r-- 404959 Jan 7 14:25 arc-sgml-10.hqx.Z Location: /micros/mac/sumex/app FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z Location: /mac/sumex/app FILE r--r--r-- 404959 Jan 3 12:27 arc-sgml-10.hqx.Z Location: /micros/mac/umich/etc/compsci FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z Location: /mac/umich/etc/compsci FILE r--r--r-- 391587 Jan 5 07:06 arcsgml1.0.cpt.hqx.Z Host sics.se (192.16.123.90) Location: /pub/info-mac/app FILE rw-r--r-- 461687 Jan 8 05:14 arc-sgml-10.hqx Host shark.mel.dit.csiro.au (144.110.16.11) Location: /info-mac/app FILE rw-r--r-- 461687 Jan 4 10:32 arc-sgml-10.hqx Host ditmela.mel.dit.csiro.au (144.110.16.11) Location: /info-mac/app FILE rw-r--r-- 461687 Jan 4 10:32 arc-sgml-10.hqx Host wuarchive.wustl.edu (128.252.135.4) Location /mirrors/info-mac/app FILE -r--r--r-- archive 461687 Jan 3 06:27 arc-sgml-10.hqx <123> Amsterdam (Free University) Parser A description of the Amsterdam Parser (ASP) is found in the article cited sub section 1 above, <45>. "The Amsterdam SGML Parser uses an LL(1) parser generator, notably LLgen, for both DTD and document parsing. Actually, SGML is not LL(1), [so the developers] used the 'conflict resolvers' from LLgen to overcome the problems [they] came across." Questions regarding the parser and its current status might best be directed to the Vrije Universiteit: Prof. Dr. J.C. van Vliet, Internet email: hans@cs.vu.nl. As a last resort, contact Jos Warmer at PTT Research (no longer at VU): jos@pttrnl.nl OR possibly JB_Warmer@pttrnl.nl. The Amsterdam Parser is available via anonymous-FTP on the hosts listed below, though support for the parser (obtained in this manner) should not necessarily be expected. Host star.cs.vu.nl (130.37.24.6) DIRECTORY rwxr-xr-x 512 Aug 7 10:34 Sgml FILE rw-r--r-- 419803 Aug 7 10:23 Sgml.tar.Z Host nic.funet.fi (128.214.6.100) Location: /pub/languages/sgml FILE rw-r--r-- 419803 Aug 12 02:14 Sgml.tar.Z Host isfs.kuis.kyoto-u.ac.jp (130.54.20.1) Location: /ftpmail/ftp.ricoh.co.jp/pub/misc/sgml FILE rw-rw-r-- 419803 Dec 18 17:44 Sgml.tar.Z Host theta.iis.u-tokyo.ac.jp (130.69.48.4) Location: /sgml FILE rw-rw-r-- 419803 Dec 13 13:12 Sgml.tar.Z <124> Software Exoterica - XGML Validator (XGML Normalizer, XGML Translator, XGML OmniMark) An entry-level validating SGML parser distributed by EPSIG and GCA for just sixty-five (65) US dollars is Software Exoterica's Validator(tm). XGML Validator is an application of Software Exoterica's XGML Translator, and contains a subset of features implemented in the fully-functional XGML Normalizer and XGML Tester. Validator is a batch SGML parser with error detection but no entity expansion or normalization features. XGML Normalizer also produces a version of the document with all shortrefs and entities expanded; Tester will generate an even more detailed version of the document in various formats, including the format suggested by the newest ISO standard for conformance testing. The Validator parser validates SGML declarations, DTDs and document instances: it detects all errors in a single pass, reports errors in a diagnostic transcript, recovers intelligently from errors and always parses the full document. Validator requires a PC (MS-DOS) with 386 CPU and at least 4 megabytes of memory. See further description in: (1) "EPSIG Sells XGML Validator," The Seybold Report on Publishing Systems 20/9 (January 28 1991) 28; (2) "EPSIG and GCA Selling Exoterica Validator," SGML Users' Group Newsletter 20 (September 1991) 32-33, or (3) 17 (December 1990) 4, or EPSIG News 3/4 (December 1990) 9. Ordering from GCA or EPSIG: EPSIG; c/o OCLC, Mail Code 278; 6565 Frantz Road; Dublin, OH 43017- 0702; TEL: (1 614) 764-6195; FAX: (1 614) 764-6096. GCA address: Graphic Communications Association; 100 Daingerfield Road, Alexandria, VA 22314; TEL: (1 703) 519-8160; Telex: 510-600-0889; FAX: (1 703) 548-2867. Software Exoterica's related products include an SGML editor and several tools for SGML translation/transduction. (1) XGML CheckMark is a full-screen SGML editor available for the Mac II. It was originally developed as a conversion tool, but can be used for a variety of authoring/writing tasks. CheckMark uses actual DTDs, not rules files, and can validate documents while they are being edited. Tags may be entered into documents from menus or directly from the keyboard. (2) XGML Translator is an SGML-based string transducer which uses a string conversion language called XTRAN. Translator is available for Unix systems and 386 DOS microcomputers. Translator's functions are up-, down-, context- and cross-translations -- creating various kinds of SGML <<->> non-SGML conversions of documents and structured information. Exoterica considers XGML Translator appropriate for third generation applications. See Brian Travis, "Review of the Exoterica XGML Translator," 17 (December 1990) 5- 11. (3) XGML OmniMark is Exoterica's tool for fourth generation applications using markup languages. OmniMark script-writing and document translation facilities can be used to convert SGML documents to input languages of other languages, and to convert the output languages of other to SGML-defined languages (e.g., AAP, CALS), and to convert between arbitrary languages and data formats. OmniMark is available for MS-DOS (386-class machines), Microsoft Windows 3.0, OS/2, Macintosh, SCO UNIX, DEC VAX, and most UNIX systems. See EPSIG News 4/4 (December 1991) 5-6. Queries about XGML Validator and the full line of Exoterica's XGML products may be directed to: Software Exoterica Corporation; 383 Parkdale Avenue, Suite 406; Ottawa, Ontario; CANADA K1Y 4R4; TEL: 1- 613-722-1700; TEL: 1-800-565-9465 (1-800-565-XGML); FAX: 1-613-722- 5706. <125> SEMA Group/Yard Software Systems Limited (Mark-It Light) Mark-It Light is a "low cost" entry-level validating parser from the SEMA Group, and runs on MS-DOS 286 and 386 class microcomputers. It is based upon the so-called Sobemap parser, named after the European company now known as the SEMA Group. Mark-It Light supports a subset of features found in SEMA's fuller Mark-It package. Mark-It includes, as part of the package, a structured document editor called Write-It, which is equivalent to IBM's TextWrite and SoftQuad's Author/Editor. The Mark-It parser supports SGML more completely than most other SGML parsers: multiple concurrent documents, subdocuments, data tags, attributes, marked sections, rank, tag omission (minimization), short tags, short references, formal public identifiers, model inclusions, links (simple, implicit and explicit links). Additionally, Mark-It uses SGML together with a regular expression pattern recognizer to control the process of converting files both into and out of SGML form: "users can easily convert to and from TeX, including the LaTeX and JLaTeX forms, without having to be provided with a preconfigured TeX conversion package." Other components in the suite include Parse-It (file creator and parser), Compile-It (SGML DTD compiler for Write-It) and Lisp-It (an SGML-aware interpreter useful for applications development). Compiled versions of Mark-It and related tools are also available for UNIX and most mainframe computers, while C source code is available to OEM developers. The Mark-It Light parser itself thus supports basic validation: markup using the complete range of tag minimization features (data tagging, SGML tag ranking option), and interactive correction of markup errors; it also provides access to the full set of 256 character codes (extended ASCII). See: (1) "Yard Release[s] Mark-It Light," SGML Users' Group Newsletter 18 (November 1990) 22; (2) "Mark-It Light," EPSIG News 4/1 (March 1991) 4; (3) Martin Bryan and Ed Warnshuis, "Comments from our readers (letter to the editor)." The Seybold Report on Publishing Systems 20/10-11 (February 25, 1991) 2-4; (4) "Five New Products from Sema," EPSIG News 4/3 (September 1991) 7. Contacts: In Europe, Products Manager; SEMA Group Systems Ltd.; Avonbridge House, Bath Road, Chippenham, Wiltshire SN15 2BB; TEL +44- 249-656-194 (Paul Moorhead); FAX +44-249-655-723; or, Martin Bryan; Yard Software Systems; 29 Oldbury Orchard; Churchdown; Glos GL3 2PU, UNITED KINGDOM; TEL: +44 452 714029; or Griet Descheemaeker; SEMA Group Belgium; Place du Champ de Mars 5; Boite 40;B-1050 Brussels; BELGIUM; TEL: +32 2 508 53 23; In the US: John W. Oster II; Yard Software USA; 125 North Main Street; Bel Air, MD 21014; TEL: 301/838- 1911; FAX: 301/838-1913. <126> SoftQuad (Author/Editor) Transparent SGML parsing facilities are incorporated into SoftQuad's popular Author/Editor and related SGML products. Author/Editor (A/E) is an SGML(-aware) text editor and word-processor used as a major component in electronic publishing by a wide range of developers. Author/Editor uses pre-compiled DTDs rather than directly-editable ASCII DTD files (viz, DTDs based upon the AAP tagsets, CALS-SGML DTDS, ATA DTDs, or other DTDs shipped with A/E). Users who wish to customize standard DTDs or build new DTDs rely upon a second SoftQuad product, RulesBuilder (or its batch counterpart, 'buildRules'). The user is informed by A/E of all legal tags at any given point in the text by means of pull-down menus, from which the tags may be selected. Valid SGML document structure (i.e., conforming to the rules file) is enforced because it is not possible to type SGML tags directly into the file. Recent (1991) enhancements to A/E include a macro programming language (Scheme, based upon LISP), a new tables editor and graphics module. A/E interfaces with a number of other related products to support text and graphics formatting of the SGML files (SoftQuad Publishing Software). A/E is currently (1991) supported for both Apple Macintosh and DOS (MS-Windows), as well as for SunOS, Ultrix, Sun Open Windows, DEC Motif, etc. Prices (1991, with 12-month support contract) are about 1995 dollars US (DOS), 3295 dollars US (UNIX) and 1695 dollars US (Mac), though academic discounts will likely be arranged. See further description: (1) "SoftQuad Author/Editor: An SGML Context-sensitive Text Entry System." Pp. 99- 103 in Tools for Humanists, 1989. A Guidebook to the Software and Hardware Fair Held in Conjunction with the Dynamic Text [6-9 June 1989 Toronto]. Toronto, Ontario: Centre for Computing in the Humanities, 1989; (2) "SoftQuad - New Language/Platform," SGML Users' Group Newsletter 19 (April 1991) 21; (3) "SoftQuad Adds Table Editor," EPSIG News 4/4 (December 1991) 4. Contact: In North America: SoftQuad Inc.; 56 Aberfoyle Crescent, Suite 810; Toronto, Ontario; Canada M8X 2W4; TEL: +1 416 239 4801; +1-800- 387-2777; FAX: +1 416 239 7105. Email: dns@sq.com (David Slocombe). In Europe: Open Information Technology (UK) Ltd.; Marble Arch; King Street; Knutsford; Cheshire WA16 6HD; UNITED KINGDOM; TEL: +44 565 50021; FAX: +44 565 51093. <127> E2S (EASE -- E2S Advanced SGML Editor) "EASE enables the production of Basic SGML documents (see clause 15.1.1 of ISO 8879), using any PC running DOS 3.30 (or higher) and with 550 kB free memory. It also runs on UNIX graphical workstations, under X-windows. EASE consists of two integrated parts, the DTD- editor and the DI-Editor, both of which offer on-line and context- sensitive control and help in the use of SGML (syntax- oriented editing). They also offer on-line, context-sensitive (and hypertext- like) help in the use of the editors themselves. The user interface is window-based, with pull-down menus for the various options/functions -- which can be accessed not only via the mouse, but also via the keyboard or function keys. All nine official languages of the E.C. are supported simultaneously, and unformatted output can be sent straight to an ASCII or PostScript printer. On the PC version, individual copies of the software require a protection key (dongle) to be fitted to the Centronics (parallel) port." "The DTD-editor is used to create, edit, validate and compile DTDs for use with the DI-editor. Inexperienced users can use options from the pull-down menus to build DTDs, whilst others can enter the text directly (with DTD-editor automatically picking up any syntactic errors). Other DTDs stored in plain ASCII files, can be imported into the DTD-editor for parsing, validation, and compilation (provided that they conform to the requirements suitable for producing Basic SGML documents). On the basis of the (compiled) DTD, the DI-editor displays the elements for the minimal logical structure of a document of that type. Users may then either edit the logical structure of the document (all actions being checked for semantic correctness), or select appropriate elements and swap into the text window for data entry (and/or the entry of further valid elements). It is also possible to import a plain text file, conforming to ISO 2022, and add valid markup according to the DTD; the resulting document will be a validated Basic SGML document (extended for multilinguism)." Reference: "New SGML Editor from E2S," EPSIG News 4/3 (September 1991) 6-7. Normal licence fees: are 1,250 dollars US (PC); 6,500 dollars US, (workstation); University licence fees: PC: 125 dollars US + medium, documentation, administration and shipment: 500 dollars US; Workstation: 650 dollars US + medium, documentation, administration and shipment: 500 dollars US. This information thanks to Caroline de Vleeschauwer (Email: cdv%e2s@relay.EU.net, Date: Thu, 19 Dec 91) Contact: Joost CARDOEN, Managing Director, E2S, Moutstraat 100; B-9000 Gent, Belgium, TEL: +32/91/21.03.83, FAX: +32/91/20.31.91, e-mail: jca@e2s.be; OR Ronny Verkest, Sales Manager; E2S; Moutstraat 100; B- 9000 Gent; Belgium; TEL: +32(91)21 03 83; FAX: +32(91)20 31 91; Email: e2s@e2s.be (Internet). <128> Electronic Book Technologies (MS-Windows) DynaText SGML Browse/Search DynaText supports SGML document indexing/searching/browsing and is implemented for MS-DOS (Windows), as well as for various UNIX platforms. It allows users to read, query, dynamically display and annotate electronic "books" or other structured information. DynaText uses SGML element tags to automatically generate hyperlinks to associated material such as diagrams, tables, and explicit cross references; it allows users to add their own link types/behavior through simple style sheet entries. Electronic style sheets are held in ascii-editable files with SGML syntax. This mechanism can be employed by users who want to create dynamic multi-media documents. Style definitions may be used to set the display characteristics (font type, size, color) including visibility or suppression of each SGML element. The principle of conditional visibility of elements (and element classes) in response to style sheets and icon clicks permits rapid customization of electronic books where a variety of document editions is desired. DynaText builds a full text index of the SGML document and (unlike other indexers that simply report occurrences within an entire document) can report occurrences within SGML components. Hit-list statistics for each document section provides an unprecedented level of search precision that enables users to find terms within the relevant sections of the document quickly. Wild cards and regular expressions may be used in queries, eliminating the need for exact string matches; Boolean logic (AND, OR) may also be specified. The indexer supports synonym lists that act like special purpose thesauri that enable access to information though a variety of synonymous terms. This feature is especially useful in acronym-laden technical reference manuals. In addition to displaying SGML-structured character text, DynaText supports a variety of popular raster formats (such as PICT, TIFF, Sun Raster and CCITT fax formats) to facilitate capture of associated artwork. The system also supports an open architecture for integration with multi-media applications allowing sound, animation and video supplements to be added to existing reference documents. Discounts up to 80 percent are available to academic institutions. See fuller descriptions of the product in: (1) "DynaText: Electronic Book Engine from EBT [Electronic Book Technologies]: First to Handle any SGML Application," Seybold Report on Publishing Systems 20/2 (September 24, 1990) 18-22; (2) Steven J. DeRose, "DynaText: Electronic Book Indexer/Browser," EPSIG News 3/4 (December 1990) 1-2; (3) "Electronic Book Technologies," SGML Users' Group Newsletter 20 (September 1991) 32; (4) "Electronic Book Technologies' Dynatext (SGML compiler). Text Tools: Beyond Search and Retrieval." Release 1.0 91/7 (July 31, 1991) 15-17. Contacts: (US): Electronic Book Technologies, Inc.; One Richmond Square; Providence, RI 02906 USA; TEL: (401) 421-9550; FAX: (401) 421-9551; (Internet): sjd%ebt-inc@uunet.uu.net (Steven J. DeRose), or ebt-inc!sjd@uunet.uu.net (that's ebt- inc!sjd@uunet.uu.net); (Internet): lrr%ebt-inc@uunet.uu.net (Louis R. Reynolds); in Europe: EBT International; 20, Pre de la Ferme; 1261 Gingins; SWITZERLAND; TEL: +41-22-69-24-24; FAX: +41-22- 69-24-25. <129> Avalanche Development Company (FastTag, IMSYS) Intelligent Autotagging Retro-fitting electronic data files held in proprietary format with SGML tagging, or converting paper documents to SGML-tagged files is frequently accomplished with the use of intelligent automatic markup software. The "autotagging" software from Software Exoterica (XGML Translator, XGML OmniMark) was noted briefly above,; see sub <124> and compare <44>. Other products for SGML conversion/translation are Avalanche Development's tools FastTag and IMSYS: they incorporate autotagging software using visual recognition techniques to generate structured output text. Using a VRE(tm) Visual Recognition Engine, FastTag reads visual clues: (1) from an output file in the case of electronic files or, (2) in conjunction with OCR, to convert paper- based documents into SGML electronic file formats. FastTag reckons with graphics objects, tables and similar text structures. Input may be ASCII, Calera PDA files, DCA/RTF files, WordPerfect or Word files (other wordprocessor formats supported), OCR/ICR scanner files, DECWrite, Interleaf, and so forth; output is user-definable, but may be specified as SGML, RTF, FrameMaker (Maker Interchange Format, MIF), Ventura Publisher, Interleaf, Datalogics, troff, nroff, tbl and so forth. FastTag and IMSYS are supported on MS-DOS, as well as on a variety of UNIX, VMS and Ultrix systems. The autotagging technology is licensed by a number of OEM developers (Xerox Information Systems (Kurzweil), Bell Atlantic (DocuSource), Shaffstall, IBM and DEC). See: (1) "Avalanche Development's FastTAG." Release 1.0 91/7 (July 31, 1991) 14-16; (2) "Avalanche - New Developments," SGML Users' Group Newsletter 19 (April 1991) 18-19; (3) "Avalanche FASTTAG to Support EMS DTDs," EPSIG News 3/2 (June 1990) 5-6 (Avalanche announces intention to support DTDs of the EPSIG/AAP Electronic Manuscript Standard); (4) "Systems Solutions for the '90s: In-Plant, Commercial. [Avalanche Development]," Seybold Report on Publishing Systems 18/14 (April 24, 1989) 26-43 (reviews Avalanche's Imsys.CALS, based on its Visual Recognition Engine, which interprets visual cues in the output file of word processors and generates a tagged ASCII file. It thus provides means of updating files to CALS-compliant SGML format). Related products include Proof Positive (spelling, grammar and style checking facilities for use with Interleaf and FrameMaker) and hypertext interface facilities. Contact: Eileen Quirk, Director of Marketing and Sales; Avalanche Development Company; 947 Walnut Street; Boulder, CO 80302 USA; (303) TEL: 449-5032; FAX: (303) 449-3246. Email (Internet): support@avalanche.com, sales@avalanche.com, apps@avalanche.com. <130> qwertz/FORMAT -- SGML to LaTeX (and nroff/troff) Translator Version 1.1 of the qwertz SGML documenting processing system is now available. Format is an SGML to LaTeX (and nroff/troff) translator. It consists of: (1) SGML document type definitions for the LaTeX document styles (articles, books, reports, letters, slides), for BibTeX bibliographies and for Unix manual pages; (2) Programs for translating SGML documents of the these types into LaTeX and nroff/troff; (3) A command for extracting source code from documentation, which is useful for a simple kind of "literate programming". Format allows LaTeX documents to be created using powerful and comfortable SGML editors, such as Author/Editor, in a quasi WYSIWYG manner. It also provides a layer of abstraction from LaTeX, allowing documents to be more easily translated into formats for other document processing systems, such as nroff/troff or MS-WORD. Anonymous-FTP locations (use ARCHIE for updates): Host gmdzi.gmd.de (129.26.1.90) Location: /pub/gmd FILE rw-r--r-- 959 Sep 30 11:51 sgml2latex-format.readme FILE rw-r--r-- 1336911 Sep 30 11:51 sgml2latex-format.tar.Z Host utsun.s.u-tokyo.ac.jp (133.11.11.11) Location: /misc/sgml FILE rw-r--r-- 1336911 Oct 31 11:48 sgml2latex-format.tar.Z Host liasun3.epfl.ch (128.178.36.30) Location: /pub/tex FILE rw-r--r-- 1336911 Oct 23 20:28 sgml2latex-format.tar.Z Contact: Thomas F. Gordon; German National Research Center for Computer Science (GMD); Schloss Birlinghoven, D-5205 Sankt Augustin 1, Germany; Internet email: thomas@gmdzi.gmd.de; phone: (+49 22441) 14- 2665. (Adapted from Usenet posting to comp.text.sgml; Subject: New Version of Format; Date: 30-Sep-91) <131> NIST "SGML parser materials" The older NIST parser materials are reported to be out-of-date and somewhat unreliable. Furthermore, questions persist concerning the NIST's support for these code sources, and, in light of apparently stronger current support for ODA (e.g., NIST ODA SIG in the OSI Implementors Workshop), for SGML more generally. It would be wise to consult with NIST and with a recognized SGML guru before spending time building on these SGML parser tools. Host stag.math.lsa.umich.edu (141.211.64.23) FILE rw-r--r-- 1731 Sep 15 1990 nist-sgml.cheater FILE rw-r--r-- 202923 Sep 15 1990 nist-sgml.tar.Z Host nic.funet.fi (128.214.6.100) Location: /pub/languages/sgml FILE rw-r--r-- 1731 Aug 12 02:29 nist-sgml.cheater FILE rw-r--r-- 202923 Aug 12 02:29 nist-sgml.tar.Z Host theta.iis.u-tokyo.ac.jp (130.69.48.4) Location: /sgml FILE rw-rw-r-- 871 Dec 13 13:13 sgml.cheater.Z FILE rw-rw-r-- 198363 Dec 13 13:15 sgml.cpio.Z <132> Public DTD Collections Public DTDs and entity sets for SGML are printed in the standards documents and in most handbooks, and are usually shipped with commercial SGML software. The nineteen (19) character entity sets published in Annex D of the ISO 8879 SGML Standard are legally unencumbered, and will probably become widely available in electronic format (use ARCHIE to search for filenames like "ISOlat1," "ISOlat2," ISOgrk1," "ISOgrk2," "ISOgrk3," "ISOgrk4," "ISOcyr1," "ISOcyr2," "ISOnum," "ISOdia," "ISOpub," "ISObox," "ISOtech," "ISOamso," "ISOamsb," "ISOamsr," "ISOamsa," "ISOamsn," "ISOamsc," etc.). Currently, sample DTDs in usable ASCII format may be obtained on the academic networks from several sources: (a) DTDs produced by the Text Encoding Initiative (TEI) are available from the BITNET listserv(er) on UICVM. The filenames and content of these DTDs can be expected to undergo change during 1992, but they currently have a VM/CMS filespec of ' DTD' (e.g., TEI1 DTD; TEIHDR1 DTD; TEIWSD1 DTD; TEIBASE1 DTD; TEIFRON1 DTD; TEIBACK1 DTD; TEILOW1 DTD; TEICRYS1 DTD; TEILING1 DTD; TEIREND1 DTD; TEIDRAM1 DTD; TEITC1 DTD; TEITC1 DTD). A number of related files on TEI syntax are also available (SYNTAX MEMO; OVERVIEW SYNTAX; BASICS SYNTAX; DOCUMENT SYNTAX; SGMLDECL SYNTAX; DTD SYNTAX). Use interactive BITNET command or mail to request the files, "get " and item <114> above for other standard BITNET LISTSERVer syntax. The DTDs may be obtained also from the LISTSERVer in Europe (Goettingen) which hosts MARKUP-L: LISTSERV@DGOGWDG1 on BITNET or listserv@ibm.gwdg.de on the Internet; see <116>. (b) MAJOUR (Modular Application for Journals). The EWS (European Workgroup on SGML) has published a DTD for the headers of scientific articles (MAJOUR-Header DTD). In cooperation with the CERN EPS (European Physical Society), EWS has now also produced a "body" standard DTD for complete scientific articles. The DTDs developed by EWS and EPS/CERN are based upon AAP article DTDs. See above entry <110> for further details on EWS. Free copies of the MAJOUR DTD(s) may be obtained from the STM (International Group of Scientific Technical and Medical Publishers): contact Ms Harriet de Hoog, STM Secretariat, Keizersgracht 462, 1016 GE Amsterdam, THE NETHERLANDS; FAX +31-20-38-15-66. See further in "European Workshop on SGML," SGML Users' Group Newsletter 20 (September 1991) 22. Alternately, contact the GCA for a free copy of MAJOUR (with purchase of other materials from the GCA's printed materials); see SGML Users' Group Newsletter 20 (September 1991) 32. (c) A HyTime DTD (declarations set) is located on the online SIGhyper forums: Host ftp.ifi.uio.no (129.240.88.1) and Host mailer.cc.fsu.edu (128.186.6.103). Disk space has been secured at the University of Oslo, Department of Informatics for a larger collection of public DTDs: see ftp.ifi.uio.no:/pub/SGML for contributions that may be available from that archive (e.g., for the "general" document DTD 'PUBLIC "ISO 8879:1986//DTD General Document//EN"' from ISO 8879 Annex E as amended). (d) Public DTDs and entity sets should be available from Exeter. One may periodically check the online library of the Exeter SGML Project for its collections: anon-FTP to sgml1.ex.ac.uk (144.173.6.61). See item <113> above. (e) The CALS-BBS forum described sub item <117> above contains several CALS-related DTDs, usually in self-extracting DOS compressed files (filename.exe). (f) DTDs supporting the AAP/EPSIG manuscript standard are available from EPSIG for 10 US dollars plus shipping; see <107>. The same files may be accessible from file servers: use ARCHIE to search for AAPARTCL.TXT; AAPBOOK.TXT; AAPSERL.TXT; SHORTREF.TXT or similar filenames. (g) The "Information Architecture" working group of the OSF Documentation Special Interest Group (IA WG of the OSF Doc SIG) will be releasing public DTDs during the first and second quarters of 1992. Contact Fred Dalrymple, the Open Software Foundation; TEL: (1 617) 621-8855, or John Bowe, Email (Internet): bowe@osf.org. =========================================================== FURTHER BIBLIOGRAPHY AND RESOURCE GUIDES FOR SGML =========================================================== <133> Cover, Robin; Duncan, Nicholas; Barnard, David. Bibliography on SGML (Standard Generalized Markup Language) and Related Issues. Technical Report 91-299. Queen's University, Kingston, Ontario. February, 1991. ISSN 0836-0227. 312 pages. A revised print version of a bibliographic and information database (compiled by Robin Cover), structured in SGML-database and formatted with SGML ->> BibTeX utilities developed at Queen's University by Nick Duncan and David Barnard. For print copies, contact: (1) Department of Computing and Information Science; Queen's University; Kingston, Ontario, Canada K7L 3N6; TEL: (613) 545- 6056; Email (Internet): heather@qucis.queensu.ca, or (2) the Graphic Communications Association, <105>. The printed version of the database contains a "Short Bibliography" of 67 essential references, and a fuller "Main Bibliography" with 1403 citations (many with abstracts). The second major section is an SGML Directory for some 117 SGML- supporting groups in academia, government, or industry: each entry supplies addresses, descriptions of software products or SGML services, and references. In detail: Table of Contents 1 Introduction.............................................1 1.1 What is SGML? (Robin Cover)..............................2 1.2 Content of the Bibliography (Robin Cover)................2 1.3 Format of the Bibliography (Nicholas Duncan).............9 1.4 Database Access and Collection Maintenance...............9 2 Acknowledgments (Robin Cover)...........................13 3 Acronym List............................................15 4 Short Bibliography......................................20 5 Main Bibliography.......................................25 6 Directory of SGML Supporting Groups....................160 6.1 Index of Entries.......................................160 6.2 Directory..............................................163 7 Appendices.............................................261 A Processing the Bibliography............................261 B BiBTeX Enhancements....................................262 C Document Type Definitions (DTDs).......................265 D ISO 8879-1986 Entities (LAT1, LAT2, ISOdia, ISOmin)....270 E Language Codes for SGML (ISO 639 and ANSI/MARC)........293 New bibliographic references and other SGML information for this database are welcome: please send citations (published or unpublished materials: technical reports, working papers, internal memoranda, articles, product announcements, product reviews) to Robin Cover via electronic or postal mail. Plans are underway to make the SGML database available electronically via a BITNET listserver (listserv@uicvm.bitnet or listserv@uicvm.uic.edu) and on Internet via anonymous FTP. <134> The SGML Source Guide. The Graphic Communications Association's Guide to Standard Generalized Markup Language (SGML) Systems, Software, Service, Consultants, Seminars and Resources. Edited by Marion Elledge. Graphic Communications Association, February, 1991. 6" x 8". 105 pages. ISBN: 0-93505-13-2. Several SGML-related standards documents distributed by GCA are listed and annotated in this Guide. Listings of SGML suppliers are in alphabetical order and provide information on the type of business, name and description of products or services, and prices. The Guide is issued on a subscription basis in looseleaf format; updates are issued quarterly or as information is accumulated. <135> SGML Products and Services. A document covering primarily CALS- SGML, produced by Joan Smith for the CALS in Europe SIG. Cost is approximately 20 UK pounds. Contact: David Ardron, Secretary, CALS in Europe SIG; Ferranti Computer Systems Ltd,; Western Road, Bracknell, Berkshire RG12 1RA; UNITED KINGDOM; TEL: +44-344-483232. ===================== POSTSCRIPT ===================== COPYLEFT. The bibliography above is copyright (c) Robin Cover, December 1991. Everyone is granted permission to freely copy and distribute the file in any format whatever, for non-commercial purposes, so long as the entire document is reproduced, together with this notice. Corrections and additions will be received with gratitude. PERSPICUITY IN PUNCTUATION. Several painful compromises have been required to produce this network-safe ASCII version. Somewhat against the spirit of SGML, I have used angle brackets to delimit entry numbers, both serially and as cross-references; any editor supporting regular-expressions can be used to convert these particular delimiters to conventional square-brackets. Likewise, the use of parentheses for square brackets in running text is infelicitous, where the latter more clearly signified (editorially) added text, and nested better within parenthesized expressions. SGML character entities have been used to represent extended ASCII accented characters, for which I offer no apology: anyone wishing to print the bibliography on paper copy should first change out these entities (e.g., é, É, ü, à, ø, ç, č). Thus, with the exception of @ ("@" IRV 4/0 = decimal 64), I have restricted the character set to the 'safe' subset of (non- national-use) ISO 646 IRV characters, but at the risk of complete clarity in a few contexts. According to network authorities, use of any characters outside the following 'safe' ISO 646 subset is dangerous, especially across national boundaries, since unanticipated and often undetectable instances of EBCDIC-ASCII translation during transmission will have mutagenic effects upon the 'non-safe' IRV/ASCII characters. This wisdom warns that the 'safe' ISO 646 subset contains ONLY the following (non national-use) characters: a b c d e f g h i j k l m n o p q r s t u v w x y z A B C D E F G H I J K L M N O P Q R S T U V W X Y Z 0 1 2 3 4 5 6 7 8 9 " % & ' ( ) * + , - . / : ; < = > ? _ SPACE (SP, IRV 2/0) ---------------------------------------------------------------------- Robin Cover BITNET: zrcc1001@smuvm1 ("one-zero-zero-one") 6634 Sarah Drive Internet: zrcc1001@vm.cis.smu.edu Dallas, TX 75236 USA Internet: robin@utafll.uta.edu ("uta-ef-el-el") TEL: (214) 296-1783 Internet: robin@ling.uta.edu FAX: (214) 709-3387 Internet: robin@txsil.sil.org ---------------------------------------------------------------------- ===============Cut Here: End Part 4 of 4 parts====================