=====================Start Part 1 of 4 parts==================== STANDARD GENERALIZED MARKUP LANGUAGE ISO 8879:1986 (SGML) ANNOTATED BIBLIOGRAPHY AND LIST OF RESOURCES Version 2.0 Revised January 1992, (c) Robin Cover CONTENTS: 1. Introductory and General SGML Bibliography 2. SGML Manuals: Commentary and Indices for ISO 8879 3. SGML Applications and Implementations 4. ISO Standards Publications Germane to SGML 5. Serial Publications Dedicated to SGML 6. SGML Supporting Organizations and SIGs 7. SGML Electronic Discussion Forums and Online Support Centers 8. Entry-level SGML Parsers and Related SGML Software 9. Further Bibliography and Resource Guides for SGML 10. Postscript ==================================================== INTRODUCTORY AND GENERAL SGML BIBLIOGRAPHY ==================================================== Bibliographic items <6>, <10>, <13>, <24>, <29>, <39> and <42> in this section are marked with an asterisk ("*") to indicate their relative importance and as primary introductions to SGML. Other titles in section 1 represent general studies, but likewise may be reckoned as essential reading. <1> Adler, Sharon C. "DSSSL- Document Style Semantics and Specification Language. 1/8 (January 1989) 8-9. An overview of the standard by the editor of DSSSL. For brief description of the goals of DSSSL, see the entry below on this Draft International Standard (ISO/IEC DIS 10179), <69>. <2> Amsler, Robert A.; Tompa, Frank W. "An SGML-Based Standard for English Monolingual Dictionaries." In Fourth Annual Conference of the UW Centre for the New Oxford English Dictionary: Information in Text. Proceedings of the Conference. Conference held in Waterloo, Ontario, Canada, 26-28 October 1988. Pages 61-79. Waterloo, Ontario: University of Waterloo, 1988. The 'Dictionary Encoding Initiative' referenced is loosely affiliated with the international Text Encoding Initiative; both projects seek to epploy SGML. For SGML used in dictionary markup, see also <41> below. Several of the Waterloo Annual Conference volumes contain articles germane to descriptively- tagged and SGML-tagged text. For further details on the Waterloo Centre, see <19> below. <3> Ballanti, Anna; Cork, Deborah; Dam, Lex van; Jonghe, Jurgen de; Herwijnen, Eric van; Nijdam, Marco; Samarin, Alexandre; Shave, Tony. "Text Processing at CERN. Part 1: Overview." SGML Users' Group Bulletin 3/2 (1988) 39-54. <4> Barnard, David T.; Fraser, Cheryl A.; Logan, George M. "Generalized Markup for Literary Texts." Literary and Linguistic Computing 3/1 (1988) 26-31. Abstract: Encoding literary texts for analysis, electronic transmission, or publication requires the marking of various substantive, structural and formal features. The development of a standard comprehensive markup language for these purposes is a desideratum. This paper offers a set of requirements for such a language, reviews related work, and describes a newly-created standard based on the Standard Generalized Markup Language. <5> Barnard, David T.; Hayter, Ron; Karababa, Maria; Logan, George M.; McFadden, John. "SGML Based Markup for Literary Texts: Two Problems and Some Solutions." Computers and the Humanities 22/4 (1988) 265-276. ISSN: 0010-4817. (Revision of Technical Report 204, Queen's University Department of Computing and Information Science, 1988, ISSN 0836-0227). Abstract: There is wide agreement on the need for a markup standard for encoding literary texts. The Standard Generalized Markup Language (SGML) seems to provide the best basis for such a standard. But two problems inhibit the acceptance of SGML for this purpose. (1) Computer-assisted textual studies often require the maintenance of multiple views of a document's structure but SGML is not designed to accommodate such views. (2) An SGML-based standard would appear to entail the keyboarding of more markup than researchers are accustomed to, or are likely to accept. We discuss five ways of reducing the burden of markup. We conclude that the problem of maintaining multiple views can be surmounted, though with some difficulty, and that the markup required for an SGML-based standard can be reduced to a level comparable to that of other markup schemes currently in use. <6> *Barron, David. "Why Use SGML?" Electronic Publishing: Origination, Dissemination and Design (EPOdd) 2/1 (April 1989) 3-24. CODEN: EPODEU; ISSN 0894-3982. Abstract: The Standard Generalised Markup Language (SGML) is a recently-adopted International Standard (ISO 8879). The paper presents some background material on markup systems, gives a brief account of SGML, and attempts to clarify the precise nature and purpose of SGML, which are widely misunderstood. It then goes on to explore the reasons why SGML should (or should not) be used in preference to older-established systems. A summary of the article is also printed in "Why Use SGML," SGML Users' Group Newsletter 13 (August 1989) 10. <7> Bryan, Martin. "Creating Informative Document Models." SGML Users' Group Newsletter 20 (September 1991) 12-17. <8> Burnard, Lou. "What is SGML and How Does it Help?" Pp. 65-79 in Modelling Historical Data: Towards a Standard for Encoding and Exchanging Machine-Readable Texts, 1991. See volume information sub the editor, Daniel Greenstein, <21> below. A revised copy of the article in tagged electronic format is available from the TEI-L LISTSERVer (listserv@uicvm on BITNET) as EDW25 LDOC, October 1, 1991. <9> Chamberlin, Donald Dean; Goldfarb, Charles F. "Graphic Applications of the Standard Generalized Markup Language (SGML)." Computers and Graphics 11/4 (1987) 343-358. ISSN: 0097-8493. Abstract: The Standard Generalized Markup Language (SGML) is a language for representing document structure. This paper discusses ways in which the SGML language might be used to represent graphic as well as textual contents of a document. By using SGML markup for both graphics and text, a document processing application can achieve a more uniform treatment and tighter coupling between these two types of materials. <10> *Coombs, James; Renear, Allen; DeRose, Steven J. "Markup Systems and the Future of Scholarly Text Processing." CACM 30/11 (1987) 933-947. ISSN: 0001-0782. Cf. CACM 31/7 (July 1988) 810-811. Abstract: The authors argue that many word processing systems distract authors from their tasks of research and composition, toward concern with typographic and other tasks. Emphasis on "WYSIWYG", while helpful for display, has ignored a more fundamental concern: representing document structure. Four main types of markup are analyzed: Punctuational (spaces, punctuation,...), presentational (layout, font choice,...), procedural (formatting commands), and descriptive (mnemonic labels for document elements). Only some ancient manuscripts have no markup. Any form of markup can be formatted for display, but descriptive markup is privileged because it reflects the underlying structure. ISO SGML is a descriptive markup standard, but most benefits are available even before a standard is widely accepted. A descriptively marked-up document is not tied to formatting or printing capabilities. It is maintainable, for the typographic realization of any type of element can be changed in a single operation, with guaranteed consistency. It can be understood even with no markup formatting software: compare "
" to ".sk 3 a; .in +10 -10; .ls 0; .cp 2". It is relatively portable across views, applications and systems. Descriptive markup also minimizes cognitive demands: the author need only recall (or recognize in a menu) a mnemonic for the desired element, rather than also deciding how it is currently to appear, and recalling how to obtain that appearance. Most of this extra work is thrown away before final copy; descriptive markup allows authors to focus on authorship. (abstract supplied by Steve DeRose) <11> Cover, Robin; Duncan, Nicholas; Barnard, David. "The Progress of SGML (Standard Generalized Markup Language): Extracts from a Comprehensive Bibliography." Literary and Linguistic Computing 6/3 (1991) 200-212. ISSN: 0268-1145. The article includes introductory essay sections delineating the fundamental conceptions of SGML, its broad application, and the advantages it brings to academia, industry and government sectors. For a description of the complete bibliographic database itself, see further under section 9, item <133> below. <12> Cruz, Gil C.; Judd, Thomas J. "The Role of a Descriptive Markup Language in the Creation of Interactive Multimedia Documents for Customized Electronic Delivery. In Electronic Publishing '90: Proceedings of the International Conference on Electronic Publishing, Document Manipulation and Typography (Gaithersburg, Maryland, September 1990). Pages 249-262. The Cambridge Series on Electronic Publishing. Cambridge: Cambridge University Press, 1990. <13> *DeRose, Steven J.; Durand, David G.; Mylonas, Elli; Renear, Allen H. "What is Text, Really?" Journal of Computing in Higher Education 1/2 (Winter 1990) 3-26. ISSN: 1042-1726. Abstract: "The way in which text is represented on a computer affects the kinds of uses to which it can be put by its creator and by subsequent users. The electronic document model currently in use is impoverished and restrictive. The authors agree that text is best represented as an ordered hierarchy of content object[s] (OHCO), because that is what text really is. This model conforms with emerging standards such as SGML and contains within it advantages for the writer, publisher, and researcher. The authors then describe how the hierarchical model can allow future use and reuse of the document as a database, hypertext or network." <14> Ellison, Paul A. "SGML and Related Information Standards." Pp. 17-28 (1-12) in Document Exchange: The Use of SGML in the UK Academic and Research Community. Workshop Proceedings 5-7 March 1990 (see <28>below). Abstract: "This paper explains the position of four ISO 'standards' (only one agreed standard, one draft standard and two draft proposals) in the area of text and office information processing. Those standards are SGML (Standard Generalized Markup Language), the 'Fonts' standard (Font Architecture and Interchange Format), DSSSL (Document Style Semantics and Specification Language), and SPDL (Standard Page Description Language). . . In addition, the paper relates these standards to ODA (Office Document Architecture) and places SGML and ODA in their own contexts." <15> Fahmy, Eanass; Barnard, David T. "Adding Hypertext Links to an Archive of Documents." Canadian Journal of Information Science 15/3 (September 1990) 26-41. Abstract: Texts are characterized by various types of linkages, within themselves and with other documents, which may be either explicit or implicit. When texts are available in machine-readable form, the ability to trace linkages should become much easier, and more complex tracing of linkages should be possible. Hypertext is an electronic document paradigm whose distinguishing feature is machine support for the building and tracing of intra- and inter- document links; a document is viewed as a collection of nodes connected by directed links. A limitation of many hypertext systems is that all links must be created explicitly by the user. This is impractical in many situations, and it is unnecessary if the link structure is inherent in the documents themselves. The work described in our paper is motivated by the perceived need to extend the hypertext paradigm so that links can be derived from a collection of documents. We explore how a rich set of links connecting documents in a text archive can be programmatically generated, and present a set of link types that are useful, specifiable and computable. The documents in the archive are encoded using the Standard Generalized Markup Language, which views a document as a hierarchical organization of document elements. The archive, therefore, consists of a forest of document trees. <16> Gaspart, Jean-Pierre. "Use of the SGML Parser at the Office for Official Publications of the European Communities (OPOCE)." SGML Users' Group Bulletin 2/1 (1987) 29-36. <17> Gennusa, Pamela L. "Advantages of an SGML Implementation for Management of an Electronic Text Database." SGML Users' Group Bulletin 2/2 (1987) 73-86. <18> Goldfarb, Charles F. "A Generalized Approach to Document Markup." Proceedings of the ACM SIGPLAN SIGOA Symposium on Text Manipulation. = SIGPLAN Notices 16/6 (1981) 68-73. Conference proceedings containing this paper also available as SIGOA Newsletter 2/1-2 (Spring/Summer 1981). <19> Gonnet, Gaston. "Examples of PAT Applied to the Oxford English Dictionary." Technical Report OED-87-02. University of Waterloo Centre for the New Oxford English Dictionary. July, 1987. PAT and associated text processing tools are built around descriptively-marked text, even if not specifically SGML text. Compare also "PAT, GOEDEL, LECTOR and More: Text-dominated Database Software, " pp. 83-84 in: Tools for Humanists, 1989. A Guidebook to the Software and Hardware Fair Held in Conjunction with the Dynamic Text 6-9 June 1989 Toronto. Toronto, Ontario: Centre for Computing in the Humanities, 1989. The article describes several software tools developed at the Waterloo Centre, including TRUC (an editor for SGML or SGML-style tagged text). TRUC supports multiple views of a tagged document, based upon use of style-sheets. The University of Waterloo has pioneered several important research efforts in the study of machine-readable lexical databases, machine transduction and generation of descriptively marked-up electronic texts (SGML-style markup). The Centre has also developed software to search, interactively display and format text structured with descriptive markup. These tools were developed for the New Oxford English Dictionary Project with the long range goal of application to other texts. A Newsletter is issued by the Centre describing ongoing research, publications, software enhancements, work of visiting scholars, conferences and other events. Persons interested in the Centre's research and publications may write for a current document list (e.g., especially the several publications and technical reports by Darrell R. Raymond, Donna L. Berg, Gaston H. Gonnet, Timothy J. Benbow, Heather J. Fawcett, Rick Kazman, Frank Wm. Tompa, George V. J. Townsend. See <20>, <33> and <41> in this bibliography. Address: Electronic Text Research; Centre for the New Oxford English Dictionary; Davis Centre; University of Waterloo; Waterloo, Ontario; Canada N2L 3G1 TEL: (1 519) 885- 1211 extension 6183; Email (Internet):newoed@waterloo.edu. <20> Gonnet, Gaston; Tompa, Frank W. "Mind Your Grammar: A New Approach to Modelling Text." Technical Report OED-87-01. University of Waterloo Centre for the New Oxford English Dictionary. February, 1987. Abstract: Beginning to create the New Oxford English Dictionary database has resulted in the realization that databases for reference texts are unlike those for conventional enterprises. While the traditional approaches to database design and development are sound, the particular techniques used for commercial databases have been repeatedly found to be inappropriate for text-dominated databases, such as the New OED. In the same way that the relational model was developed based on experiences gained from earlier database approaches, the grammar-based model presented here builds on the traditional foundations of computer science, and particularly database theory and practice. This new model uses grammars as schemas and "parsed strings" as instances. Operators on the parsed strings are defined, resulting in a "p-string algebra" that can be used for manipulation and view definition. The model is representation-independent and the operators are non-navigational, so that efficient implementations may be developed for unknown future hardware and operating systems. Several approaches to storage structures and efficient processing algorithms for representative hardware configurations have been investigated. <21> Greenstein, Daniel I. (editor). Modelling Historical Data: Towards a Standard for Encoding and Exchanging Machine-Readable Texts. Halbgraue Reihe zur Historischen Fachinformatik, Serie A, Historische Quellenkunden (edited by Manfred Thaller). Band 11. Published for the Max-Planck-Institut für Geschiche, by Scripta Mercaturae Verlag (St. Katharinen), 1991. iv + 223 pages. ISBN: 3-928134-45-0. A collection of fourteen essays on various aspects of conceptual modelling and development of standardized encoding methods for representing knowledge in historical texts. The contributions are by Manfred Thaller, Lou Burnard, Daniel I. Greenstein, Hannes D. Galter, Ingo H. Kropač, Donald A. Spaeth, Hans Jørgen Marker, Thomas Werner, Jan Oldervoll, and Kevin Schurer. The essays reflect interaction with and critique of encoding methods which emerged from the TEI phase I efforts as documented in TEI-P1; see <52> below. <22> Guittet, Christian. "Appendix -- Introduction to SGML. Extract from FORMEX. Published by the EEC Office of Official Publications." SGML Users' Group Bulletin 1/1 (1986) 26-57. <23> Heath, Jim; Welsch, Larry. "Difficulties in Parsing SGML." In Proceedings of the ACM Conference on Document Processing Systems, Santa Fe (5-9 December 1988). Pages 71-77. New York: Association for Computing Machinery, 1988. See similarly, by the same authors, "Difficulties in Parsing: Suggestions to Improve SGML," 10 (July 1989) 8-10. <24> *Joloboff, Vania. "Document Representation: Concepts and Standards." In Structured Documents. Edited by Jacques André, Richard Furuta, and Vincent Quint. Cambridge Series on Electronic Publishing. Pages 75-105. Cambridge: Cambridge University Press, 1989. This article examines the problem of document representation in computer systems for printing, editing or interchange among heterogeneous systems. After a discussion of the various possibilities for defining documentation representation formalisms, it considers a number of standard representations typical of their class: page description languages, SGML, Interscript, ODA. Several other articles in the volume are of direct or marginal relevance to SGML as a metalanguage for document-structuring. <25> Macleod, Ian A. "Storage and Retrieval of Structured Documents." Information Processing and Management 26/2 (1990) 197-208. Abstract: There have been a number of important related activities which suggest the need for a new model for text. ISO standards for document description have been recently developed. These standards view documents as hierarchical objects and it is likely that languages such as SGML will become widely used in the near future for document markup. As structured documents become available, so there will be a need to evolve tools to take advantage of structural knowledge. The goal of the work described here is to develop such tools. A conceptual model for bibliographic data has been designed. The model is known as Maestro (Management Environment for Structured Text Retrieval and Organization). It supports structured documents and provides a query language to retrieve and link information contained in these structures. In this paper an overview of Maestro is presented together with an outline of the basic implementation. <26> Mamrak, Sandra A.; Kaelbling, Michael J.; Nicholas, C.K.; Share, M. "Chameleon: A System for Solving the Data-Translation Problem." IEEE Transactions on Software Engineering 15/9 (September 1989) 1090-1108. ISSN: 0098-5589. Abstract: "There is a need for widespread exchange of electronic documents in domains as diverse as book publishing, automated offices, factories, and research laboratories. The variety of data representations, and the subsequent need for data translation, is a major obstacle to this exchange. This paper describes a comprehensive data translation system with the following characteristics: 1) it is derived from a formal model of the translation task; 2) it supports the building of translation tools; 3) it supports the use of translation tools; and 4) it is accessible to its targeted end-users. A software architecture to achieve the translation capability is fully implemented. Translators have been generated using the architecture, both by the original software developers and by industrial associates who have installed the architecture at their own sites." Further note: A subset of the OSU Chameleon Project's more recent SGML translation tools is to be made freely available to the academic community in 1992. See <44> and "Free Data Translation Software," EPSIG News 4/3 (September 1991) 8-9; cf. "Integrated Chameleon Architecture Plans for 91-92 Year," SGML Users' Group Newsletter 19 (April 1991) 12-13. <27> Moline, Judi; Benigni, Dan; Baronas, Jean (eds). Proceedings of the Hypertext Standardization Workshop [January 16-18, 1990 National Institute of Standards and Technology, Gaithersburg, MD]. NIST Sepcial Publication 500-178, March, 1990. CODEN: NSPUE2. Several papers in this proceedings volume reference SGML, HyTime and SMDL as potentially valuable in creating hypertext/hypermedia standards. Reports from the workshop's Data Interchange Group and User Requirements Discussion Groups likewise identified SGML or SGML-like GIs as having probable priority in emerging standards formulations. <28> Mumford, Anne (editor). Document Exchange: The Use of SGML in the UK Academic and Research Community. Workshop Proceedings 5- 7 March 1990. Advisory Group on Computer Graphics, 1990. This proceedings volume contains several important contributions on SGML (submitted by Anne Mumford, Paul Ellison, Martin Bryan, Angella Scheller, David Duce and Ruth Kidd, Tim Niblett, Lou Burnard, John Larmouth, Paul Bacsich and Paul Lefrere, Malcolm Clark, and Kathleen Crennell). The volume is available from the organizer: Ann M. Mumford, Computer Centre, Loughborough University, Loughborough LE11 3TU, UNITED KINGDOM; TEL: 44 509 222312; FAX: 44 392 211603. See a full list of contributors and presentation-titles in "Document Exchange in UK Universities," SGML Users' Group Newsletter 17 (August 1990) 10. <29> *Naggum, Erik. "Answers to Frequently-Asked-Questions (FAQs) - for the UseNet Newsgroup comp.text.sgml." A draft version (0.0, 1991-12-15) is available via anonymous FTP as ftp.ifi.uio.no:SGML/FAQ.0.0 (ftp.ifi.uio.no has the IP address 129.240.88.1 as of December 1991). The latest version of the FAQ document may be fetched at any time from this public disk region, generously sponsored by The University of Oslo, Department of Informatics with oversight by Erik Naggum. The FAQ will also be found on servers which archive collections of FAQs. Suggestions for additional questions (or answers) to be included in the FAQ may be directed to the author: Erik Naggum; Naggum Software; Boks 1570, Vika; 0118 OSLO, NORWAY; Email: erik@naggum.no OR enag@ifi.uio.no on the Internet. <30> Price, Lynne A. "Graphic Representation of Content Models." 10 (July 1989) 12-16. The article demonstrates the use of tree structures and (more extensively) FSAs to represent SGML content models. FSAs are useful in revealing ambiguity (seemingly equivalent models). The article is derived from the author's tutorial session at the ACM Conference on Document Processing Systems, Santa Fe, New Mexico (5-9 December 1988). <31> Price, Lynne A. "Using SGML and TeX for User Documentation." In TEXniques No. 7: Proceedings, TeX User's Group 1988 Annual Meeting (21-24 August 1988, Montreal). Pages 203-210. TeX User's Group, 1988. Abstract: The Standard Generalized Markup Language (SGML), defined in International Standard (ISO) 8879, is a notation for representing documents and making their inherent structure explicit. The open-ended list of SGML applications includes document interchange, formatting or typesetting, loading databases for information retrieval, stylistic or linguistic analysis, and computer-aided translation. The combination of SGML and TeX is a natural one. This paper reviews the philosophy of SGML and then describes a particular environment where SGML and TeX are used together, giving specific examples of how processing is shared between the SGML application and TeX macros. <32> Price, Lynne A.; Schneider, Joe. "Evolution of an SGML Parser Generator." In Proceedings of the ACM Conference on Document Processing Systems, Santa Fe, 5-9 December 1988. Pages 51-60. New York: Association for Computing Machinery, 1988. Abstract: The Standard Generalized Markup Language (SGML) is a notation for describing classes of structured documents and for coding documents belonging to described classes. An advantage of SGML and other grammar-based document representations is the ability to perform multiple applications on a single document source file. This paper describes the evolution of a software development tool for implementing such applications. It explains the original design as well as enhancements made during the system's first eighteen months. Although not statistically significant, data on the use of the enhanced features are presented. The experience described is relevant to other software engineering tools for text processing. <33> Raymond, Darrell R. "Lector - An Interactive Formatter for Tagged Text." Technical Report OED-90-02. University of Waterloo Centre for the New Oxford English Dictionary and Text Research, August, 1990. 26 pages, 13 figures. Abstract: Lector is an X.11 aplication that provides highly interactive text formatting. Unlike text previewers, Lector handles descriptively marked-up text, supports multiple styles, and interacts well with other programs, including other invocations of Lector. Appropriate selection of texts and styles enables Lector to act as a text previewer, database browser, code prettyprinter, menu utility, and iconic interface. Lector's implementation revolves around a set of tradeoffs involving efficiency, simplicity and generality. The result demonstrated the utility of generalized text display tools. Note: for further details on the Waterloo Centre, see <19>. <34> Rubinsky, Yuri. "Standards for Hypertext Interchange." SGML Users' Group Newsletter 15 (January 1990) 14-15. For more on SGML applied to hypertext/hypermedia, see <109>, <72> and: (1) Yuri Rubinsky, "Standards for Hypertext Interchange Need Not Come out of Thin Air," 11 (October 1989) 4-5; (2) Yuri Rubinsky, "Comments on an SGML Application for Hyper- and Multi- Media Interchange: Informal Report from the GCA Hypertext/Hypermedia Standards Forum," 11 (October 1989) 5-6. <35> Scheller, Angela. "Document Standards: Availability and Products." Computer Networks and ISDN Systems 16/1-2 (September 1988) 138-142. CODEN: CNISE9. ISSN: 0169-7552. Abstract: With the growth in the spread of computer networks the demand by users for document interchange features is becoming increasingly apparent. The prerequirement for the realization of document interchange in a heterogeneous computer environment are internationally accepted standards for the description of documents. Already in early 1986, the Standard Generalized Markup Language SGML was published as an international standard for the structuring of documents. The publication of the Office Document Architecture ODA is expected in the course of 1988. The final text is already available. ODA was originally developed for the pure office environment, whereas the concept for SGML addressed the author/publisher environment. This fact is mirrored in the current pilot projects testing the standards: the manufacturers of office and word-processing systems mainly work with ODA, whereas in the technical scientific and publishing sectors SGML is often implemented. Users requiring an interface both to the office sector as well as to the publishing sector will therefore be confronted with the problems related to working with two different, only partially compatible standards. <36> SGML Users' Group. "A Brief History of the Development of SGML." 3-June-1989. 2 pages. Printed as a separate document, and in the SGML Users' Group Newsletter 14 (October 1989) 6-7, and (being free of copyright restrictions) elsewhere, (1) The SGML Handbook, <49>, Appendix A: pp. 567-570; (2) The SGML Source Guide, <134>. <37> Smith, Joan M. "The Standard Generalized Markup Language (SGML) for Humanities Publishing." Literary and Linguistic Computing 2/3 (1987) 171-175. ISSN: 0268-1145. Abstract: a new methodology, and the core of which is generic coding, has been developed within the International Organization for Standardization (ISO). This is known as the Standard Generalized Markup Language (SGML). Using SGML, the elements of a document are marked up as to their role, be it a paragraph, an abstract, a note, or whatever; the style of presentation is a separate issue and is not addressed by SGML. These elements can form part of a data base, which can be updated at will. So there is the notion of data base publishing. The Standard Generalized Markup Language is presented as a tool for full-text data base publishing, where the options for output are open, an example being given as a marked up document. Its value for all aspects of humanities publishing is addressed: whether for scholarly papers intended for a journal, books, specialist publications, dictionaries, or biographies, indeed whatever is input to an electronic medium with the intention of being imaged subsequently in some form; whether alone, in part, or in combination with other text. SGML represents an advance in publishing methodology, taking advantage of developing technology. It can be exploited as such in an academic environment to give an added dimension to research publications. <38> Smith, Joan M. "Standard Generalized Markup Language and Related Standards." Computing Communications 12/2 (April 1989) 80-84. <39> *SoftQuad, Inc. The SGML Primer. SoftQuad's Quick Reference Guide to the Essentials of the Standard: The SGML Needed for Reading a DTD and Marked-up Documents and Discussing them Reasonably. Version 2.0. Toronto: SoftQuad Inc., May 1991. 36 pages. Available from SoftQuad Inc.; 56 Aberfoyle Crescent, Suite 810; Toronto, Ontario; Canada M8X 2W4; TEL: +1 (416) 239- 4801; FAX: +1 (416) 239-7105. <40> Software Exoterica. "Understanding the SGML Declaration." Release 2.0. Ottawa: Software Exoterica Corporation, February 19, 1991. Technical Report ECM03-0291. iv + 34 pages. Available from: Software Exoterica Corporation; 383 Parkdale Avenue, Suite 406; Ottawa, Ontario; CANADA K1Y 4R4; TEL: +1 (613) 722-1700; TEL: +1 (800) 565-XGML; FAX: +1 (613) 722-5706. <41> Tompa, Frank W. "What is (Tagged) Text?" In Dictionaries in the Electronic Age: Proceedings of the Fifth Annual Conference of the UW Centre for the New Oxford English Dictionary (18-19 September 1989, St. Catherine's College, Oxford). Volume 2. Pages 81-93. Waterloo, Ontario: UW Centre for the New OED, 1989. Note: for further details on the Waterloo Centre, see <19>. <42> *"Use of SGML Markup." Chapter 2 (pp. 9-38) in Guidelines for the Encoding and Interchange of Machine-Readable Texts (Text Encoding Initiative, Draft Version 1.0). See <52> in section 3 below. <43> Vooren, Ludo van. "Implementing SGML: Where Do You Start?" 13 (February 1990) 5-7. This contribution proposes implementing SGML in several stages: Document Analysis, Process Design, Document Type Declaration Writing, Document Preparation. Also published in SGML Users' Group Newsletter 17 (August 1990) 5-7. <44> Walter, Mark. "OSU's Chameleon Architecture: A Grammatical Approach to Translation and DTDs." Seybold Report on Publishing Systems 20/7 (December 24, 1990) 17-23. Describes the approach taken by the Chameleon Research Group at the Department of Computer and Information Science at Ohio State University in building SGML translators and DTDs. See more on Chameleon sub <26>. <45> Warmer, Jos; Egmond, Sylvia van. "The Implementation of the Amsterdam SGML Parser." Electronic Publishing: Origination, Dissemination and Design (EPOdd) 2/2 (July 1989) 65-90. ISSN: 0894-3982. Abstract: The Standard Generalized Markup Language (SGML) is an ISO Standard that specifies a language for document representation. This paper gives a short introduction to SGML and describes the (Vrije Universiteit) Amsterdam SGML Parser and the problems we encountered in implementing the Standard. These problems include the interpretation of the Standard in places where it is ambiguous and the technical problems in parsing SGML documents. Note: the parser is available electronically via anonymous-FTP; see <123>. <46> Wu, Gilbert. SGML Theory and Practice. British Library Research Paper 68. British Library Research and Development Department, 1989. ISSN: 0269-9257 No. 68; ISBN 0-7123-3211-1. 93 pages. =========================================================== SGML MANUALS: COMMENTARY AND INDICES FOR ISO 8879 =========================================================== *All five volumes in this section are available for purchase from the GCA collection "Information Technologies Publications and Resources," as well as from the named publishers or distributors. GCA addresses are given in entry <105>. Publications <48> and <51> are also available from the SGML Users' Group Bookstore and may be ordered from the SGMLUG Secretary; see <106>. <47> Bingham, Harvey W. SGML Syntax Summary. Cambridge, MA: Interleaf, 2-June-1988. 46 pages. The document supplies cross- reference information which is not given or optimally accessible in the ISO 8879 standard itself. The syntax summary covers the primary ISO document (8879), Amendment 1 (Fall 1987) and Amendment 1, Corrections (May 1988). Copies of the syntax summary were mailed to subscribers of with issue 1/4 (1988). Updates are (were?) available from Interleaf. <48> Bryan, Martin. SGML: An Author's Guide to the Standard Generalized Markup Language. Wokingham/Reading/New York: Addison-Wesley, 1988. ISBN: 0-201-17535-5 (pbk); LC CALL NO: QA76.73.S44 B79 1988. 380 pages. A highly detailed and useful manual explaining and illustrating features of ISO 8879. The book: (1) shows how to analyse the inherent structure of a document; (2) illustrates a wide variety of markup tags; (3) shows how to design your own tag set; (4) is copiously illustrated with practical examples; (5) covers the full range of SGML features. Technical and non-technical authors, publishers, typesetters and users of desktop publishing systems will find this book a valuable tutorial on the use of SGML and a comprehensive reference to the standard. It assumes no prior knowledge of computing or typography on the part of its readers. <49> Goldfarb, Charles F. The SGML Handbook. Edited and with a foreword by Yuri Rubinsky. Oxford: Oxford University Press, 1990. ISBN: 0-19-853737-1. 688 pages. This volume contains the full annotated text of ISO 8879 (with amendments), authored by IBM Senior Systems Analyst and acknowledged "father of SGML," Charles Goldfarb. The book was itself produced from SGML input using a DTD which is a variation of the "ISO.general" sample DTD included in the annexes to ISO 8879. The SGML Handbook includes: (1) the up-to-date amended full text of ISO 8879, extensively annotated, cross-referenced, and indexed (2) a detailed structured overview of SGML, covering every concept (3) additional tutorial and reference material (4) a unique "push- button access system" that provides paper hypertext links between the standard, annotations, overview, and tutorials. <50> Herwijnen, Eric van. Practical SGML. Dordrecht/Hingham, MA: Wolters Kluwer Academic Publishers. 200 pages. ISBN: 0-7923- 0635-X. The book is designed as a "practical SGML survival-kit for SGML users (especially authors) rather than developers," and itself constitutes an experiment in SGML publishing. The book provides a practical and painless introduction to the essentials of SGML, and an overview of some SGML applications. See the reviews by (1) Carol Van Ess-Dykema in Computational Linguistics 17/1 (March 1991) 110-116, and (2) Deborah A. Lapeyre in 16 (October 1990) 12-14. <51> Smith, Joan M.; Stutely, Robert S. SGML: The Users' Guide to ISO 8879. Chichester/New York: Ellis Horwood/Halsted, 1988. 173 pages. ISBN: 0-7458-0221-4 (Ellis Horwood) and ISBN: 0-470- 21126-1 (Halsted). LC CALL NO: QA76.73.S44 S44 1988. The book (1) supplies a list of some 200 syntax productions, in numerical and alphabetical sequence; (2) gives a combined abbreviation list; (3) includes highly useful subject indices to ISO 8879 and its annexes (4) supplies graphic representations for the ISO 8879 character entities; (5) lists SGML keywords and reserved names. An overview of the book may be found in the SGML Users' Group Newsletter 9 (August 1988) 9. ================================================ SGML APPLICATIONS AND IMPLEMENTATIONS ================================================ <52> ACH/ACL/ALLC. Guidelines for the Encoding and Interchange of Machine Readable Texts. Edited by C. Michael Sperberg-McQueen and Lou Burnard. TEI-P1, Version 1.1 October 1990. xx + 289 pages. This volume represents the results of work in Phase I of the International Text Encoding Initiative, sponsored by ACH/ACL/ALLC and several advisory associations. The publication describes and illustrates mechanisms (some experimental) for SGML markup of many kinds of documents, especially for humanities fields (literary, linguistic, historical, philosophical studies). The TEI encoding standard is an SGML application. Contact the editors: in the US, Michael Sperberg- McQueen; BITNET: u35395@uicvm; Computer Center (M/C 135); University of Illinois at Chicago; Box 6998; Chicago, IL 60680; TEL: (312) 996-2981; in the UK, Lou Burnard; JANET: lou@vax.ox.ac.uk; Oxford University Computing Services; 13 Banbury Road; Oxford OX2 6NN; TEL: (44) 865-273238. Version note: Draft Version TEI-P2 is scheduled to appear January 1992, and Version 3 is to be produced at the end of the current TEI project, April-June, 1992. Summary descriptions of the TEI effort may be found in several ===============Cut Here: End Part 1 of 4 parts====================