SGML-based publishing, especially tagging of data element, is a time-consuming work. Elsevier that publishes more than 1000 journals, extends tagging of title page data to all journals, however selects strictly journals in tagging of full-text data. If a SGML-oriented and author-friendly electronic submission is developed for an academic article, it will be an efficient method to publish an academic journal. Massachusetts Medical Society [6] adopted an electronic submission with WordPerfect in publishing SGML-based textbook: "The AIDS Knowledge Base". The style of the electronic manuscript is constructed a tagged format, such as a title is followed by "@T=". Y. Tanaka and I adopted a similar tagged format in the case of JJSIK in 1990. However, such a format is not easy to an author.
One of solutions of that problem is RTF-to-SGML conversion. Here, RTF is an exchange format for word processing text data, and is supported by popular word processing software: Microsoft Word, WordPerfect, PageMaker, etc. However several RTF-to-SGML converter tools[7] exist, Y. Tanaka, I and T. Ito [8] developed RTF-to-SGML converter program written in AWK, since self-developed program is more flexible than a converter tool, and since flexibility is suitable for a writing style of an academic article. We also adopted a style function available in Microsoft Word, WordPerfect, PageMaker, and successfully applied to electronic submission to BCSJ in 1993. I will report this electronic submission method in detail.
I will also introduce that this method will be included in one of Japanese standards: SIST (Standards for Information of Science and Technology) [9] No. 14 (draft) "Guideline for electronic submission", which is now considered in SIST Committee, and will be published in near future.
Ito, Tanaka and I adopted style function (STYLE) and RTF combination on Windows PC or Macintosh in the case of BCSJ. Here, STYLE specifies a style name to a corresponding data element, such as section-title, and its parameters, such as, centering, font, and its size, etc. RTF can include information of STYLE. Only ASCII character or character-string is used in RTF to indicate any control command, such as font and its size and so on, and style-name, start-tag-of-the-style, end-tag-of-the-style.
The reasons of adopting STYLE and RTF are as follows; (1) STYLE and RTF are available in a popular word processing software, such as Microsoft Word on Windows/Macintosh, WordPerfect, PageMaker, etc., (2) STYLE serves easy assignment of data element by a contributor, and supports layout functions, such as centering, character size, etc., (3) while STYLE does not show any control characters on a display or a printed page, RTF including STYLE specifies data corresponding to SGML data element-name, start-tag, and end-tag (4) RTF text data including STYLE may automatically convert to SGML text data.
Table 1 shows a list of styles with display format of electronic manuscript for submission to BCSJ. Figure 1 shows a sample of an electronic manuscript written w ith this method. Style: Synopsis is selected on pull down menu of style in Fig. 1. A contributor may use bold, italic, superscript, subscript. He/she may type Greek alphabet using 'symbol font'. A special character is typed using a corresponding entity name defined in the DTD (Document Type Definition) for BCSJ; for example, 'right double arrow' is typed as '⇒'. The DTD includes entity name sets for special characters defined in ISO 8879 Annex and ISO/IEC TR 9573-13. Referring to a bibliographic reference is indicated superscript number with a parenthesis, such as 1). Referring to a table or a figure is indicated underlined character string like "... Table 1 shows ...". Bodies of a figure, a table, and a complicated equation are not included in this electronic submission yet, because of incompatibility among word processing software.
style name | data element | display format |
Category | category of article | 12 p(point) |
Running title | running title | 14 p, centering |
Title | article title | 14 p, centering, bold |
Sub title | sub title | 12 p, centering, bold |
Author | author's name | 12 p |
Address | affiliation and address | 12 p, centering |
Received | received date | 12 p, centering |
Synopsis | synopsis | 12 p, decreased line width |
Normal | paragraph | 12 p, indent of 1st line |
Section title | section title | 12 p, centering, bold |
Subsection title | subsection title | 12 p, bold |
References | references | 12 p |
Table | table captions | 12 p |
Scheme | scheme captions | 12 p |
Figure | figure captions | 12 p |
Chart | chart captions | 12 p |
CI-Title | title for CI(contents with illustration) | 12 p |
CI-Author | author's name for CI | 12 p |
CI-Summary | summary for CI | 12 p |
Profile | author's profile (only for accounts) | 12 p |
Table 1 Styles used in electronic submission to Bulletin of the Chemical Society of Japan
Figure 1 Writing an electronic manuscript with style file on MS Word
Conversion program from RTF to SGML-text is written in AWK according to the follo wing methods.
Four methods are applicable to write an electronic manuscript; (A) STYLE and RTF combination, (B) using macro function of word processing software, (C) using some special character sequence as if the case of "The AIDS Knowledge Base" published by Massachusetts Medical Society, and (D) LaTeX. Method A and B are easier to a contributor than method C and D. Method A is easier to maintain a style than method B, because of method A using GUI while method B programming command macro for a word processing software. However, method B is more tunable to each type of electronic manuscript than method A. Method D is suitable for description a complicated mathematical equation.
Method A is most suitable to the members of CSJ, since most of them use Microsoft Word. CSJ have a plan of method D, because some members, who study physical chemistry, prefer LaTeX.
[2] H.Ishizuka, "The reception of SGML based electronic publishing by Japanese scientific community," Proc. of 47th FID (International Federation for Information and Documentation) Conference and Congress, pp.505-508 (in Omiya Japan, October 1994).
[3] H. Ishizuka, "Multimedia and publishing", Journal of Information Processing and Management (Joho Kanri), Vol.38, No.4, pp.353-368 (1995) (in Japanese); H. Ishizuka, "Electronic publishing: its concept and technology", Journal of The Institute of Electronics, Information and Communication Engineers (DenshiJohoTsushinGakkaiShi), Vol.78, No.9, pp.891-898(1995) (in Japanese).
[4] <URL:http://www.computer.org/epub/>
[5] Peter J. Denning; Bernard Rous, "The ACM Electronic Publishing Plan", Communication of the ACM, Vol.38,No.4,pp.97-103(1995)
[6] EPSIG news, Vol.1, No.1, (Sept. 1987).
[7] <http://www.nicetech.com/venprod.htm>
[8] H.Ishizuka, T.Ito, et al., "Generation of full-text database based on SGML through an electronic contribution by an author -- An experiment in Chemical Society of Japan," IPSJ (Information Processing Society of Japan) SIG-FI Note, No.35, pp.1-8 (1994) (in Japanese).
[9] Science and Technology Agency, "SIST hand-book," 3rd ed., Japan Information Center for Science and Technology, 1992, 441p. (in Japanese)
[10] T.Shida, T.Ito, "Recruit monitors to examine a style file to submit a paper for Bull. Chem. Soc. Jpn.", Chemistry & Chemical Industry (Kagaku to Kougyou), Vol.47, No.3, p.270 (1994) (in Japanese).
[11]<URL:http://wwwsoc.nacsis.ac.jp/csj/journals/BCSJ-style_file/manual.html>
[12]<URL:http://pubs.acs.org/instruct/jacsat.html>