ISR11
Scientific Report No. ISR-11 Information Storage and Retrieval
Operating Instructions for the SMART Text Processing and Document Retrieval System
chapter
M. E. Lesk
Harvard University
Gerard Salton
Use, reproduction, or publication, in whole or in part, is permitted for any purpose of the United States Government.
11-27
in column 1, SM[OCRerr]T and all auxiliary programs ignore their
contents, except that the first five (or fewer) such cards
are collected as BCD identification. Usually, this consists
of the title, author, and publication source of the document.
The text proper is assumed to begin on the first card without
a $ in column 1, and any further cards with $ in column 1 are
treated as normal text. The identification may be Omitted;
in this case, the first card of the text `mist not have a $
in column 1.
d) The convention used to hyphenate a word between two cards is
to place a minus sign (Il-punch) followed by a blank at the
end of the first part of the word. SM[OCRerr]T ignores the
remainder of the card, throws away the minus sign, and
continues from column 1 of the next card. For exBmple, hyphen-
ateis properly hyphenated. Note that this rule makes the
following construction illegal: `tsub- and super-scripts." Also,
note that normally hyphenated words which are broken between
two lines at the hyphen [OCRerr]List have a double minus sign to
be properly recognized by the input programs; thus, Runge--
Kutta. Hyphenated words in the middle of a line are typed
normaliy; thus, Runge-Kutta.
e) Hardware restrictions require that only upper-case letters be
used. Special characters are treated as follows:
? QUE (preceded by a space),
EP (preceded by a space),
I (for both open and close& quotes. There should be
no space before a close quote or after an open
quote; similarly, parentheses and commas are spaced
normally),
-DASH (Il-punch minus sign followed by the word DASH,
if a dash is meant; for hyphens see d) above),
`(8-4 punch, as in IBM Scientific character set H),
,. (not preceded by space),
(preceded by space, and ends a sentence).
Any other conventions could also be used, if the dictionaries were