HyperText Transfer format
This is a draft for internal discussion. TBD means "To be defined".
Comments please to timbl@vxcern.cern.ch. This copy was printed from the
source on .
Introduction
This document defines the text mark-up format used for network transfer
of hypertext infoirmation. The ASCII (ISO-XXXX) character set [ref] is
used, without extension.The format is marked up in SGML style, with
control information being introduced by a less than sign "<", followed
by an alphanumeric "tag". Following the tag, a number of attributes may
be specified, of the form
attribute_name = value
where the attribute name is alphanumeric, and the value is any printable
string not containing spaces.
Control information is terminated by a closing greater than ">" sign.
We refer to this mark up language as HyperText Markup Language, or HTML.
Default text
Unless otherwise defined by tags, text is transmitted as a stream of
lines. The division of the stream of characters into lines is arbitrary,
and only made in order to allow the text to be passed through systems
which can only handle text with a limited line length. The recommended
line length for transmission is 80 characters. The division into lines
has no significance (except in the case of the XMP and PLAINTEXT tags -
see below).
The names of tags and attributes are not case sensitive: they may be in
lower, upper, or mixed case with exactly the same meaning. (In this
document they are gemerally represented in upper case.)
Tags
It is recommended that a parser ignores those tags which it does not
recognize. A list of tags currently defined follows: