The eXtensible Markup Language (XML)

XML Infoset

The XML Information Set (Infoset)  (Candidate Recommendation 14 May 2001) defines an abstract data set in order to provide consistent definitions for use in other XML specifications.  The Infoset has 11
information items:

Document
Element
Attribute
Processing Instruction
Unexpanded Entity
Character
Comment
Document Type Declaration
Unparsed Entity
Notatioin
Namespace
A well-formed XML document contains at least a document information item and may contain other items.  All other information items are accessible from the document information item, either directly or indirectly.  The document information item has 9 defined properties:
Children - Ordered list of child information items, with exactly one element information item.  Other children may be processing instructions,  comments, and docuemnt type declaration.

Document element - The element information item corresponding to the document element.

Notations - Declared in DTD (discussed in the next lecture).

Uneparsed Entities - Placeholders for unexpanded entities.

Base URI - The base URI of the document entity.

Character Encoding Scheme - Document's character encoding scheme .

Standalone - Given in the XML Declaration as yes or no.

Version - A string from the XML Declaration.

All Declarations Processed - An indication of whether the parser has completed the DTD, true or false.

The other elements are similarly described in the textbook and the W3C website.