What is Markup ?

Markup is to identify parts of document for display or to provide meaning to the content of a document. Here the document is different from the traditional document, which means we are referring to electronic documents. Here the electronic document with markup is made of elements, not just the files stored in the disk as bits and bytes. We will come back to this topic later.

Markup Languages

There are many languages that uses markups and called the markup languages. Here is a short list,

  • HTML – Hypertext Markup Language
  • KML – Keyhole Markup Language
  • MathML – Mathematical Markup Language
  • SGML – Standard Generalized Markup Language
  • XHTML – eXtensible Hypertext Markup Language
  • XML – eXtensible Markup Language

Note that each language ends up with ML which means “markup language”. The XML is not a markup language only, in fact, it has rules for creating markup languages. The markup is like a symbol and markup language is a set of symbols that are placed inside of a document to demarcate and label parts of the document.

Why use Markups?

Markup tells the computer how to handle a document, else the computer needs to scan the full document.

Consider this example,

"In India, I went to India Gate".

There are two references to the word – India. The first one is talking about a place, and the second, talking about a historical monument. Only markup can help identity the proper word with context.

Imagine a file full of information like this, the computer needs to scan the full document to get to the right information without markup.

Let us take a look at the document with some markup text in XML.

<street>Seasane street</street>
<city>Chennai <emphasis>600034</emphasis> </city>
<paragraph>The country information is not required.</paragraph>
 <graphic fileref="areamap.pict"/></paragraph> 

Let us try to understand the structure of markup document. The elements <address> </address> makes up a tag. There are two parts for every tag – markups and the content.

Address tag

The address tag marks the start and end of the document. It is nesting all other tags within itself,

Emphasis tag

Emphasis is similar to textual document emphasis; it decorates the inline text within a document. In a line of text, the emphasis changes a text to appear little italicized.

Paragraph tag

The paragraph tag takes content that requires lot of space like a paragraph of text or other information.

Graphics tag

The graphics tag embeds pictures in the document.

Marks in XML documents

What role does markup play inside an XML document?

The markup set the boundaries of the document, defines the content, set regions within document, and link other type of contents to the XML documents.

Set Boundaries

The < address > tag mark the boundaries of the collection of text which is address information and its label is “address”. There is a start and end for the document.

Set Regions

What is the region of text doing inside document? is a textual paragraph, not list, picture, etc. It is possible to set different types of regions inside a document.

Position Elements

If you look carefully, the address information and its elements like door no, building, street, and city are positioned properly in an order. The information is displayed in that order. Regions of text has positions. comes before, so it will show or printed that way.

Nesting of Elements

<emphasis> is under <city> which is under <address>. The XML processor will be treating this “Nesting” and the content differently, depending on where it appears. Title may be bold if there is one. The emphasis is italicized.


Text can used to link to a resource. link to a picture from XML fragment to show the picture in the document. We used the <graphics> tag.


In this lesson, you learned about markup and markup languages. What are the features of a markup text and how the document is made of elements. The advantage of markup languages documents is wide and its range of application is also huge.

We will talk about of these in future lessons.