Economics Network CHEER Virtual Edition

Volume 8, Issue 3, 1994

Setting up a World Wide Web Server

Introduction

The CTI Centres, along with all the 20 or so other CTI Centres were charged earlier this year with setting up either a World Wide Web or Gopher server. By far the majority of Centres have chosen to make available WWW servers to their constituencies. The overwhelming reason for this is almost certainly the unique flexibility provided by the WWW to enable users to transfer text, image (still and video) and sound files in a variety of user friendly ways.

This paper is aimed primarily at the Internet/World Wide Web novice who wishes to understand better how the WWW works and would like to know how to create HTML files, either to enhance their own information searching effectiveness or as a means of becoming an effective information provider on the Web. It describes the tools currently available for distribution or personal use. The paper briefly introduces the World Wide Web client/server architecture and discusses in more detail the Hypertext Markup Language (HTML) used to produce source documents and the linkages that constitute the Net. Sufficient additional reference material is provided to enable the reader to collate all necessary data in order rapidly to become proficient in the creation of HTML documents.

Essential Jargon

No self respecting discipline is without its quota of jargon - computing is no exception - and the Internet is replete with buzz words. Here are some of the more important from our perspective.

World Wide Web
Often referred to simply as: Web, W3 or WWW.
Client/server
Software which needs to be present on the host - the Server, and also on the searching computer - the Client.
Browser
Often used interchangeably when referring to client software such as NCSA Mosaic or CELLO.
NCSA
National Center for Supercomputing Applications (USA) - the development home of the Mosaic client software.
CERN
The Physics Laboratories in Switzerland - the spiritual home of the Web (it was born here).
SGML
Standard Generalised Markup Language. This is the international standard for describing markup languages.
DTD
Document Type Definition. A specific markup language written using SGML.
HTML
HyperText Markup Language. HTML is an SGML DTD. In practical terms HTML is a collection of styles used to define the various components of a World Wide Web document.
TCP/IP
Transfer Control Protocols/Internet Protocols. The WWW client software runs on the Internet under TCP/IP control.
URL
Uniform Resource Locator. A standard format for identifying a resource location on a server.
HTTP
HyperText Transfer Protocol. Defines the server, path and file to be accessed.

WWW Client Server Architecture

There are specific WWW Server programs that are available. These are well documented on the Web either from NCSA or CERN and beyond the scope of this paper. The Client architecture is rather different. There are several ways in which users can access the Web. Typically this would be via either a text client such as Lynx (this enables text information held on the Web to be accessed using VT100 terminals and while useful misses out on the graphical power of the Web) or a graphical client such as Cello or the NCSA Mosaic. Both of these graphical browsers are available via the Internet as freeware (downlaod using PC/NFS) for system X, MS Windows and Macintosh. They provide access to graphical information as well as the ability to display text in a configurable format (fonts, styles and sizes). Figure 1 shows a typical MS Windows Mosaic interface.


Figure 1

At the CTI Centre for Economics we extensively use the MS Windows version of the NCSA Mosaic client software. The version shown in figure 1 is the 2.0 Alpha 2a 16 bit MS Windows version. You should be aware that this software is still very much under development. There is also a 32 bit version available for those with suitable hardware.

The other browser, CELLO, has a much simpler interface - see figure 2.


Figure 2

Tools for Creating HTML Documents

There are a number of authoring tools available for use with MS Windows and as add on templates for wordprocessing packages (Word for Windows Version 2.0 and 6.0) to aid the creation of HTML documents. We shall take a look at two of these. First, we will briefly consider ANT_HTML.DOT, a template add in for MS Word for Windows 6.0. For those who have access to Winword 6.0 this template is well worth acquiring. Again it is currently available as freeware. Further details can be obtained from the author:

Jill Swift jswift@freenet.fsu.edu
.

When implemented it looks as shown in figure 3.

Figure 3
Secondly we will look at and work with HTML Assistant Version 1.0 (see figures 4 and 5). This version is stable but subject to some limitations, most notably it cannot handle files larger than 32Kbytes. This may not be a problem in most cases.

Figure 4

Figure 5
There is a revised Beta 1.3 version currently available and this has some enhancements. However the file size restriction of 32Kb remains and the interface has changed (see figures 6 and 7.)

Figure 6

Figure 7

Creating HTML Documents

We will outline the use of HTML Assistant to create some simple documents with URL links to other servers and describe the function and use of Tags. You can see from the HTML Assistant Inteface in figure 8 that much of the tagging can apparently be done at the click of the mouse button. This is true but we do need to conform to a number of conventions.

Figure 8

Figure 9
There are two basic approaches to the construction of a simple HTML document. You can either create the document directly using HTML Assistant, or you can create a simple document in Notepad or a wordprocessor (saved as a txt file) and then tag it with HTML Assistant. The structural HTML elements in every tagged document are identified by start and end tags (in most cases!) as shown in figure 9.

The new paragraph indicator < P > is an exception under current HTML conventions and is acceptable under HTML+ conventions. The <TITLE> tag indicates the title of the document and will be displayed by the browser in the title box at the head of the document. The tag < H1 > defines the first level of header, and the browsers will display up to six levels of headers. The <EM> tags are used to emphasise text.

While this represents the bare bones of an HTML document and might be correctly interpreted by most browsers, there are some additional tags which need to be added to ensure that the document is interpreted correctly 100% of the time. These are <HTML>, <HEAD> and <BODY>. They ensure that the spiders (or robots) used for indexing and auditing function correctly and efficiently and hence that the document is correctly interpreted to the client browser. Figure 10 shows that by adding these tags and including a link to a server a simple but complete HTML document results.

Figure 10
By now you should have grasped the essential features of HTML tagging. Using HTML Assistant a good strategy is to creaste the text first and then to apply the tags. Select the text with the mouse by clicking and dragging in the usual way and then click on the appropriate buttons on the HTML Assistant Toolbar. Remeber, if you plan to edit or revise your source documents or are keen to make them as readable as possible then use plenty of spacing and lines to enhance the readability. All the blank space and line feeds are ignored by browsers when the file is interpreted.

One curiosity of HTML Assistant Version 1.0 is that (unlike version 1.3 Beta) it does not include tags for , or . You do have the facility to add these to the "User Tools" button as custom tools if it is your intention to create full HTML documents for use on a server.

Mike Emslie

CTI Centre for Economics, University of Bristol

Notes: (i) This is a shortened and edited version of a paper presented at the CALECO conference at Portsmouth in September 1994. (ii) As from this issue of CHEER the full text and graphics content of CHEER will be mounted on the Bristol server for access via the Web using NCSA Mosaic or similar client software.

Top | CHEER Home

Copyright 1989-2007