Assignment 1: Name: Satyarthsinh Gohil Regno: 16BCE1125 Course: Internet and Web Programming Faculty: Sandhya.P Slot: E2
Assignment 1: Name: Satyarthsinh Gohil Regno: 16BCE1125 Course: Internet and Web Programming Faculty: Sandhya.P Slot: E2
2.Internet Overview
The Internet is a collection of computers connected by network cables or
through satellite links. Rather than connecting every computer on the Internet
with every other computer, individual computers in an organization are
normally connected in a local area network (LAN). One node on this local area
network is physically connected to the Internet. So the Internet is a network of
networks. There are millions of computing devices that are connected to this
network either permanently or for a short duration. These devices run network
applications that communicate through copper or fibre optic cables, radio or
satellite transmission. The communication is governed by protocols established
by an international body - Internet Engineering Task Force.
The Internet connectivity is provided by Internet Service Providers (ISP).
These corporations dedicate computers to act as servers - that is they make
information (such as Web pages or e-mail) available to users of the Internet.
One can look upon the structure of the Internet as having an edge and a
core. At the edge are host systems that run application or client programs or
provide a service through server programs. At the core of the Internet is a
network of routers.
3.Web Protocols
World Wide Web Protocols
Technically the World-Wide Web hinges on three enabling protocols, the
Hyper Text Mark-up Language (HTML) that specifies a simple mark-up
language for describing hypertext pages, the Hypertext Transfer Protocol
(HTTP) which is used by web browsers to communicate with web clients, and
Uniform Resource Locators (URL's) which are used to specify the links
between documents.
Hyper Text Mark-up Language
The hypertext pages on the web are all written using the Hypertext Mark-
up Language (HTML), a simple language consisting of a small number of tags
to delineate logical constructs within the text. Unlike a procedural language
such as postscript (move 1 inch to the right, 2 inches down, and create a green
WWW in 15 pointer bold Helvetica font), HTML deals with higher level
constructs such as "headings," "lists," "images," etc. This leaves individual
browsers free to format text in the most appropriate way for their particular
environment; for example, the same document can be viewed on a MAC, on a
PC, or on a line mode terminal, and while the content of the document remains
the same, the precise way in which it is displayed will vary between the
different environments.
The earliest version of HTML (subsequently labelled HTML1), was
deliberately kept very simple to make the task of browser developers easier.
Subsequent versions of HTML will allow more advanced features. HTML2
(approximately what most browsers support today) includes the ability to embed
images in documents, layout fill-in forms, and nest lists to arbitrary depths.
HTML3 (currently being defined) will allow still more advanced features such
as mathematical equations, tables, and figures with captions and flow-around
text.
Hypertext Transfer Protocol
Although most Web browsers are able to communicate using a variety of
protocols, such as FTP, Gopher and WAIS, the most common protocol in use on
the Web is that designed specifically for the WWW project, the Hype rText
Transfer Protocol. In order to give the fast response time needed for hypertext
applications, a very simple protocol which uses a single round trip between the
client and the server is used.
In the first phase of a HTTP transfer the browser sends a request for a
document to the server. Included in this request is the description of the
document being requested, as well as a list of document types that the browser
is capable of handling. The Multipurpose Internet Mail Extensions (MIME)
standard is used to specify the document types that the browser can handle,
typically a variety of video, audio, and image formats in addition to plain text
and HTML. The browser is able to specify weights for each document type, in
order to inform the server about the relative desirability of different document
types.
In response to a query the server returns the document to the browser using one
of the formats acceptable to the browser. If necessary, the server can translate
the document from the format it is stored in into a format acceptable to the
browser. For example, the server might have an image stored in the highly
compressed JPEG image format, and if a browser capable of displaying JPEG
images requests the image it would be returned in this format. However, if a
browser capable of displaying images only if they are in GIF format requested
the same document the server would be able to translate the image and return
the (larger) GIF image. This provides a way of introducing more sophisticated
document formats in future but still enabling older or less advanced browser to
access the same information.
In addition to the basic "GET" transaction described above the HTTP is also
able to support a number of other transaction types, such as "POST" for sending
the data for fill-out forms back to the server and "PUT" which might be used in
the future to allow authors to save modified versions of documents back to the
server.
TCP/IP
TCP (Transmission Control Protocol) and IP (Internet Protocol) are two
different procedures that are often linked together. The linking of several
protocols is common since the functions of different protocols can be
complementary so that together they carry out some complete task. The
combination of several protocols to carry out a particular task is often called a
"stack" because it has layers of operations. In fact, the term "TCP/IP" is
normally used to refer to a whole suite of protocols, each with different
functions. This suite of protocols is what carries out the basic operations of the
Web. TCP/IP is also used on many local area networks. The details of how the
Web works are beyond the scope of this article but I will briefly describe some
of the basics of this very important group of protocols. More details can be
found in the references in the last section.
When information is sent over the Internet, it is generally broken up into
smaller pieces or "packets". The use of packets facilitates speedy transmission
since different parts of a message can be sent by different routes and then
reassembled at the destination. It is also a safety measure to minimize the
chances of losing information in the transmission process. TCP is the means for
creating the packets, putting them back together in the correct order at the end,
and checking to make sure that no packets got lost in transmission. If necessary,
TCP will request that a packet be resent.
Internet Protocol (IP) is the method used to route information to the
proper address. Every computer on the Internet has to have its own unique
address known as the IP address. Every packet sent will contain an IP address
showing where it is supposed to go. A packet may go through a number of
computer routers before arriving at its final destination and IP controls the
process of getting everything to the designated computer. Note that IP does not
make physical connections between computers but relies on TCP for this
function. IP is also used in conjunction with other protocols that create
connections.
UDP and ICMP
Another member of the TCP/IP suite is User Datagram Protocol (UDP).
(A datagram is almost the same as a packet except that sometimes a packet will
contain more than one datagram.) This protocol is used together with IP when
small amounts of information are involved. It is simpler than TCP and lacks the
flow-control and error-recovery functions of TCP. Thus, it uses fewer system
resources.
A different type of protocol is Internet Control Message Protocol (ICMP).
It defines a small number of messages used for diagnostic and management
purposes. It is also used by Ping and Traceroute.
Mail Protocols POP3 and SMTP
Email requires its own set of protocols and there are a variety, both for
sending and for receiving mail. The most common protocol for sending mail is
Simple Mail Transfer Protocol (SMTP). When configuring email clients, an
Internet address for an SMTP server must be entered. The most common
protocol used by PCs for receiving mail is Post Office Protocol(POP). It is now
in version 3 so it is called POP3. Email clients require an address for a POP3
server before they can read mail. The SMTP and POP3 servers may or may not
be the same address. Both SMTP and POP3 use TCP for managing the
transmission and delivery of mail across the Internet.
A more powerful protocol for reading mail is Interactive Mail Access
Protocol (IMAP). This protocol allows for the reading of individual mailboxes
at a single account and is more common in business environments. IMAP also
uses TCP to manage the actual transmission of mail.
Web browsers are used by people to find and look at web sites on
the Internet. The first web browser was created in 1990. Many different web
browsers are available for free. All web browsers can go to websites but each
browser has good things and bad things about it. For example, some browsers
focus on data security and keeping computers safe from viruses. Other browsers
are made so that web pages appear on screen faster.
Internet Explorer
Google Chrome
Mozilla Firefox
Opera
Safari
Flock
Maxthon
Epiphany
Galeon
Lynx
Sleipnir
Web Servers
A Web server is a program that uses HTTP (Hypertext Transfer Protocol)
to serve the files that form Web pages to users, in response to their requests,
which are forwarded by their computers' HTTP clients. Dedicated computers
and appliances may be referred to as Web servers as well.
The process is an example of the client/server model. All computers that
host Web sites must have Web server programs. Leading Web servers include
Apache (the most widely-installed Web server), Microsoft's Internet
Information Server (IIS) and nginx (pronounced engine X) from NGNIX. Other
Web servers include Novell's NetWare server, Google Web Server (GWS) and
IBM's family of Domino servers.
Web servers often come as part of a larger package of Internet- and
intranet-related programs for serving email, downloading requests for File
Transfer Protocol (FTP) files, and building and publishing Web pages.
Considerations in choosing a Web server include how well it works with the
operating system and other servers, its ability to handle server-side
programming, security characteristics, and the particular publishing, search
engine and site building tools that come with it.
Because of the server-side processing, many Web sites are now organized
as three-tiered architectures consisting of a Web server, an application
server, and a database server.
User data comes from an HTML form, specifying the program and
parameters.
Server-side scripting technologies are used to generate dynamic content:
Microsoft: Active Server Pages (ASP.NET)
Sun: Java Server Pages (JSP)
Netscape: JavaScript
Free Software Foundation: PHP
8.URL
Addresses for web sites are called URLs (Uniform Resource Locators).
Most of them begin with http (HyperText Transfer Protocol), followed by a
colon and two slashes. For example, the URL for the Florida Center for
Instructional Technology is http://fcit.usf.edu/.
Some of the URL addresses include a directory path and a file name.
Consequently, the addresses can become quite long. For example, the URL of a
web page may be: http://fcit.usf.edu/holocaust/default.htm. In this example,
"default.htm" is the name of the file which is in a directory named "holocaust"
on the FCIT server at the University of South Florida.
9.Domain Name
An Internet address has four fields with numbers that are separated by
periods or dots. This type of address is known as an IP address. Rather than
have the user remember long strings of numbers, the Domain Name System
(DNS) was developed to translate the numerical addresses into words. For
example, the address fcit.usf.edu is really 131.247.120.10