Mod 2
Mod 2
• The world wide web (‘www’ or ‘web’ for short) is a collection of webpages found on this
global network of computers. Your web browser uses the internet to access the world
wide web.
• The world wide web or web is a software that makes it easy and possible for nearly
anyone to publish and browse hypertext documents on the internet.
• You can think of the web as encompassing all of the information available through web
browsers. The web has developed a huge following because of its ease of use and visual
appeal. It is called a web because the interconnection between documents resembles a
spider’s web.
• The world wide web also referred to as www or w3 or simply “the web is the universe of
information available via hypertext transfer protocol (HTTP).
• The largest networked collection of linked documents is known as the world wide web.
• There are 3 components of the web: Uniform Resource Locator (URL): URL serves as a
system for resources on the web. Hyper Text Transfer Protocol (HTTP): HTTP specifies
communication of browser and server. Hyper Text Markup Language (HTML): HTML
defines the structure, organization and content of a web page.
HTTP
• HTTP (Hypertext Transfer Protocol) is a fundamental protocol of the Internet, enabling
the transfer of data between a client and a server. It is the foundation of data
communication for the World Wide Web.
• HTTP provides a standard between a web browser and a web server to establish
communication. It is a set of rules for transferring data from one computer to another.
Data such as text, images, and other multimedia files are shared on the World Wide
Web. Whenever a web user opens their web browser, the user indirectly uses HTTP. It is
an application protocol that is used for distributed, collaborative, hypermedia
information systems.
• HTTP is a network protocol used to deliver virtually all files and other data (collectively
called resources) on the www, whether they are html files, image files, query results, or
anything else.
WEB BROWSER
• A web browser is defined as a software application, a typical HTTP client that helps the
internet visitor to interpret the html documents and display the content from web
servers or in file systems. A browser is the most commonly used kind of user agent.
• Chrome, internet explorer, Firefox.
• Web browsers communicate with web servers primarily using HTTP to fetch webpages.
HTTP allows web browsers to submit information to web servers as well as fetch
webpages from them.
• Webpages are located by means of a URL, which is treated as an address, beginning with
HTTP: for HTTP access.
BOOKMARKS
DIRECTORIES, SEARCH ENGINES AND META SEARCH ENGINES
DIRECTORIES
The method of finding and organizing web information is the directory approach. A directory
offers a hierarchical representation of hyperlinks to web pages and presentations broken down
into topics and subtopics.
SEARCH ENGINES
META SEARCH ENGINES
SEARCH ENGINE WORKING
1. User interface
• The user interface must provide a mechanism by which a user can submit queries to the
search engines.
• This is universally done using forms.
• The user interface should be friendly and visually appealing. Hypertexts are present to
navigate to different pages.
• Finally, the user interface needs to display the results of the search in a convenient way.
2. Searcher
• The searcher is a program that uses the search engine’s index and database to see if any
matches can be found for the query.
• Your query must be first be transformed into a syntax that the searcher can process.
• Since the database associated with search engines are extremely large, a highly efficient
search strategy must be applied.
3. Evaluator
4. Gatherer
SEARCH STRATEGIES
When you post queries to the search engine, two common situations can occur, either your
query does not turn up a sufficient no. of hits, or your query turns up too many hits. Two
strategies to deal with these are::
1. Search Generalization:
2. Search Specialization