SlideShare a Scribd company logo
2 December 2005
Web Technologies
Web Architectures
Prof. Beat Signer
Department of Computer Science
Vrije Universiteit Brussel
http://www.beatsigner.com
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 2October 6, 2017
Basic Client-Server Web Architecture
 Effect of typing http://www.vub.ac.be in the broswer bar
(1) use a Domain Name Service (DNS) to get the IP address for
www.vub.ac.be (answer 134.184.129.2)
(2) create a TCP connection to 134.184.129.2
(3) send an HTTP request message over the TCP connection
(4) visualise the received HTTP response message in the browser
Internet
Client Server
HTTP Request
HTTP Response
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 3October 6, 2017
Web Server
 Tasks of a web server
(1) setup connection
(2) receive and process
HTTP request
(3) fetch resource
(4) create and send
HTTP response
(5) logging
 The most prominent web servers are the Apache HTTP
Server and Microsoft's Internet Information Services (IIS)
 A lot of devices have an embedded web server
 printers, WLAN routers, TVs, ...
Worldwide Web Servers, http://news.netcraft.com
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 4October 6, 2017
Example HTTP Request Message
GET / HTTP/1.1
Host: www.vub.ac.be
User-Agent: Mozilla/5.0 (Windows NT 6.1; rv:24.0) Gecko/20100101
Firefox/24.0
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-gb,en;q=0.5
Accept-Encoding: gzip, deflate
Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7
Connection: keep-alive
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 5October 6, 2017
Example HTTP Response Message
HTTP/1.1 200 OK
Date: Thu, 03 Oct 2013 17:02:19 GMT
Server: Apache/2.2.14 (Ubuntu)
X-Powered-By: PHP/5.3.2-1ubuntu4.15
Content-Language: nl
Set-Cookie: lang=nl; path=/; domain=.vub.ac.be; expires=Mon, 18-Sep-2073
17:02:16 GMT
Content-Type: text/html; charset=utf-8
Keep-Alive: timeout=15, max=987
Connection: Keep-Alive
Transfer-Encoding: chunked
<!DOCTYPE html>
<html lang="nl" dir="ltr">
<head>
...
<title>Vrije Universiteit Brussel | Redelijk eigenzinnig</title>
<meta name="Description" content="Welkom aan de VUB" />
...
</html>
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 6October 6, 2017
HTTP Protocol
 Request/response communication model
 HTTP Request
 HTTP Response
 Communication always has to be initiated by the client
 Stateless protocol (no sessions)
 HTTP can be used on top of various reliable protocols
 TCP is by far the most commonly used one
 runs on TCP port 80 by default
 Latest version: HTTP/2.0 (May 2015)
 HTTPS scheme used for encrypted connections
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 7October 6, 2017
Uniform Resource Identifier (URI)
 A Uniform Resource Identifier (URI) uniquely
identifies a resource
 There are two types of URIs
 Uniform Resource Locator (URL)
- contains information about the exact location of a resource
- consists of a scheme, a host and the path (resource name)
- e.g. https://vub.academia.edu/BeatSigner
- problem: the URL changes if resource is moved!
• idea of Persistent Uniform Resource Locators (PURLs) [https://purl.oclc.org]
 Uniform Resource Name (URN)
- unique and location independent name for a resource
- consists of a scheme name, a namespace identifier and a namespace-specific
string (separated by colons)
- e.g. urn:ISBN:3837027139
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 8October 6, 2017
HTTP Message Format
 Request and response messages have the same format
<html>
...
</html>
HTTP/1.1 200 OK
Date: Thu, 03 Oct 2013 17:02:19 GMT
Server: Apache/2.2.14 (Ubuntu)
X-Powered-By: PHP/5.3.2-1ubuntu4.15
Transfer-Encoding: chunked
Content-Type: text/html
header field(s)
blank line (CRLF)
message body (optional)
start line
HTTP_message = start_line , {header} , "CRLF" , {body};
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 9October 6, 2017
HTTP Request Message
 Request-specific start line
 Methods
 GET : get a resource from the server
 HEAD : get the header only (no body)
 POST : send data (in the body) to the server
 PUT : store request body on server
 TRACE : get the "final" request (after it has potentially been modified by proxies)
 OPTIONS : get a list of methods supported by the server
 DELETE: delete a resource on the server
start_line = method, " " , resource , " " , version;
method = "GET" , "HEAD" , "POST" , "PUT" , "TRACE" ,
"OPTIONS" , "DELETE";
resource = complete_URL | path;
version = "HTTP/" , major_version, "." , minor_version;
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 10October 6, 2017
HTTP Response Message
 Response-specific start line
 Status codes
 100-199 : informational
 200-299 : success (e.g. 200 for 'OK')
 300-399 : redirection
 400-499 : client error (e.g. 404 for 'Not Found')
 500-599 : server error (e.g. 503 for 'Service Unavailable')
start_line = version , status_code , reason;
version = "HTTP/" , major_version, "." , minor_version;
status_code = digit , digit , digit;
reason = string_phrase;
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 11October 6, 2017
HTTP Header Fields
 There exist general headers (for requests and
responses), request headers, response headers, entity
headers and extension headers
 Some important headers
 Accept
- request header definining the Media Type that the client will accept
• formerly known as Multipurpose Internet Mail Extensions (MIME type)
 User-Agent
- request header specifying the type of client
 Keep-Alive (HTTP/1.0) and Persistent (HTTP/1.1)
- general header helping to improve the performance since otherwise a new
HTTP connection has to be established for every single webpage element
 Content-Type
- entity header specifing the body's MIME type
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 12October 6, 2017
HTTP Header Fields ...
 Some important headers ...
 If-Modified-Since
- request header that is used in combination with a GET request (conditional
GET); the resource is only returned if it has been modified since the specified
date
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 13October 6, 2017
Media Types
 The Media Type defines the request or response body's
content (used for appropiate processing)
 7 top-level media types
 Standard Media Types are registered with the Internet
Assigned Numbers Authority (IANA) [RFC-6838]
mediaType = toplevel_type , "/" , subtype;
Media Type Description
text/plain Human-readable text without formatting information
text/html HTML document
image/jpeg JPEG-encoded image
... ...
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 14October 6, 2017
HTTP Message Information
 Various tools for HTTP message logging
 e.g. HttpFox add-on for Firefox browser
 Simple telnet connection
 Until 1999 the W3C has been working on HTTP Next
Generation (HTTP-NG) as a replacement for HTTP/1.1
 never introduced
 recently HTTP/2.0 has been released
- inspired by Goggle’s development of SPDY
telnet wise.vub.ac.be 80 (press Enter)
GET /beat-signer HTTP/1.1 (press Enter)
Host: wise.vub.ac.be (press Enter 2 times)
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 15October 6, 2017
Proxies
 A web proxy is situated between the client and the server
 acts as a server to the client and as a client to the server
 can for example be specified in the browser settings; used for
- firewalls and content filters
- transcoding (on the fly transformation of HTTP message body)
- content router (e.g. select optimal server in content distribution networks)
- anonymous browsing, ...
Internet
Client Server
Proxy
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 16October 6, 2017
Caches
 A proxy cache is a special type of proxy server
 can reduce server load if multiple clients share the same cache
 often multi-level hierarchies of caches (e.g. continent, country
and regional level) with communication between sibling and
parent caches as defined by the Internet Cache Protocol (ICP)
 passive or active (prefetching) caches
Internet
Client 1
ServerProxy Cache
Client 2
1
2
12
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 17October 6, 2017
Caches ...
 Special HTTP cache control header fields
 Expires
- expiration date after which the cached resource has to be refetched
 Cache-Control: max-age
- maximum age of a document (in seconds) after it has been added to the cache
 Cache-Control: no-cache
- response cannot be directly served from the cache (has to be revalidated first)
 ...
 Validators
 Last-modified time as validator
- cache with resource that has been last modified at time t uses an
If-Modified-Since t request for updates
 Entity tags (ETag)
- changed by the publisher if content has changed; If-None-Match etag request
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 18October 6, 2017
Caches ...
 Advantages
 reduces latency and used network bandwidth
 reduces server load (client and reverse proxy caches)
 transparent to client and server
 Disadvantages
 additional resources (hardware) required
 might get stale data out of the cache
 creates additional network traffic if we use an active caching
approach (prefetching) but achieve a low cache hit rate
 server loses control (e.g. access statistics) since no longer all
requests have to be sent to the server
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 19October 6, 2017
Tunnels
 Implement one protocol on top of another protocol
 e.g. HTTP as a carrier for SSL connections
 Often used to "open" a firewall to protocols that would
otherwise be blocked
 e.g. tunneling of SSL connections through an open HTTP port
Internet
SSL Client SSL Server
SSL
HTTP
SSL
HTTP[SSL] HTTP[SSL]
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 20October 6, 2017
Gateways
 A gateway can act as a kind of "glue" between
applications (client) and resources (server)
 translate between two protocols (e.g. from HTTP to FTP)
 security accelerator (e.g. HTTPS/HTTP on the server side)
 often the gateway and destination server are combined in a single
application server (HTTP to server application translator)
Internet
HTTP Client FTP Server
HTTP/FTP
Gateway
HTTP
FTP
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 21October 6, 2017
Session Management
 HTTP is a stateless protocol
 Session (state) tracking solutions
 use of IP address
- problem: IP address is often not uniquely assigned to a single user
 browser login
- use of special HTTP authenticate headers
- after a login the browser sends the user information in each request
 URL rewriting
- add information to the URL in each request
 hidden form fields
- similar to URL rewriting but information can also be in body (POST request)
 cookies
- the server stores a piece of information on the client which is then sent back to
the server with each request
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 22October 6, 2017
Cookies
 Introduced by Netscape in June 1994
 A cookie is a piece of information that is
assigned to a client on their first visit
 list of <key,value> pairs
 often just a unique identifier
 sent via Set-Cookie or Set-Cookie2 HTTP response headers
 Browser stores the information in a "cookie database" and
sends it back every time the same server is accessed
 Potential privacy issues
 third-party websites might use persistent cookies for user tracking
 Cookies can be disabled in the browser settings
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 23October 6, 2017
Hypertext Markup Language (HTML)
 Dominant markup language for webpages
 If you never heard about HTML have a look at
 http://www.w3schools.com/html/
 More details in the exercise and in the next lecture
<!DOCTYPE html>
<html lang="en">
<head>
<title>Beat Signer: Interactive Paper, PaperWorks, Paper++, ...</title>
</head>
<body>
Beat Signer is Associate Professor of Computer Science at the VUB
and co-director of the WISE laboratory ...
</body>
</html>
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 24October 6, 2017
Dynamic Web Content
 Often it is not enough to serve static web pages but
content should be changed on the client or server side
 Server-side processing
 Common Gateway Interface (CGI)
 Java Servlets
 JavaServer Pages (JSP)
 PHP: Hypertext Preprocessor (PHP)
 ...
 Client-side processing
 JavaScript
 Java Applets
 Adobe Flash
 ...
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 25October 6, 2017
Common Gateway Interface (CGI)
 CGI was the first server-side processing solution
 transparent to the user
 certain requests (e.g. /account.pl) are forwarded via CGI to a
program by creating a new process
 program processes the request and creates an answer with
optional HTTP response headers
Internet
Client Server
HTTP Request
HTTP Response
Program in
Perl, Tcl, C,
C++, Java, ..
HTML Pages
CGI
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 26October 6, 2017
Common Gateway Interface (CGI) ...
 CGI Problems
 a new process has to be started for each request
 if the CGI program for example acts as a gateway to a database,
a new DB connection has to be established for each request
which results in a very poor performance
 FastCGI solves some of the problems by introducing
persistent processes and process pools
 CGI/FastCGI becomes more and more replaced by other
technologies (e.g. Java Servlets)
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 27October 6, 2017
Java Servlets
 A Java servlet is a Java class that has to extend the
abstract HTTPServlet class
 The Java servlet class is loaded by a servlet container
and relevant requests (based on a servlet binding) are
forwarded to the servlet instance for further processing
Internet
Client Server
HTTP Request
HTTP Response
HTML Pages
Servlet
Container
Servlets
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 28October 6, 2017
Java Servlets ...
 Main HttpServlet methods
 Servlet life cycle
 a servlet is initialised once via the init() method
 the doGet(), doPost() methods may be executed multiple times
(by different HTTP requests)
 finally the servlet container may unload a servlet (upcall of the
destroy() method before that happens)
 Servlet container (e.g. Apache Tomcat) either integrated
with web server or as standalone component
doGet(HttpServletRequest req, HttpServletResponse resp)
doPost(HttpServletRequest req, HttpServletResponse resp)
init(ServletConfig config)
destroy()
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 29October 6, 2017
Java Servlet Example
 In the exercise you will learn how to process parameters etc.
package org.vub.wise;
import java.io.*;
import java.util.Date;
import javax.servlet.http.*;
import javax.servlet.*;
public class HelloWorldServlet extends HttpServlet {
public void doGet (HttpServletRequest req, HttpServletResponse res)
throws ServletException, IOException {
PrintWriter out = res.getWriter();
out.println("<html>");
out.println("<head><title>Hello World</title></head>");
out.println("<body>The time is " + new Date().toString() + "</body>");
out.println("</html>");
out.close();
}
}
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 30October 6, 2017
JavaServer Pages (JSP)
 A "drawback" of Java servlets is that the whole page
(e.g. HTML) has to be defined within the servlet
 not easy to share tasks between web designer and programmer
 Add program code through scriptlets and markup to
existing HTML pages
 These JSP documents are then either interpreted on the
fly (Apache Tomcat) or compiled into Java servlets
 The JSP approach is similar to PHP or Active Server
Pages (ASP)
 Note that Java Servlets become more and more an
enabling technology (as with JSP)
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 31October 6, 2017
JavaScript
 Interpreted scripting language for client-side processing
 JavaScript functionality often embedded in HTML
documents but can also be provided in separate files
 JavaScript often used to
 validate data (e.g. in a form)
 dynamically add content to a webpage
 process events (onLoad, onFocus, etc.)
 change parts of the original HTML document
 create cookies
 ...
 Note: Java and JavaScript are completely different
languages!
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 32October 6, 2017
JavaScript Example
 More details about JavaScript in lecture 6 and in the
exercise session
<html>
<body>
<script type="text/javascript">
document.write("<h1>Hello World!</h1>");
</script>
</body>
</html>
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 33October 6, 2017
Java Applets
 A Java applet is a program delivered to the client side in
the form of Java bytecode
 executed in the browser using a Java Virtual Machine (JVM)
 an applet has to extend the Applet or JApplet class
 runs in the sandbox
 Advantages
 the user automatically always has the most recent version
 high security for untrusted applets
 full Java API available
 Disadvantages
 requires a browser Java plug-in
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 34October 6, 2017
Java Applets ...
 Disadvantages ...
 only signed applets can get more advanced functionality
- e.g. network connections to other machines than the source machine
 More recently Java Web Start (JavaWS) is replacing
Java Applets
 program no longer runs within the browser
- less problematic security restrictions
- less browser compatibility issues
 Math and Physics Applet Examples
 http://www.falstad.com/mathphysics.html
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 35October 6, 2017
Exercise 2
 Hands-on experience with the HTTP protocol
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 36October 6, 2017
References
 David Gourley et al., HTTP: The Definitive
Guide, O'Reilly Media, September 2002
 R. Fielding et al., RFC2616 - Hypertext Transfer
Protocol - HTTP/1.1
 http://www.faqs.org/rfcs/rfc2616.html
 N. Freed et al., RFC6838 - Media Type Specifications
and Registration Procedures
 http://www.faqs.org/rfcs/rfc6838.html
 HTML and JavaScript Tutorials
 http://www.w3schools.com
Beat Signer - Department of Computer Science - bsigner@vub.ac.be 37October 6, 2017
References ...
 M. Knutson, HTTP: The Hypertext Transfer
Protocol (refcardz #172)
 http://refcardz.dzone.com/refcardz/http-hypertext-transfer-0
 W. Jason Gilmore, PHP 5.4 (refcardz #23)
 http://refcardz.dzone.com/refcardz/php-54-scalable
 Java Servlet Tutorial
 http://www.tutorialspoint.com/servlets/
2 December 2005
Next Lecture
HTML5 and the Open Web Platform

More Related Content

Similar to Web Architectures - Web Technologies (1019888BNR) (20)

PDF
Unit v
APARNA P
 
PPT
WebEssentials- lecture 3.ppt
SachinKundu10
 
PPTX
www and http services
Jenica Salmorin
 
PPT
A detailed presentation on the World Wide Web
G.B. Pant University of Agriculture and Technology
 
PPT
Http VS. Https
Raed Aldahdooh
 
PDF
HTTP In-depth
Vinayak Hegde
 
PPTX
computer network introduction. psc notes . Assisant professor in cse.
bushraphd2022
 
PPTX
Www and http
SanthiNivas
 
PPTX
Hypertext Transfer Protocol
Shubham Srivastava
 
PPTX
Web technology introduction to the web and its history
BKReddy3
 
PDF
Advanced Web Design And Development BIT 3207
Lori Head
 
PPT
www | HTTP | HTML - Tutorial
MSA Technosoft
 
PPT
Web Services 2009
Cathie101
 
PPT
Web Services 2009
Cathie101
 
PDF
[DSBW Spring 2009] Unit 02: Web Technologies (1/2)
Carles Farré
 
PPTX
Web technologies: HTTP
Piero Fraternali
 
PPTX
Http
NITT, KAMK
 
PPTX
Browser
Shweta Oza
 
PPT
KMUTNB - Internet Programming 2/7
phuphax
 
PDF
Communicating on the web
Adrian Cardenas
 
Unit v
APARNA P
 
WebEssentials- lecture 3.ppt
SachinKundu10
 
www and http services
Jenica Salmorin
 
A detailed presentation on the World Wide Web
G.B. Pant University of Agriculture and Technology
 
Http VS. Https
Raed Aldahdooh
 
HTTP In-depth
Vinayak Hegde
 
computer network introduction. psc notes . Assisant professor in cse.
bushraphd2022
 
Www and http
SanthiNivas
 
Hypertext Transfer Protocol
Shubham Srivastava
 
Web technology introduction to the web and its history
BKReddy3
 
Advanced Web Design And Development BIT 3207
Lori Head
 
www | HTTP | HTML - Tutorial
MSA Technosoft
 
Web Services 2009
Cathie101
 
Web Services 2009
Cathie101
 
[DSBW Spring 2009] Unit 02: Web Technologies (1/2)
Carles Farré
 
Web technologies: HTTP
Piero Fraternali
 
Browser
Shweta Oza
 
KMUTNB - Internet Programming 2/7
phuphax
 
Communicating on the web
Adrian Cardenas
 

More from Beat Signer (20)

PDF
Use Cases and Course Review - Lecture 8 - Human-Computer Interaction (1023841...
Beat Signer
 
PDF
HCI Research Methods - Lecture 7 - Human-Computer Interaction (1023841ANR)
Beat Signer
 
PDF
Evaluation Methods - Lecture 6 - Human-Computer Interaction (1023841ANR)
Beat Signer
 
PDF
Design Guidelines and Models - Lecture 5 - Human-Computer Interaction (102384...
Beat Signer
 
PDF
Human Perception and Cognition - Lecture 4 - Human-Computer Interaction (1023...
Beat Signer
 
PDF
Requirements Analysis and Prototyping - Lecture 3 - Human-Computer Interactio...
Beat Signer
 
PDF
HCI and Interaction Design - Lecture 2 - Human-Computer Interaction (1023841ANR)
Beat Signer
 
PDF
Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)
Beat Signer
 
PDF
Indoor Positioning Using the OpenHPS Framework
Beat Signer
 
PDF
Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...
Beat Signer
 
PDF
Cross-Media Technologies and Applications - Future Directions for Personal In...
Beat Signer
 
PDF
Bridging the Gap: Managing and Interacting with Information Across Media Boun...
Beat Signer
 
PDF
Codeschool in a Box: A Low-Barrier Approach to Packaging Programming Curricula
Beat Signer
 
PDF
The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions
Beat Signer
 
PDF
Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...
Beat Signer
 
PDF
Dashboards - Lecture 11 - Information Visualisation (4019538FNR)
Beat Signer
 
PDF
Interaction - Lecture 10 - Information Visualisation (4019538FNR)
Beat Signer
 
PDF
View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...
Beat Signer
 
PDF
Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)
Beat Signer
 
PDF
Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...
Beat Signer
 
Use Cases and Course Review - Lecture 8 - Human-Computer Interaction (1023841...
Beat Signer
 
HCI Research Methods - Lecture 7 - Human-Computer Interaction (1023841ANR)
Beat Signer
 
Evaluation Methods - Lecture 6 - Human-Computer Interaction (1023841ANR)
Beat Signer
 
Design Guidelines and Models - Lecture 5 - Human-Computer Interaction (102384...
Beat Signer
 
Human Perception and Cognition - Lecture 4 - Human-Computer Interaction (1023...
Beat Signer
 
Requirements Analysis and Prototyping - Lecture 3 - Human-Computer Interactio...
Beat Signer
 
HCI and Interaction Design - Lecture 2 - Human-Computer Interaction (1023841ANR)
Beat Signer
 
Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)
Beat Signer
 
Indoor Positioning Using the OpenHPS Framework
Beat Signer
 
Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...
Beat Signer
 
Cross-Media Technologies and Applications - Future Directions for Personal In...
Beat Signer
 
Bridging the Gap: Managing and Interacting with Information Across Media Boun...
Beat Signer
 
Codeschool in a Box: A Low-Barrier Approach to Packaging Programming Curricula
Beat Signer
 
The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions
Beat Signer
 
Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...
Beat Signer
 
Dashboards - Lecture 11 - Information Visualisation (4019538FNR)
Beat Signer
 
Interaction - Lecture 10 - Information Visualisation (4019538FNR)
Beat Signer
 
View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...
Beat Signer
 
Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)
Beat Signer
 
Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...
Beat Signer
 
Ad

Recently uploaded (20)

PPTX
HUMAN RESOURCE MANAGEMENT: RECRUITMENT, SELECTION, PLACEMENT, DEPLOYMENT, TRA...
PRADEEP ABOTHU
 
DOCX
Lesson 1 - Nature and Inquiry of Research
marvinnbustamante1
 
PDF
Week 2 - Irish Natural Heritage Powerpoint.pdf
swainealan
 
PDF
Vani - The Voice of Excellence - Jul 2025 issue
Savipriya Raghavendra
 
PPTX
care of patient with elimination needs.pptx
Rekhanjali Gupta
 
PPTX
How to Send Email From Odoo 18 Website - Odoo Slides
Celine George
 
PPTX
How to Create a Customer From Website in Odoo 18.pptx
Celine George
 
PPTX
Marketing Management PPT Unit 1 and Unit 2.pptx
Sri Ramakrishna College of Arts and science
 
PPTX
Lesson 1 Cell (Structures, Functions, and Theory).pptx
marvinnbustamante1
 
PPTX
EDUCATIONAL MEDIA/ TEACHING AUDIO VISUAL AIDS
Sonali Gupta
 
PPTX
How to Configure Re-Ordering From Portal in Odoo 18 Website
Celine George
 
PDF
STATEMENT-BY-THE-HON.-MINISTER-FOR-HEALTH-ON-THE-COVID-19-OUTBREAK-AT-UG_revi...
nservice241
 
PPTX
ENGlish 8 lesson presentation PowerPoint.pptx
marawehsvinetshe
 
PPTX
PLANNING FOR EMERGENCY AND DISASTER MANAGEMENT ppt.pptx
PRADEEP ABOTHU
 
PDF
AI-assisted IP-Design lecture from the MIPLM 2025
MIPLM
 
PDF
Lean IP - Lecture by Dr Oliver Baldus at the MIPLM 2025
MIPLM
 
PPTX
Post Dated Cheque(PDC) Management in Odoo 18
Celine George
 
PPTX
Ward Management: Patient Care, Personnel, Equipment, and Environment.pptx
PRADEEP ABOTHU
 
PDF
IMPORTANT GUIDELINES FOR M.Sc.ZOOLOGY DISSERTATION
raviralanaresh2
 
PPTX
Building Powerful Agentic AI with Google ADK, MCP, RAG, and Ollama.pptx
Tamanna36
 
HUMAN RESOURCE MANAGEMENT: RECRUITMENT, SELECTION, PLACEMENT, DEPLOYMENT, TRA...
PRADEEP ABOTHU
 
Lesson 1 - Nature and Inquiry of Research
marvinnbustamante1
 
Week 2 - Irish Natural Heritage Powerpoint.pdf
swainealan
 
Vani - The Voice of Excellence - Jul 2025 issue
Savipriya Raghavendra
 
care of patient with elimination needs.pptx
Rekhanjali Gupta
 
How to Send Email From Odoo 18 Website - Odoo Slides
Celine George
 
How to Create a Customer From Website in Odoo 18.pptx
Celine George
 
Marketing Management PPT Unit 1 and Unit 2.pptx
Sri Ramakrishna College of Arts and science
 
Lesson 1 Cell (Structures, Functions, and Theory).pptx
marvinnbustamante1
 
EDUCATIONAL MEDIA/ TEACHING AUDIO VISUAL AIDS
Sonali Gupta
 
How to Configure Re-Ordering From Portal in Odoo 18 Website
Celine George
 
STATEMENT-BY-THE-HON.-MINISTER-FOR-HEALTH-ON-THE-COVID-19-OUTBREAK-AT-UG_revi...
nservice241
 
ENGlish 8 lesson presentation PowerPoint.pptx
marawehsvinetshe
 
PLANNING FOR EMERGENCY AND DISASTER MANAGEMENT ppt.pptx
PRADEEP ABOTHU
 
AI-assisted IP-Design lecture from the MIPLM 2025
MIPLM
 
Lean IP - Lecture by Dr Oliver Baldus at the MIPLM 2025
MIPLM
 
Post Dated Cheque(PDC) Management in Odoo 18
Celine George
 
Ward Management: Patient Care, Personnel, Equipment, and Environment.pptx
PRADEEP ABOTHU
 
IMPORTANT GUIDELINES FOR M.Sc.ZOOLOGY DISSERTATION
raviralanaresh2
 
Building Powerful Agentic AI with Google ADK, MCP, RAG, and Ollama.pptx
Tamanna36
 
Ad

Web Architectures - Web Technologies (1019888BNR)

  • 1. 2 December 2005 Web Technologies Web Architectures Prof. Beat Signer Department of Computer Science Vrije Universiteit Brussel http://www.beatsigner.com
  • 2. Beat Signer - Department of Computer Science - [email protected] 2October 6, 2017 Basic Client-Server Web Architecture  Effect of typing http://www.vub.ac.be in the broswer bar (1) use a Domain Name Service (DNS) to get the IP address for www.vub.ac.be (answer 134.184.129.2) (2) create a TCP connection to 134.184.129.2 (3) send an HTTP request message over the TCP connection (4) visualise the received HTTP response message in the browser Internet Client Server HTTP Request HTTP Response
  • 3. Beat Signer - Department of Computer Science - [email protected] 3October 6, 2017 Web Server  Tasks of a web server (1) setup connection (2) receive and process HTTP request (3) fetch resource (4) create and send HTTP response (5) logging  The most prominent web servers are the Apache HTTP Server and Microsoft's Internet Information Services (IIS)  A lot of devices have an embedded web server  printers, WLAN routers, TVs, ... Worldwide Web Servers, http://news.netcraft.com
  • 4. Beat Signer - Department of Computer Science - [email protected] 4October 6, 2017 Example HTTP Request Message GET / HTTP/1.1 Host: www.vub.ac.be User-Agent: Mozilla/5.0 (Windows NT 6.1; rv:24.0) Gecko/20100101 Firefox/24.0 Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8 Accept-Language: en-gb,en;q=0.5 Accept-Encoding: gzip, deflate Accept-Charset: ISO-8859-1,utf-8;q=0.7,*;q=0.7 Connection: keep-alive
  • 5. Beat Signer - Department of Computer Science - [email protected] 5October 6, 2017 Example HTTP Response Message HTTP/1.1 200 OK Date: Thu, 03 Oct 2013 17:02:19 GMT Server: Apache/2.2.14 (Ubuntu) X-Powered-By: PHP/5.3.2-1ubuntu4.15 Content-Language: nl Set-Cookie: lang=nl; path=/; domain=.vub.ac.be; expires=Mon, 18-Sep-2073 17:02:16 GMT Content-Type: text/html; charset=utf-8 Keep-Alive: timeout=15, max=987 Connection: Keep-Alive Transfer-Encoding: chunked <!DOCTYPE html> <html lang="nl" dir="ltr"> <head> ... <title>Vrije Universiteit Brussel | Redelijk eigenzinnig</title> <meta name="Description" content="Welkom aan de VUB" /> ... </html>
  • 6. Beat Signer - Department of Computer Science - [email protected] 6October 6, 2017 HTTP Protocol  Request/response communication model  HTTP Request  HTTP Response  Communication always has to be initiated by the client  Stateless protocol (no sessions)  HTTP can be used on top of various reliable protocols  TCP is by far the most commonly used one  runs on TCP port 80 by default  Latest version: HTTP/2.0 (May 2015)  HTTPS scheme used for encrypted connections
  • 7. Beat Signer - Department of Computer Science - [email protected] 7October 6, 2017 Uniform Resource Identifier (URI)  A Uniform Resource Identifier (URI) uniquely identifies a resource  There are two types of URIs  Uniform Resource Locator (URL) - contains information about the exact location of a resource - consists of a scheme, a host and the path (resource name) - e.g. https://vub.academia.edu/BeatSigner - problem: the URL changes if resource is moved! • idea of Persistent Uniform Resource Locators (PURLs) [https://purl.oclc.org]  Uniform Resource Name (URN) - unique and location independent name for a resource - consists of a scheme name, a namespace identifier and a namespace-specific string (separated by colons) - e.g. urn:ISBN:3837027139
  • 8. Beat Signer - Department of Computer Science - [email protected] 8October 6, 2017 HTTP Message Format  Request and response messages have the same format <html> ... </html> HTTP/1.1 200 OK Date: Thu, 03 Oct 2013 17:02:19 GMT Server: Apache/2.2.14 (Ubuntu) X-Powered-By: PHP/5.3.2-1ubuntu4.15 Transfer-Encoding: chunked Content-Type: text/html header field(s) blank line (CRLF) message body (optional) start line HTTP_message = start_line , {header} , "CRLF" , {body};
  • 9. Beat Signer - Department of Computer Science - [email protected] 9October 6, 2017 HTTP Request Message  Request-specific start line  Methods  GET : get a resource from the server  HEAD : get the header only (no body)  POST : send data (in the body) to the server  PUT : store request body on server  TRACE : get the "final" request (after it has potentially been modified by proxies)  OPTIONS : get a list of methods supported by the server  DELETE: delete a resource on the server start_line = method, " " , resource , " " , version; method = "GET" , "HEAD" , "POST" , "PUT" , "TRACE" , "OPTIONS" , "DELETE"; resource = complete_URL | path; version = "HTTP/" , major_version, "." , minor_version;
  • 10. Beat Signer - Department of Computer Science - [email protected] 10October 6, 2017 HTTP Response Message  Response-specific start line  Status codes  100-199 : informational  200-299 : success (e.g. 200 for 'OK')  300-399 : redirection  400-499 : client error (e.g. 404 for 'Not Found')  500-599 : server error (e.g. 503 for 'Service Unavailable') start_line = version , status_code , reason; version = "HTTP/" , major_version, "." , minor_version; status_code = digit , digit , digit; reason = string_phrase;
  • 11. Beat Signer - Department of Computer Science - [email protected] 11October 6, 2017 HTTP Header Fields  There exist general headers (for requests and responses), request headers, response headers, entity headers and extension headers  Some important headers  Accept - request header definining the Media Type that the client will accept • formerly known as Multipurpose Internet Mail Extensions (MIME type)  User-Agent - request header specifying the type of client  Keep-Alive (HTTP/1.0) and Persistent (HTTP/1.1) - general header helping to improve the performance since otherwise a new HTTP connection has to be established for every single webpage element  Content-Type - entity header specifing the body's MIME type
  • 12. Beat Signer - Department of Computer Science - [email protected] 12October 6, 2017 HTTP Header Fields ...  Some important headers ...  If-Modified-Since - request header that is used in combination with a GET request (conditional GET); the resource is only returned if it has been modified since the specified date
  • 13. Beat Signer - Department of Computer Science - [email protected] 13October 6, 2017 Media Types  The Media Type defines the request or response body's content (used for appropiate processing)  7 top-level media types  Standard Media Types are registered with the Internet Assigned Numbers Authority (IANA) [RFC-6838] mediaType = toplevel_type , "/" , subtype; Media Type Description text/plain Human-readable text without formatting information text/html HTML document image/jpeg JPEG-encoded image ... ...
  • 14. Beat Signer - Department of Computer Science - [email protected] 14October 6, 2017 HTTP Message Information  Various tools for HTTP message logging  e.g. HttpFox add-on for Firefox browser  Simple telnet connection  Until 1999 the W3C has been working on HTTP Next Generation (HTTP-NG) as a replacement for HTTP/1.1  never introduced  recently HTTP/2.0 has been released - inspired by Goggle’s development of SPDY telnet wise.vub.ac.be 80 (press Enter) GET /beat-signer HTTP/1.1 (press Enter) Host: wise.vub.ac.be (press Enter 2 times)
  • 15. Beat Signer - Department of Computer Science - [email protected] 15October 6, 2017 Proxies  A web proxy is situated between the client and the server  acts as a server to the client and as a client to the server  can for example be specified in the browser settings; used for - firewalls and content filters - transcoding (on the fly transformation of HTTP message body) - content router (e.g. select optimal server in content distribution networks) - anonymous browsing, ... Internet Client Server Proxy
  • 16. Beat Signer - Department of Computer Science - [email protected] 16October 6, 2017 Caches  A proxy cache is a special type of proxy server  can reduce server load if multiple clients share the same cache  often multi-level hierarchies of caches (e.g. continent, country and regional level) with communication between sibling and parent caches as defined by the Internet Cache Protocol (ICP)  passive or active (prefetching) caches Internet Client 1 ServerProxy Cache Client 2 1 2 12
  • 17. Beat Signer - Department of Computer Science - [email protected] 17October 6, 2017 Caches ...  Special HTTP cache control header fields  Expires - expiration date after which the cached resource has to be refetched  Cache-Control: max-age - maximum age of a document (in seconds) after it has been added to the cache  Cache-Control: no-cache - response cannot be directly served from the cache (has to be revalidated first)  ...  Validators  Last-modified time as validator - cache with resource that has been last modified at time t uses an If-Modified-Since t request for updates  Entity tags (ETag) - changed by the publisher if content has changed; If-None-Match etag request
  • 18. Beat Signer - Department of Computer Science - [email protected] 18October 6, 2017 Caches ...  Advantages  reduces latency and used network bandwidth  reduces server load (client and reverse proxy caches)  transparent to client and server  Disadvantages  additional resources (hardware) required  might get stale data out of the cache  creates additional network traffic if we use an active caching approach (prefetching) but achieve a low cache hit rate  server loses control (e.g. access statistics) since no longer all requests have to be sent to the server
  • 19. Beat Signer - Department of Computer Science - [email protected] 19October 6, 2017 Tunnels  Implement one protocol on top of another protocol  e.g. HTTP as a carrier for SSL connections  Often used to "open" a firewall to protocols that would otherwise be blocked  e.g. tunneling of SSL connections through an open HTTP port Internet SSL Client SSL Server SSL HTTP SSL HTTP[SSL] HTTP[SSL]
  • 20. Beat Signer - Department of Computer Science - [email protected] 20October 6, 2017 Gateways  A gateway can act as a kind of "glue" between applications (client) and resources (server)  translate between two protocols (e.g. from HTTP to FTP)  security accelerator (e.g. HTTPS/HTTP on the server side)  often the gateway and destination server are combined in a single application server (HTTP to server application translator) Internet HTTP Client FTP Server HTTP/FTP Gateway HTTP FTP
  • 21. Beat Signer - Department of Computer Science - [email protected] 21October 6, 2017 Session Management  HTTP is a stateless protocol  Session (state) tracking solutions  use of IP address - problem: IP address is often not uniquely assigned to a single user  browser login - use of special HTTP authenticate headers - after a login the browser sends the user information in each request  URL rewriting - add information to the URL in each request  hidden form fields - similar to URL rewriting but information can also be in body (POST request)  cookies - the server stores a piece of information on the client which is then sent back to the server with each request
  • 22. Beat Signer - Department of Computer Science - [email protected] 22October 6, 2017 Cookies  Introduced by Netscape in June 1994  A cookie is a piece of information that is assigned to a client on their first visit  list of <key,value> pairs  often just a unique identifier  sent via Set-Cookie or Set-Cookie2 HTTP response headers  Browser stores the information in a "cookie database" and sends it back every time the same server is accessed  Potential privacy issues  third-party websites might use persistent cookies for user tracking  Cookies can be disabled in the browser settings
  • 23. Beat Signer - Department of Computer Science - [email protected] 23October 6, 2017 Hypertext Markup Language (HTML)  Dominant markup language for webpages  If you never heard about HTML have a look at  http://www.w3schools.com/html/  More details in the exercise and in the next lecture <!DOCTYPE html> <html lang="en"> <head> <title>Beat Signer: Interactive Paper, PaperWorks, Paper++, ...</title> </head> <body> Beat Signer is Associate Professor of Computer Science at the VUB and co-director of the WISE laboratory ... </body> </html>
  • 24. Beat Signer - Department of Computer Science - [email protected] 24October 6, 2017 Dynamic Web Content  Often it is not enough to serve static web pages but content should be changed on the client or server side  Server-side processing  Common Gateway Interface (CGI)  Java Servlets  JavaServer Pages (JSP)  PHP: Hypertext Preprocessor (PHP)  ...  Client-side processing  JavaScript  Java Applets  Adobe Flash  ...
  • 25. Beat Signer - Department of Computer Science - [email protected] 25October 6, 2017 Common Gateway Interface (CGI)  CGI was the first server-side processing solution  transparent to the user  certain requests (e.g. /account.pl) are forwarded via CGI to a program by creating a new process  program processes the request and creates an answer with optional HTTP response headers Internet Client Server HTTP Request HTTP Response Program in Perl, Tcl, C, C++, Java, .. HTML Pages CGI
  • 26. Beat Signer - Department of Computer Science - [email protected] 26October 6, 2017 Common Gateway Interface (CGI) ...  CGI Problems  a new process has to be started for each request  if the CGI program for example acts as a gateway to a database, a new DB connection has to be established for each request which results in a very poor performance  FastCGI solves some of the problems by introducing persistent processes and process pools  CGI/FastCGI becomes more and more replaced by other technologies (e.g. Java Servlets)
  • 27. Beat Signer - Department of Computer Science - [email protected] 27October 6, 2017 Java Servlets  A Java servlet is a Java class that has to extend the abstract HTTPServlet class  The Java servlet class is loaded by a servlet container and relevant requests (based on a servlet binding) are forwarded to the servlet instance for further processing Internet Client Server HTTP Request HTTP Response HTML Pages Servlet Container Servlets
  • 28. Beat Signer - Department of Computer Science - [email protected] 28October 6, 2017 Java Servlets ...  Main HttpServlet methods  Servlet life cycle  a servlet is initialised once via the init() method  the doGet(), doPost() methods may be executed multiple times (by different HTTP requests)  finally the servlet container may unload a servlet (upcall of the destroy() method before that happens)  Servlet container (e.g. Apache Tomcat) either integrated with web server or as standalone component doGet(HttpServletRequest req, HttpServletResponse resp) doPost(HttpServletRequest req, HttpServletResponse resp) init(ServletConfig config) destroy()
  • 29. Beat Signer - Department of Computer Science - [email protected] 29October 6, 2017 Java Servlet Example  In the exercise you will learn how to process parameters etc. package org.vub.wise; import java.io.*; import java.util.Date; import javax.servlet.http.*; import javax.servlet.*; public class HelloWorldServlet extends HttpServlet { public void doGet (HttpServletRequest req, HttpServletResponse res) throws ServletException, IOException { PrintWriter out = res.getWriter(); out.println("<html>"); out.println("<head><title>Hello World</title></head>"); out.println("<body>The time is " + new Date().toString() + "</body>"); out.println("</html>"); out.close(); } }
  • 30. Beat Signer - Department of Computer Science - [email protected] 30October 6, 2017 JavaServer Pages (JSP)  A "drawback" of Java servlets is that the whole page (e.g. HTML) has to be defined within the servlet  not easy to share tasks between web designer and programmer  Add program code through scriptlets and markup to existing HTML pages  These JSP documents are then either interpreted on the fly (Apache Tomcat) or compiled into Java servlets  The JSP approach is similar to PHP or Active Server Pages (ASP)  Note that Java Servlets become more and more an enabling technology (as with JSP)
  • 31. Beat Signer - Department of Computer Science - [email protected] 31October 6, 2017 JavaScript  Interpreted scripting language for client-side processing  JavaScript functionality often embedded in HTML documents but can also be provided in separate files  JavaScript often used to  validate data (e.g. in a form)  dynamically add content to a webpage  process events (onLoad, onFocus, etc.)  change parts of the original HTML document  create cookies  ...  Note: Java and JavaScript are completely different languages!
  • 32. Beat Signer - Department of Computer Science - [email protected] 32October 6, 2017 JavaScript Example  More details about JavaScript in lecture 6 and in the exercise session <html> <body> <script type="text/javascript"> document.write("<h1>Hello World!</h1>"); </script> </body> </html>
  • 33. Beat Signer - Department of Computer Science - [email protected] 33October 6, 2017 Java Applets  A Java applet is a program delivered to the client side in the form of Java bytecode  executed in the browser using a Java Virtual Machine (JVM)  an applet has to extend the Applet or JApplet class  runs in the sandbox  Advantages  the user automatically always has the most recent version  high security for untrusted applets  full Java API available  Disadvantages  requires a browser Java plug-in
  • 34. Beat Signer - Department of Computer Science - [email protected] 34October 6, 2017 Java Applets ...  Disadvantages ...  only signed applets can get more advanced functionality - e.g. network connections to other machines than the source machine  More recently Java Web Start (JavaWS) is replacing Java Applets  program no longer runs within the browser - less problematic security restrictions - less browser compatibility issues  Math and Physics Applet Examples  http://www.falstad.com/mathphysics.html
  • 35. Beat Signer - Department of Computer Science - [email protected] 35October 6, 2017 Exercise 2  Hands-on experience with the HTTP protocol
  • 36. Beat Signer - Department of Computer Science - [email protected] 36October 6, 2017 References  David Gourley et al., HTTP: The Definitive Guide, O'Reilly Media, September 2002  R. Fielding et al., RFC2616 - Hypertext Transfer Protocol - HTTP/1.1  http://www.faqs.org/rfcs/rfc2616.html  N. Freed et al., RFC6838 - Media Type Specifications and Registration Procedures  http://www.faqs.org/rfcs/rfc6838.html  HTML and JavaScript Tutorials  http://www.w3schools.com
  • 37. Beat Signer - Department of Computer Science - [email protected] 37October 6, 2017 References ...  M. Knutson, HTTP: The Hypertext Transfer Protocol (refcardz #172)  http://refcardz.dzone.com/refcardz/http-hypertext-transfer-0  W. Jason Gilmore, PHP 5.4 (refcardz #23)  http://refcardz.dzone.com/refcardz/php-54-scalable  Java Servlet Tutorial  http://www.tutorialspoint.com/servlets/
  • 38. 2 December 2005 Next Lecture HTML5 and the Open Web Platform