Unit-3 Attribute Data Input and Data Display (E-next.in)
Unit-3 Attribute Data Input and Data Display (E-next.in)
Tirup Parmar
UNIT –III
Attribute data input and data display:
Attribute data in GIS, Relational model, Data entry, Manipulation of fields and attribute
data, cartographic symbolization, types of maps, typography, map design, map production
https://E-next.in
Prof. Tirup Parmar
Figure A
The difference between spatial and attribute data is well defined with vector based
features.
The Georelational data model stores spatial and attribute data separately and links the two
by feature ID. Figure B
The object based data model combines both spatial and attribute data in single system.
Each spatial feature has unique object ID and attribute to store its geometry. Figure C
https://E-next.in
Prof. Tirup Parmar
https://E-next.in
Prof. Tirup Parmar
In the case of the object based data model, the feature attribute table has a field that stores
the features geometry.
Feature attribute tables also have default fields that summaries the feature geometries
such as length for line features and area and perimeter for area features.
A feature attribute table maybe the only table needed if a data set has only several
attributes.
But this is often not the case. For example, soil map can have over 100 oil interpretations,
soil properties, and performance data.
To store all these attributes in a feature attribute table will require many repetitive and
redundant entries, a process that wastes both time and computer memory.
Moreover, the table will be extremely difficult to use and update.
This is why we need the second type of attribute table.
This second type of attribute table is non spatial, meaning that the table does not have
direct access to the geometry of features but has a field that can link the table to the
feature attribute table whenever necessary.
Tables of non-spatial data may exist as delimited text files, dBase files, excel files.
Database management:-
The essence of feature attribute and non-spatial data tables means that a GIS requires a
database management system to manage these tables.
A DBMS is a software package that enables us to build and manipulate a database.
A DBMS provides tools for data input, search, retrieval, manipulation and output the use
of DBMS has other advantages beyond its GIS applications.
Often a GIS is part of an enterprise wide information system, and attribute data needed
for the GIS may reside in various departments of the same organization.
Therefore GIS must function within the overall information system and interact with
other information technologies.
https://E-next.in
Prof. Tirup Parmar
Ordinal data differentiates data by ranking relationship. For example, soil erosion may be
ordered from severe to moderate to light.
Interval data have known intervals between values. For example, a temperature reading
of 70 degree Fahrenheit is warmer than 60 degree Fahrenheit by 10 degree.
Ratio data are the same as internal data except that ratio data are based on a meaningful
or absolute, zero value. Population densities are an example of ratio data because a
density of zero is an absolute zero.
In GIS applications the four measurement scales may be grouped into two general
categories :
Categorical data include nominal and ordinal scale, and numeric data include interval and
ratio scales.
Data types and measurement scales are obviously related.
Character strings are appropriate for nominal and ordinal data.
Integers and floats are appropriate for interval and ratio data depending on whether
decimal digits are included or not.
But there are exceptions. For example, study may clarify the potential for groundwater
contamination, medium or low but may represent information may be present information
as numeric data using a lookup table.
https://E-next.in
Prof. Tirup Parmar
A hierarchical database organizes its data at different levels and uses only the one to
many association between levels.
This simple example
Based on the one to many association each level is divided into different branches.
A network database with connection across table, as shown by the pages between the
tables in figure.
https://E-next.in
Prof. Tirup Parmar
A common problem with both the hierarchical and the network database design is that the
link pages between tables must be known in advance and built using the database at
design time.
This requirement tend to make a complicated and inflexible database and limit the
database applications.
A relational database is a collection of table, also called relations, which can be
connected to each other by keys.
https://E-next.in
Prof. Tirup Parmar
A primary key replaces one or more attributes whose values can identify a record in a
table.
Its counterpart is another table for the purpose of linkage is called a foreign key.
Thus a key common to two tables can tablets connections between corresponding records
in the table.
In the key connecting zoning and parcel is the zone code and the key containing parcel
and owner is the PIN.
When used together, the key can relate zoning and owner.
Normalization
Preparing a relational database involves following certain rules.
An important rule is called normalization.
Normalization is a process of decomposition, taking a table with all the attribute data and
breaking it down into small tables while maintaining the necessary linkages between
them.
Normalization is designed to achieve the following objectives
1. To avoid redundant data in table that waste space in the database and may
cause data integrity problems
2. To ensure that attribute data in separate tables can be maintained and updated
separately and can be linked whenever necessary.
3. To facilitate a distributed database.
An example of normalization is offered here. Consider the image
Table shows attributes data for a parcel map. The table contains redundant data: owner
addresses are repeated for smith and residential and commercial zoning are entered twice.
The table also contains uneven records depending on the parcel, the field of owner and
owner address have either one or two values.
An unnormalized table such as shown cannot be easily managed or edited.
To begin with, it is difficult to define the field of owner and owner address and to store
their values.
A change of the ownership requires that all attribute data be updated in the table.
The same difficulty applies to search option as adding or deleting values. Table 1
represent the first step in normalization .Often called the first normal form, table 2 no
longer has multiple values in its cells, but the problem of data redundancy has increased.
https://E-next.in
Prof. Tirup Parmar
P101 & P102 are duplicated except for changes of the owner and the owner address.
Smith’s address is included twice and the zoning description of residential and
commercial are listed three times each.
Also, identifying the owner address is not possible with PIN alone but requires a
compound key of PIN and owner.
Figure D represents the second step in normalization. In place of table 2 are three small
tables of parcel, owner, and address.
https://E-next.in
Prof. Tirup Parmar
The final step in normalization is presented in figure E. A new table, zone, is created to
take care of the remaining data redundancy problem with zoning.
Zone code is the key relating the parcel and zone tables.
Unnormalized data in table 1 are fully normalized.
Higher normal form can achieve objectives consistent with the relational model, but they
can slow down data access and create higher maintenance cost.
To find the addresses of parcel owners, for example, we must link three tables (parcel,
owner and address) and employ to keys (PIN and owner name).
One way to increase the performance in data access is to reduce the level of
normalization by, for example removing the address table and including the addresses in
the owner table.
Therefore, normalizations should be maintained in the conceptual design of a database,
but performance and other factors should be considered in its physical design.
Types of relationships
A relational database may contain four types of relationships between tables or more
precisely between records in tables.
o One to one,
o One to many,
o Many to one
o Many to many.
The one to one relationship means that one and only one record in a table is related to one
and only one record in another table.
The one to many relationship means that one record in a table may be related to many
records in another table.
For example the street address of an apartment complex may include several households.
https://E-next.in
Prof. Tirup Parmar
The many to one relationship means that many records in table may be related to one
record in another table.
For example, several households may share the same street address.
The many to many relationship means that many records in a table may be related to
many records in another table.
For example, a timber stand can grow more than one species and a species can grow in
more than one stand.
To explain these relationships, especially one to many and many to one the designation of
base table can be helpful.
For example the purpose is to join attribute data from another table to a feature attribute
table, then the feature attribute table is the base table and other table is a table to join.
The feature attribute table has the primary key and other table has the foreign key.
Often the designation of the base table depends on the storage of data and information
sought.
This is illustrated in the following two example .First example refers to the four
normalized tables of parcel, owner, address, and zone in figure E.
Suppose the question is to find who owns a selected parcel? To answer this question we
can treat the parcel table as the base table and the owner table as the table to be linked.
https://E-next.in
Prof. Tirup Parmar
Relationship between tables is one to many. One record in the parcel may correspond to
more than one record in owner table.
Suppose the question is now changed to find land parcels owned by a selected owner?
The owner table becomes the base table and parcel table is the table to be linked.
The relationship is many to one: more than one record in the owner table may correspond
to one record in parcel table. The same is true between the parcel table and the zone table.
If the question is to find the zoning code for select parcel. It is many to one relationship.
The question is to find land parcel that as zoned commercial, it is one to many
relationship.
Joining and relating tables. A relational database allows attribute data in separate tables to
be linked whenever necessary.
Two common operations for linking tables are join and relate. A join operation brings
together two tables by using a key that is common to both tables.
A typical example of join is to join attribute data from one or more non-spatial data tables
to a feature attribute table.
The joined table and attribute can then be used for data query and analysis.
Join is usually recommended for one to one or many to one relationship.
Given a one to one relationship, two tables are joined by record.
Given a many to one relationship many records in the base table have the same value for
a record in the other table.
Join is appropriate with the one to many relationship because only the first matching
record value from the other table is assigned to a record in the base table.
A relate operation temporarily connects two tables but keeps the tables physically
separate.
We can connect three or more tables simultaneously by first establishing relates between
tables in pair.
One advantage of relate is that they are appropriate for all four types of relationships.
This is important for data exploration and data query because a relational database is
likely to include different types of relationships.
But relates tend to slow down data access especially if the data reside in a remote
database.
Field definition
First step in attribute data entry is to define each field in the table. A field definition
usually includes the field name, data-width, data type, and the number of decimal digits.
The width refers to the number of spaces to be reserved for field. The width should be
large enough for the largest number, including the sign, or the longest string in the data.
The data type must follow data types allowed in GIS package.
https://E-next.in
Prof. Tirup Parmar
https://E-next.in
Prof. Tirup Parmar
Common map elements are the title, body, legend, north arrow, scale bar,
acknowledgement and neat line / map border. Other elements include the graticule or
grid, name of map projections, inset or location map, and data quality information, these
elements bring the map information to the map readers.
The map body is most important part map it contains the map information.
The elements of the map support the communication process for example, title suggest a
subject matter and legend relates map symbols to mapped data.
In practical terms, map making may be described as a process of assembling map
elements.
Data display is one area in which commercial GIS packages have greatly improved. GIS
packages with graphical user interface are excellent for data display for 2 reasons.
First, the map maker can simply point and click on the graphic icons to construct map. In
comparison, a command line driven package requires the user to become familiar with
many commands and their parameters making a map
Second, desktop GIS packages have incorporated some design options such as symbol
choices and color schemes into menu selection.
Cartographic symbolization
Cartography is the making of and study of maps in all their aspects.
Basic element of cartography is symbolization, the use of different map symbols to
represent spatial features and map symbols.
Spatial features are characterized by their locations and attributes .To display a spatial
data feature on map, we use maps symbol to indicate the features location and visual
variables with the symbol to show the features attribute data.
The choice of map symbol is simple for raster data: the map symbols applies to cells
whether the spatial feature to be depicted is a point, line, or area.
https://E-next.in
Prof. Tirup Parmar
The choice of map symbols for vector data depends on the features type. The general rule
is to use point symbols for point Features, line symbols for line features, and area
symbols for area features.
But this general rule does not apply to volumetric data or aggregate data. There are no
volumetric symbols for data such as division, temperature, and specification.
Instead, 3D surfaces and isolines are often used to map volumetric data. Aggregate data
such as county populations are data reported at and aggregate level.
A common approach is to design an aggregate data to the center of each county and
display the data using Symbols.
Visual variables for data display include hue, value, Chroma, size, texture, shape, and
pattern.
https://E-next.in
Prof. Tirup Parmar
The choice of visual variable depends on the type of data to be displayed. The
measurement scale is commonly used for classifying attribute data.
Size and texture are more appropriate for displaying ordinal, interval and ratio data.
For example, a map may use different circles to represent different size cities.
Shape and pattern are more appropriate for displaying nominal data. For example, a map
may use different area patterns to show different land use types.
Use of color
Because color adds a spatial appeal to a map, mapmakers will choose color maps over
black and white maps whenever possible.
Color is probably the most misused visual variable.
Hue is the quality that distinguishes one color from another, such as read from blue.
Hue can also be defined as the dominant wavelength of light making up a color which
tend to relate different hues with different kinds of data.
Value is the lightness or darkness of color, with black at the lower end and white at the
higher end.
We generally perceive symbols on map as being more important, or greater in terms of
magnitude.
Chroma refers to the richness, or brilliance, of a color.
A fully saturated color is pure, whereas a low saturation approaches gray.
We generally associate higher intensity symbols with greater visual importance.
The first rule of thumb is the use of color is simple: hue is a visual variable better suited
for qualitative data.
Whereas value and Chroma are better suited for quantitative data.
Qualitative mapping is relatively easy. It is not difficult to find 12 or 15 distinctive hues
for a map.
https://E-next.in
Prof. Tirup Parmar
If a map requires more symbols we can add pattern or text to hue to make up more map
symbols.
Quantitative mapping has received much more attention than qualitative mapping.
Over the years, cartographers have suggested general color schemes that combined value
and Chroma for displaying quantitative data.
The following is a summary of the color schemes
Data classification
Data classification involves the use of classification methods and a number of classes for
aggregating data and map features. A GIS package typically offers different data
classification methods. The following summarizes five commonly used methods:
Equal interval: This classification methods divides the range of data values in equal
intervals.
Equal frequency: Also called quantile, this classification method divides the total
number of data values by the number of classes and ensures that each class contains the
same number of data values.
Mean and standard deviation: This classification method sets the class breaks at units
of the standard deviation (0.5, 1.0 etc.) above or below the mean.
Natural breaks: Also called the chain optimization method, this classification method
optimizes the grouping of data values. The methods uses computing algorithm to
https://E-next.in
Prof. Tirup Parmar
minimize differences between data values in the same class and to maximize differences
between classes.
User defined: This method lets the user choose the appropriate or meaningful class
breaks. For example, in mapping rates of population change by state, the user may choose
0 or the national average as a break.
Types of maps:-
Cartographers classify maps by function and by symbolization.
By function, maps can be general reference or thematic.
The general reference map is used for general purposes.
Thematic map, also called the spatial purpose map because its main objective is to show
the distribution pattern of a theme, such as the distribution of population densities by a
county in a state.
By map symbol, map can be qualitative or quantitative.
Qualitative map uses visual variables that are appropriate for portraying qualitative data.
Whereas a quantitative map uses visual variables that are appropriate for communicating
quantitative data.
The following describes several common types of quantitative maps.
https://E-next.in
Prof. Tirup Parmar
https://E-next.in
Prof. Tirup Parmar
https://E-next.in
Prof. Tirup Parmar
7. Flow map
o Flow map displays different quantities of data such as traffic volume and stream flow
by varying the line symbol width. Similar to the graduated symbols the flow symbols
usually represents a range of values.
https://E-next.in
Prof. Tirup Parmar
Typography
A map cannot be understood without text on it.
Text is needed for almost every map element. Mapmakers treat text as a map symbols
because like point, line, or area symbols, text can have many type variations. Using type
variation to create a pleasing and coherent map is therefore part of the map making
process.
Type variations
Type can vary in typeface and form.
Typeface refers to the design character of the type.
Two main groups of typefaces are serif and sans serif.
The first one with serif and last without serif. Serif are small, finishing touches at the end
of line strokes, which tend to make running text in newspapers and books easier to read.
Compared to serif types san serif types appears simpler and bolder.
Type form variations include type weight (bold, regular or light), type width (condensed
or extended), upright vs. slanted (or roman vs. italic) and uppercase vs. lower case.
A Font is a complete set of all variants of a given typeface.
Type can also vary in size and color type size measure the height of a letter come up with
72 points to an inch. Figure
https://E-next.in
Prof. Tirup Parmar
https://E-next.in
Prof. Tirup Parmar
https://E-next.in
Prof. Tirup Parmar
Automated name placement presents several difficult problems for the computer
programmer: names must be legible, names cannot overlap other name, names must be
clearly associated with their intended referent symbols and name placement must follow
cartographic conventions.
These problems worsen as smaller map scales as competition of the map space intensifies
between names.
https://E-next.in
Prof. Tirup Parmar
Map design
Like graphic design, map design is a visual plan to achieve a goal. The purpose of map
design is to enhance map communication which is particularly important for thematic
maps.
Well-designed map is balanced, coherent, ordered and interesting to look, whereas a
poorly design map is confusing and disoriented.
There may not be the right or wrong design for maps, but there are better or more
effective maps and worse or less effective maps.
Cartographers usually study map design from the perspective of layout and visual
hierarchy.
Layout
Layout or planar organization deals with the arrangement and composition of various
map
elements.
Major concerns with layout are focus, order and balance.
A thematic map have a clear focus, which is usually the map body or a part of my body.
To draw the map reader’s attention, the focal element should be placed at the optical
center just above the maps geometric center.
The focal element should be differentiated from other map elements by contrast in line
width, texture, value, detail and color.
After viewing the focal element, the reader should be directed the rest of the map in an
ordered fashion.
For example, the legend the title are probably the next element that the way you need to
look at after the map body.
https://E-next.in
Prof. Tirup Parmar
To smooth the transition, the mapmaker should clearly place the legend and the title on
the map with perhaps a box around the legend and a larger type size for the title to draw
attention to them.
A finished map should look balanced.
It should not give the map reader an impression that the map looks heavier on the top, or
bottom or a side.
But balance does not suggest the breaking down of the map elements and placing them,
almost mechanically, in every part of the map.
Although in that case the elements would be in balance, the map would be disorganized
and confusing.
Map maker therefore should deal with balance within the context of organization and
map communication.
Regardless of the methods used in the layout design the legend deserves special
consideration.
The legend includes description of all the layers that make up a map.
Visual Hierarchy
Visual Hierarchy is the process of developing a visual plan to introduce the 3d effect or
depth to the map.
https://E-next.in
Prof. Tirup Parmar
Map makers create the visual hierarchy by placing map elements at different visual levels
according to their importance to the maps purpose.
The most important element should be at the top of visual hierarchy and it should appear
closest to the reader.
The least important element should be at the bottom.
Thematic map may consist of three or more levels in a visual hierarchy.
The concept of visual hierarchy is an extension of the figure ground relationship in visual
perception.
The figure is more important, appears closer to the viewer, has form and shape, has more
impressive color and has meaning.
The ground is the background.
Cartographers have adopted the depth cues for developing the figure ground relationship
in map design.
https://E-next.in
Prof. Tirup Parmar
Probably the simplest and yet most effective principle in creating visual hierarchy is
interposition or superposition.
Interposition uses the incomplete outline of an object to make it appear as which is
behind another.
Example of interposition abound in maps, especially in newspapers and magazines.
Continents on a map look more important or occupy a higher level in visual hierarchy if
the lines of latitude and longitude stop at the coast.
Subdivisional organization is a map design principle that groups map symbols at the
primary and secondary level according to the intended visual hierarchy.
Each primary signal is given a distinctive hue and the differentiation among the
secondary symbols is based on color variation, pattern or texture.
For example, all tropical climates on climate map are shown in red and different tropical
climates are distinguished by different shades of red.
Sub divisional organization is most useful for maps with many map symbols, such as
climate, soil, geology and vegetation maps.
Contrast
https://E-next.in
Prof. Tirup Parmar
Map production
GIS user design and make maps on computer screen.
These softcopy maps can be used in a variety of ways. They can be printed, exported to
use on the Internet, used in or computer projection systems, exported to other software
packages, or further processed for publishing.
Map production is a complex topic. We are often surprised that color symbols from the
color printers do not exactly match those on the computer screen.
This discrepancy results from the use of different media and color models.
Data display on the computer screen uses either CRT or LCD (liquid crystal display).
It used to be that desktop computers used CRT or laptop or portable computers used
LCDs.
But now more text of computers are also using LCD to take advantage of the thin, flat
screen.
A CRT screen has a built-in fine mesh of pixels and each pixel has colored dots called
phosphors.
When struck by electron from an electron gun a dot slightly lights up and slowly grows
dim.
An LCD screen uses two sheets of polarizing materials which a liquid crystal solution
between them.
Each pixel on an LCD screen can be turned on or off independently.
Besides being thinner and lighter, LCD screens have two other advantages over CRT
screen.
They consume less power and they produce sharper, flicker free images.
With either a CRT or an LCD a color symbol we see on a screen is made of pixels and
color of each pixel is a mixture of RGB (red, green, blue) 256 states. Combining the three
primary colors produces a possible palette of 16.8 million colors 256*256*256.
https://E-next.in