Section6 DataAndFileManagement
Section6 DataAndFileManagement
Outline
Data vs information
Ways of presenting information
Data Concepts
Analogue vs digital data
Need for data converters
Data representation in computers
Data types
Data entry techniques
Methods of data collection
Methods of data capture
Codes
Data entry methods
Data entry checks
Verification
Validation
File organisation
File components
File access methods
Type of files
1
2022/08/19
Data vs information
Data Information
Data Concepts
2
2022/08/19
Data Concepts
Data Concepts
Need for Data converters
These are devices used to convert data from one form to the
other :
Analogue to Digital Converter (ADC)- Used to convert analogue
data into digital form so that the computer can understand and
process the data.
E.g. When processing data sent from a sensor
Digital to Analogue Converter (DAC)- Used to convert digital
from the computer to analogue form so that the computer can
effectively control these devices.
E.g. If the computer is being used to control a device
(motor/valve) the device will be controlled by variable
voltages; the DAC will be used to send out analogue signal
Modem (MODulator DEModulator)-
Converts computer’s digital signals (modulates it) into
analogue for transmission through telephone lines
Reverse this process- analogue signal from a telephone line
into digital for a computer to process the data (demodulates
it)
The main use it to connect to computer networks over long
distances using existing telephone line
3
2022/08/19
Data Concepts
Data types
Many different data types can be stored on a computer system. The data types
which are commonly used are as follows:
Numeric
Numeric (number) data can be in two forms:- Integer and Real. E.g. 1288, 12.45,
-156
Text/Alphanumeric
This allows you to type in text, numbers and symbols. E.g. John Smith, John123
Date/time
Usually formatted in a specific way, e.g. dd/mm/yy, dd/mmm/yyyy, long time
etc. The format depends upon the setup of the computer, the software in use
and the user’s preferences. E.g. 25/10/2007, 15:56PM
Percentage
Percentage numbers are real numbers (decimals) that have been formatted to
show values out of 100. Percentage are usually shown with the percentage
symbol (%). For example: 10%, -178%
Currency
Currency refers to real numbers that are formatted in a specific way. Usually
currency is shown with a currency symbol and (usually) two decimal places,
e.g. P5.23
Boolean
Boolean data is sometimes called 'logical' data (or in some software, 'yes/no'
data). Boolean data can only have two values: TRUE or FALSE
Data Concepts
4
2022/08/19
Observation
Involves looking at how things are done, while
making notes on the information obtained
Interviews
Preparing questions and giving them to
respondents to answer immediately they get
questions, either orally or written
Questionnaires
Preparing set of questions giving them to the
respondents to answer at their own time.
Document study
Allows the analyst to see how the paper files are
kept, look at operating instructions and training
manuals, check the accounts, etc.
Key-to-disk
Optical Character Recognition
Magnetic Ink Character Recognition
Optical Mark Recognition
Barcode readers/scanners
Magnetic Stripe reading
Voice Recognition System
Data logging
5
2022/08/19
Advantages Disadvantages
There shouldn’t be
much of a need for Slow to enter data
training, as it is the most
common data input
method
Transcription (data
No need for entry) errors can
occur
specialised data
collection documents
Keyboard is cheaper Handwriting
to purchase recognition can be
unreliable
6
2022/08/19
Advantages Disadvantages
Damaged and dirty
Recognition is documents are difficult to
exactly accurate read
It is a much faster The forms need special
designing to make sure
method of recording that the marks can easily
data than doing it all be read by the machine.
manually If the forms are not
Less chance of errors correctly filled, they
cannot be read properly
Advantages Disadvantages
Much faster than Damaged and dirty
entering all the data documents are difficult to
manually read
No special data- The system cannot easily
preparation equipment read handwriting (text
required – it just uses text and/or numbers)
on ordinary paper
Data is easily read by It is not very accurate
humans as well as the Converted documents
computer will need to be checked
7
2022/08/19
Advantages Disadvantages
Limited storage capacity
Fast data entry on a magnetic stripe
compared to manual If the stripe becomes
entry damaged in any way, all
of the data is lost
Secure/Error Free – No
The card needs to be
Typing close to the reader for it
Not effected by water
to work properly
Not secure- stripes are
and robust if dropped easily duplicated
Advantages Advantages
No special data- Recognition is not 100%
preparation equipment accurate
required – you just say Dictation systems need
the data to be trained
Data is easily understood Not everything – e.g.
by humans as well as the mathematical formulae
computer – are easy to describe in
Little training is required words
8
2022/08/19
Advantages Disadvantages
Specialist high-quality printing
No need to manually equipment is required – this
enter text – less chance obviously costs more
Only certain characters can
of human error. be written that the device will
Characters can not be be able to interpret
Its more expensive than most
altered. direct data entry methods
Characters can be read Limited amount of characters
even if they have been can be read.
written over.
Codes
Introduction
This usually means shortening the original data in an agreed
manner. The agreement is between the users of the system.
This coding scheme could be part of the training of how to
use the system, and it could also be documented within the
system for new users.
E.g. suppose that a field could contain one of three possible
values; Small, Medium or Large. Instead of typing in the full
word each time we could instead type S, M or L.
9
2022/08/19
Verification methods:
Double entry:
Data is entered twice and the computer checks that they
match up
Visual check/proof reading:
The user manually reads and compares the newly inputted
data against the original source to ensure they match
10
2022/08/19
Validation checks:
Range Check: Checks the data falls between an acceptable upper and
lower value, within a set range.
Format Check: Checks that data is in a specific format E.g. Date should
be in the form dd/mm/yyyy
Length Check: Checks if the input data contains the required number of
characters
Character/type Check: Checks that the data entered is of an expected
type, e.g. text or a number
Presence Check: Checks if data is actually present and has not been
missed out
Check Digit: Look at an extra digit which is calculated from the digits of a
number and then put on the end of the number E.g. Check digits can
identify three types of error:
If two digits have been inverted during input
An incorrect digit entered twice
A digit missed out altogether
Verification Validation
11
2022/08/19
File organisation
File components
A database is an organised collection of information consisting of 1
or more files (or tables)
A database file is a collection of related records. For example, a file
of information about all the pupils in a school
A record in a database file is a collection of related fields. In the
records of the same type , the fields are in the same order.
A field is an item of information. A field contains one individual item
of data. For example, each pupils surname.
A key field or a primary key uniquely identifies a record.
File organisation
File access methods
Direct access
This method is also called Random Access
means that the required data can be found straight away
without having to read through all the data on the disk.
Hard disks, CDs, DVDs, USB memory sticks all allow direct
access to data.
Speed of access is faster
Serial Access
Data is accessed by starting at the beginning and then
searched through, in order/sequence, until the required
information is found.
Because it take longer to locate a file on serial access
devices, they are only used as backup and batch
processing.
Were the speed of locating data is not important
Magnetic tape allows only serial access to data.
12
2022/08/19
File organisation
Types of files
Transaction file:
A temporary file containing transactions/changes that took place.
Used to update the master file.
Master file:
Represent the on-going information pertaining to an organisation
The most important, permanent copy a file of an organisation
It is lost or damaged, the whole system may break down
13