STM Unit - Ii
STM Unit - Ii
TESTING
TRANSACTION FLOWS:
INTRODUCTION:
o A transaction is a unit of work seen from a system user's point of view.
o A transaction consists of a sequence of operations, some of which are performed
by a system, persons or devices that are outside of the system.
o Transaction begin with Birth-that is they are created as a result of some external
act.
o At the conclusion of the transaction's processing, the transaction is no longer in
the system.
o Example of a transaction: A transaction for an online information retrieval
UNIT – 2
system might consist of the following steps or tasks:
Accept input (tentative birth)
Validate input (birth)
Transmit acknowledgement to requester
Do input processing
Search file
Request directions from user
Accept input
Validate input
Process request
Update file
Transmit output
Record transaction in log and clean up (death)
TRANSACTION FLOW GRAPHS:
o Transaction flows are introduced as a representation of a system's processing.
o The methods that were applied to control flow graphs are then used for functional
testing.
o Transaction flows and transaction flow testing are to the independent system
tester what control flows are path testing are to the programmer.
o The transaction flow graph is to create a behavioral model of the program that
leads to functional testing.
o The transaction flowgraph is a model of the structure of the system's behavior
(functionality).
o An example of a Transaction Flow is as follows:
Mergers:Transaction flow junction points are potentially as troublesome as transaction flow splits.
There are three types of junctions: (1) Ordinary Junction (2) Absorption (3) Conjugation
0. Ordinary Junction: An ordinary junction which is similar to the
junction in a control flow graph. A transaction can arrive either on one
link or the other. (See Figure 3.3 (a))
1. Absorption: In absorption case, the predator transaction absorbs
prey transaction. The prey gone but the predator retains its identity.
(See Figure 3.3 (b))
Figure 3.1: An Example of a Transaction Flow 2. Conjugation: In conjugation case, the two parent transactions merge
to form a new daughter. In keeping with the biological flavor this case
USAGE: is called as conjugation.(See Figure 3.3 (c))
o Transaction flows are indispensable for specifying requirements of complicated
systems, especially online systems.
o A big system such as an air traffic control or airline reservation system, has not
hundreds, but thousands of different transaction flows.
o The flows are represented by relatively simple flowgraphs, many of which have a
single straight-through path.
o Loops are infrequent compared to control flowgraphs.
o The most common loop is used to request a retry after user input errors. An ATM
system, for example, allows the user to try, say three times, and will take the card
away the fourth time.
COMPLICATIONS:
o In simple cases, the transactions have a unique identity from the time they're
created to the time they're completed.
o In many systems the transactions can give birth to others, and transactions can
also merge.
o Births:There are three different possible interpretations of the decision symbol,
or nodes with two or more out links. It can be a Decision, Biosis or a Mitosis.
1. Decision:Here the transaction will take one alternative or the other Figure 3.3: Transaction Flow Junctions and Mergers
alternative but not both. (See Figure 3.2 (a))
2. Biosis:Here the incoming transaction gives birth to a new transaction, We have no problem with ordinary decisions and junctions. Births, absorptions, and conjugations
and both transaction continue on their separate paths, and the parent are as problematic for the software designer as they are for the software modeler and the test
retains it identity. (See Figure 3.2 (b)) designer; as a consequence, such points have more than their share of bugs. The common
3. Mitosis:Here the parent transaction is destroyed and two new problems are: lost daughters, wrongful deaths, and illegitimate births.
transactions are created.(See Figure 3.2 (c))
This graph has three normal and three anomalous states and he considers the kk
sequence not to be anomalous. The difference between this state graph and
Figure 3.5 is that redemption is possible. A proper action from any of the three
anomalous states returns the variable to a useful working state.
The point of showing you this alternative anomaly state graph is to demonstrate
that the specifics of an anomaly depends on such things as language, application,
context, or even your frame of mind. In principle, you must create a new definition
of data flow anomaly (e.g., a new state graph) in each situation. You must at least
verify that the anomaly definition behind the theory or imbedded in a data flow
anomaly test tool is appropriate to your situation.
Figure 3.5: Unforgiving Data Flow Anomaly State
Graph STATIC Vs DYNAMIC ANOMALY DETECTION:
Static analysis is analysis done on source code without actually executing it. For example: source
Assume that the variable starts in the K state - that is, it has not been defined or code syntax error detection is the static analysis result.
does not exist. If an attempt is made to use it or to kill it (e.g., say that we're talking Dynamic analysis is done on the fly as the program is being executed and is based on
about opening, closing, and using files and that 'killing' means closing), the intermediate values that result from the program's execution. For example: a division by zero
object's state becomes anomalous (state A) and, once it is anomalous, no action warning is the dynamic result.
can return the variable to a working state. If it is defined (d), it goes into the D, or If a problem, such as a data flow anomaly, can be detected by static analysis methods, then it
defined but not yet used, state. If it has been defined (D) and redefined (d) or doesnot belongs in testing - it belongs in the language processor.
killed without use (k), it becomes anomalous, while usage (u) brings it to the U There is actually a lot more static analysis for data flow analysis for data flow anomalies going on
state. If in U, redefinition (d) brings it to D, u keeps it in U, and k kills it. in current language processors.
For example, language processors which force variable declarations can detect (-u) and (ku)
anomalies.
Forgiving Data - Flow Anomaly Flow Graph:Forgiving model is an alternate model where
But still there are many things for which current notions of static analysis are INADEQUATE.
redemption (recover) from the anomalous state is possible.
Why Static Analysis isn't enough? There are many things for which current notions of static
analysis are inadequate. They are:
Dead Variables:Although it is often possible to prove that a variable
is dead or alive at a given point in the program, the general problem is
unsolvable.
For variable Z: The situation for variable Z (Figure 3.10) is more complicated
STRATEGIES OF DATA FLOW TESTING: because the variable is redefined in many places. For the definition on link (1,3)
we must exercise paths that include subpaths (1,3,4) and (1,3,5). The definition
on link (4,5) is covered by any path that includes (5,6), such as subpath
(1,3,4,5,6, ...). The (5,6) definition requires paths that include subpaths (5,6,7,4)
and (5,6,7,8).
INTRODUCTION:
o Data Flow Testing Strategies are structural strategies. For variable V: Variable V (Figure 3.11) is defined only once on link (1,3).
o In contrast to the path-testing strategies, data-flow strategies take into account Because V has a predicate use at node 12 and the subsequent path to the end
what happens to data objects on the links in addition to the raw connectivity of the must be forced for both directions at node 12, the all-du-paths strategy for this
graph. variable requires that we exercise all loop-free entry/exit paths and at least one
o In other words, data flow strategies require data-flow link weights (d,k,u,c,p). path that includes the loop caused by (11,4). Note that we must test paths that
o Data Flow Testing Strategies are based on selecting test path segments (also include both subpaths (3,4,5) and (3,5) even though neither of these has V
called sub paths) that satisfy some characteristic of data flows for all data definitions. They must be included because they provide alternate du paths to the
objects. V use on link (5,6). Although (7,4) is not used in the test set for variable V, it will be
o For example, all subpaths that contain a d (or u, k, du, dk). included in the test set that covers the predicate uses of array variable V() and U.
o A strategy X is stronger than another strategy Y if all test cases produced under
Y are included in those produced under X - conversely for weaker. The all-du-paths strategy is a strong criterion, but it does not take as many tests
TERMINOLOGY: as it might seem at first because any one test simultaneously satisfies the
1. Definition-Clear Path Segment, with respect to variable X, is a connected criterion for several definitions and uses of several different variables.
sequence of links such that X is (possibly) defined on the first link and not
redefined or killed on any subsequent link of that path segment. ll paths in Figure 1. All Uses Startegy (AU):The all uses strategy is that at least one definition clear
3.9 are definition clear because variables X and Y are defined only on the first link path from every definition of every variable to every use of that definition be
(1,3) and not thereafter. In Figure 3.10, we have a more complicated situation. exercised under some test. Just as we reduced our ambitions by stepping down
The following path segments are definition-clear: (1,3,4), (1,3,5), (5,6,7,4), from all paths (P) to branch coverage (C2), say, we can reduce the number of test
(7,8,9,6,7), (7,8,9,10), (7,8,10), (7,8,10,11). Subpath (1,3,4,5) is not cases by asking that the test set should include at least one path segment from
definition-clear because the variable is defined on (1,3) and again on (4,5). For every definition to every use that can be reached by that definition.
practice, try finding all the definition-clear subpaths for this routine (i.e., for all
variables). For variable V: In Figure 3.11, ADUP requires that we include subpaths (3,4,5)
2. Loop-Free Path Segment is a path segment for which every node in it is visited and (3,5) in some test because subsequent uses of V, such as on link (5,6), can
atmost once. For Example, path (4,5,6,7,8,10) in Figure 3.10 is loop free, but path be reached by either alternative. In AU either (3,4,5) or (3,5) can be used to start
(10,11,4,5,6,7,8,10,11,12) is not because nodes 10 and 11 are each visited twice. paths, but we don't have to use both. Similarly, we can skip the (8,10) link if we've
3. Simple path segment is a path segment in which at most one node is visited included the (8,9,10) subpath. Note the hole. We must include (8,9,10) in some
twice. For example, in Figure 3.10, (7,4,5,6,7) is a simple path segment. A simple test cases because that's the only way to reach the c use at link (9,10) - but
path segment is either loop-free or if there is a loop, only one node is involved. suppose our bug for variable V is on link (8,10) after all? Find a covering set of
4. A du path from node i to k is a path segment such that if the last link has a paths under AU for Figure 3.11.
computational use of X, then the path is simple and definition-clear; if the
penultimate (last but one) node is j - that is, the path is (i,p,q,...,r,s,t,j,k) and link 2. All p-uses/some c-uses strategy (APU+C) : For every variable and every
(j,k) has a predicate use - then the path from i to j is both loop-free and definition of that variable, include at least one definition free path from the
definition-clear. definition to every predicate use; if there are definitions of the variables that are
STRATEGIES: The structural test strategies discussed below are based on the program's control not covered by the above prescription, then add computational use test cases as
flowgraph. They differ in the extent to which predicate uses and/or computational uses of variables required to cover every definition.
are included in the test set. Various types of data flow testing strategies in decreasing order of their
effectiveness are: For variable Z:In Figure 3.10, for APU+C we can select paths that all take the
upper link (12,13) and therefore we do not cover the c-use of Z: but that's okay
0. All - du Paths (ADUP): The all-du-paths (ADUP) strategy is the strongest according to the strategy's definition because every definition is covered. Links
data-flow testing strategy discussed here. It requires that every du path from (1,3), (4,5), (5,6), and (7,8) must be included because they contain definitions for
every definition of every variable to every use of that definition be exercised under variable Z. Links (3,4), (3,5), (8,9), (8,10), (9,6), and (9,10) must be included
some test. because they contain predicate uses of Z. Find a covering set of test cases under
APU+C for all variables in this example - it only takes two tests.
4. All Definitions Strategy (AD) : The all definitions strategy asks only every
definition of every variable be covered by atleast one use of that variable, be that
use a computational use or a predicate use.
5. All Predicate Uses (APU), All Computational Uses (ACU) Strategies : The all Figure 3.12: Relative Strength of Structural Test
predicate uses strategy is derived from APU+C strategy by dropping the Strategies.
requirement that we include a c-use for the variable if there are no p-uses for the
variable. The all computational uses strategy is derived from ACU+P strategy by
dropping the requirement that we include a p-use for the variable if there are no oThe right-hand side of this graph, along the path from "all paths" to "all
c-uses for the variable. statements" is the more interesting hierarchy for practical applications.
o Note that although ACU+P is stronger than ACU, both are incomparable to the
It is intuitively obvious that ACU should be weaker than ACU+P and predicate-biased strategies. Note also that "all definitions" is not comparable to
that APU should be weaker than APU+C. ACU or APU.
SLICING AND DICING:
o A (static) program slice is a part of a program (e.g., a selected set of statements)
ORDERING THE STRATEGIES: defined with respect to a given variable X (where X is a simple variable or a data
o Figure 3.12 compares path-flow and data-flow testing strategies. The arrows vector) and a statement i: it is the set of all statements that could (potentially,
denote that the strategy at the arrow's tail is stronger than the strategy at the under static analysis) affect the value of X at statement i - where the influence of a
arrow's head. faulty statement could result from an improper computational use or predicate
use of some other variables at prior statements.
o If X is incorrect at statement i, it follows that the bug must be in the program slice
for X with respect to i
o A program dice is a part of a slice in which all statements which are known to be
correct have been removed.
o In other words, a dice is obtained from a slice by incorporating information
obtained through testing or experiment (e.g., debugging).
o The debugger first limits her scope to those prior statements that could have
caused the faulty value at statement i (the slice) and then eliminates from further