Forensic DNA Testing
Forensic DNA Testing
DNA from no two individuals except identical twins is alike. The person to person
differences in DNA can be discovered by PCR amplification and genomic sequencing.
The main advantages of DNA typing are the universality of application, an almost
unlimited power to discriminate, extreme sensitivity, and reasonable resistance to
degradation by environmental factors. The typing of DNA can be used in:
The differences in DNA are either in the form of variable number of tandem repeats
(VNTR) or single nucleotide polymorphisms (SNP). Approximately 20% of the human
genome comprises tandem repeat sequences. These are called micro-satellites or short
tandem repeats (STR) when the repeats are 2-6bp in length. When the repeat sequences
are 7-80bp in length these are called mini-satellite or variable number of tandem repeats
(VNTR). The number of STR repeat units tends to vary between individuals. This
variation (polymorphism) makes them extremely useful in applications like human
identification and linkage analysis for diagnosis of genetic disorders.
STR analysis:
There are over a million sites (loci) in the human genome that have STRs of which over
20,000 have been characterized. Considering the highly polymorphic nature and the ease
with witch the STRs can be analyzed these provide an extremely powerful tool for human
identification. The discrimination power of STRs increases with increase in the number
of loci tested. Therefore it is usual to use a battery of STRs. The repeat sequence of STRs
range from 2-6bp. Due to technical reasons four base pair (tetra-nucleotide) repeats are
best suited for forensic case work.
STR loci
The STRs are mostly located in the non-coding DNA between the genes (inter-genic
DNA). Some STRs are also present in the intervening sequences (introns) of known
genes. The STRs found in the inter-genic DNA are named according to the chromosome
number and the site. For example in the name “D5S818” “D” stands for DNA, “5” is the
chromosome number, “S” stands for single copy sequence and “818” is the locus number.
The STRs in the introns of the known genes are identified by their location e.g. TH01 is
present in the intron-1 of human tyrosine hydroxylase gene and TPOX is located in the
thyroid peroxidase gene.
1
GTATCCTTATGTAATATTTTGAAGATAGATAGATAGATAGATAGATAGATA
GATAGATAGATAGATAGGTAGATAGAGGTATAAATAAGGATACAGATATAG
Autosomal STRs
In most forensic DNA applications STRs located on autosomes are used. A list of 15 core
STR loci and their chromosomal locations are shown in Table 16.1
(Short Tandem Repeat DNA Internet DataBase).
Table. 16.2. Fifteen Y-STRs available as a commercial kit (Applied Biosystems, USA)
Allele frequencies
In a given population the STR allele frequencies are calculated by simple counting. Each
individual has two alleles at each locus. Genotyping of 100 individuals from a population
would mean examination of 200 chromosomes (alleles). Each allele of a compound
heterozygote is counted as one and homozygotes are counted as two. For example if
20/100 people have allele 8, including 2 homozygotes (8,8), the frequency of allele 8
would be 18+2+2 = 22/200 i.e. 11% or 0.11.
Genotype frequencies
The allele combination in an individual at a locus is called its genotype e.g. “7,10” or
“7,7” etc. The number of possible genotypes increases with an increase in the number of
alleles. The possible genotypes can be calculated by the formula [n(n+1)/2] where n is the
number of different alleles found in the population. The observed genotype frequencies
are calculated by simple counting. The expected genotype frequencies can be calculated
from the observed allele frequencies by using Hardy Weinberg equation (p2 + q2 + 2pq =
1). The homozygotes of two alleles with frequencies of p and q would be equal to p2 and
q2 respectively while the heterozygotes (compound heterozygotes) would be equal to 2pq.
Example
Table 16.4 describes an example of calculation of expected genotype frequencies of
alleles 10 and 11 with observed frequencies of p and q respectively.
Table 16.5. Example of calculation of profile frequency from the genotype frequencies in a
given population.
Table 16.6. Example of correction for consanguineous marriage applied to the genotype
and profile frequency of the example presented in Table 16.5.
Mutations in STRs
Genomic DNA is liable to develop spontaneous mutations with the passage of time. In
fact the existence of highly polymorphic STRs in the genome is thought to be due to
spontaneous mutations that is a fairly regular event. The STR mutations become
significant if these are encountered in solving a case of inheritance.
The rate of spontaneous mutations at the STR loci ranges from 1 in 500 to 1 in 1000. The
rates of mutations at the core STR loci used in forensic DNA testing are given in Table
16.7. A mutation may be suspected if a disagreement is found between alleles of the
parents and the offspring at one odd locus out of the several tested. The mutation can be
confirmed by genomic sequencing. It may be pointed out that matching between DNA
samples from a crime scene and a suspect would not be affected by mutations in the STR
loci.
STR Loci Mutation Rate STR Loci Mutation Rate STR Loci Mutation Rate
CSF1PO 0.16% D3S1358 0.12% D16S539 0.11%
FGA 0.28% D5S818 0.11% D18S51 0.22%
TH01 0.01% D7S820 0.10% D21S11 0.19%
TPOX 0.01% D8S1179 0.14% D2S1338 0.12%
VWA 0.17% D13S317 0.14% D19S433 0.11%
Table 16.9 Core STR loci required by various countries and the legal systems.
Table 16.10 Comparison of the STR profile of a DNA sample collected from a crime
scene (serial 1) and three suspects (serial 2-4). The profile of sample at serial 2
completely matches with that of the crime scene DNA whereas the samples at serial 2 &
3 are excluded from the match.
Probability of match
The example shown in Table 16.10 shows a complete match between the samples at
serials 1 & 2. However, there is a remote probability that the match could be by chance.
The probability of match by chances is inversely proportional to the number of loci
tested. Its calculation in a given population requires comprehensive knowledge of the
allele frequencies of all the loci tested. A worked example of the profile frequency
(probability of match by chance) was discussed in Table 16.5.
Example
The STR profiles of DNA extracted from high vaginal swab of a victim of sexual assault
(sample 1) and the suspects (sample 2 & 3) are shown in Table 16.11. The vaginal swab
shows mixture of two DNAs. The minor component has a male genotype and matches
exactly with that of the suspect 2 whereas the suspect 3 is excluded.
Table 16.11 DNA profiles of victim (sample 1) and two suspects (samples 2 &3) in a
case of sexual assault. The vaginal swab shows mixture of two DNAs. The minor
component has a male genotype and matches exactly with that of the suspect 2 whereas
the suspect 3 is excluded.
11
*
If the father is heterozygous for the allele Prior Probability = 0.5
*
If the father is homozygous for the allele Prior Probability = 1.0
Table. 16.13. Comparison of the DNA profiles of three dead bodies from a mass disaster
and a father. The DNA profile of the “father” shares at least one allele at all fifteen loci
with the DNA profile of the body number 7 and is therefore a proof of identity.
13
Bibliography
1. Kaiser L and Sever G (1983) Paternity testing: I. Calculation of paternity indexes.
Am J Med Genet 15: 323-329.
2. Elston RC (1986) Probability and paternity testing. Am J Hum Genet 39: 112-
122.
3. Smith RN (1995) Accurate size comparison of short tandem repeat alleles amplified
by PCR. Biotechniques 18: 122-128.
4. Brinkmann, B., Klintschar, M., Neuhuber, F., Huhne, J., and Rolf, B. (1998).
Mutation rate in human microsatellites: Influence of the structure and length of
the tandem repeat. Am J Hum Genet 62: 1408-1415.
5. Short Tandem Repeat DNA Internet DataBase, http://www.cstl.nist.gov/strbase/
15