Final File 4
Final File 4
Name of the Institution: Department of Basic Medical Sciences, Unaizah Colloege of Medicine and
Address of Author: Uthman Ibn Affan Rd, Unayzah, Al Qassim 56436, Saudi Arabia
Email: [email protected]
1
Abstract:
Objective: Iron deficiency anemia (IDA) and thalassemia are the most common causes of anemia,
However the differentiation between Beta thalassemia trait (βTT) and IDA is not an easy process
and need more sophisticated procedure like hemoglobin electrophoresis, High Performance Liquid
Chromatography (HPLC), genetic and molecular studies, which are time and money consuming.
However, many equations are made using the hemoglobin, MCV, RDW, MCH and R.B.Cs count to
discriminate between IDA andβTT. An example of these equations are Metzer, Sirdah, Green and
king, Shine and Lal, and Ehsani also many other equations are present, but No one is superior to the
others, In our paper we are trying to build a score system to differentiate between IDA and βTT.
Methodology: We used the most 5 equations with high sensitivity and specificity and give a point
either 1 or zero for each result either IDA or βTT and if the final score is more than 3 it is most
Results: We applied this method for 50patientswhoarediagnosedasIDA orβTT and get the result
Conclusion: We recommend using this score with the help of Artificial Intelligence (AI) as an easy,
cheap, fast, more specific and more sensitive tool to discriminate between IDA and βTT.
2
Introduction:
Iron deficiency anemia (IDA) is the most common cause of anemia (1), also Thalassemia is not are
type of anemia in the Middle East. (2) One variety of β Thalassemia, which is β Thalassemia trait
(βTT), has a blood picture similar in many aspects to Iron deficiency anemia. In both types of
anemia, we get low hemoglobin level (Hb), Low volume of red blood corpuscles (MCV), increased
degree of variation in cell volume (RDW) and low hemoglobin content (MCH). This makes the
Although there is similarity in blood picture, the treatment is different and prescription of iron to β
Chromatography (HPLC), genetic and molecular techniques are used. However, these methods are
expensive and time consuming. (8.9.10) In order to get rapid and easy methods to differentiate
between them many equations and formulae are represented using the R.B.Cs, MCV, MCH and RDW
Table (1)
Name Equation
3
Ricerca RDW/R.B.Cs
MDHL (MCH/MCV)*R.B.Cs
RDWI MCV*RDW/R.B.Cs
No method has a sensitivity and specificity of 100%, but variable degrees of sensitivity and
specificity are present between these methods. (5) Thus, the aim of this study is to put a score
system. This score system is expected to have higher sensitivity and specificity.
This score system depends on the sum of the above methods and uses this score to discriminate
between βTT and IDA. We are going to use the higher equations for sensitivity and specificity as the
base of the score system, This score will be applied on 50 cases of IDA and BTT and will be tested to
see if the score can be used as a method to discriminate between iron deficiency anemia and beta
thalassemia trait.
1- MCV/R.B.Cs
2. MCV-(5*hb)-R.B.Cs–3.4
3. MCV- (10*R.B.Cs)
4
4. MCV-R.B.Cs – (3XHb)
5. MCV*RDW/R.B.Cs
At the end, if the total score Number>3, the diagnosis will be IDA and if the total score<3, it will be βTT. So, we
are going to apply this hypothesis on 30 patients who are diagnosed with IDA and 20 patients who are
Method
obtained from 50 patients with no clinical symptoms of acute or chronic inflammation, or infectious
diseases. We collected our data from laboratory in King Khalid University Hospital, in Riyadh.
Table 2
IDA group
5
7 7.0 3.78 60 18.5 16.7
6
29 11.2 4.71 77 23.7 16.0
30 patients of them were diagnosed with IDA. Diagnosis was done after clinical and laboratory tests
The above table shows various hematological parameters for a group of individuals with Iron
Deficiency Anemia (IDA). Hb: Hemoglobin (g/dL), R.B.Cs: Red Blood Cell Count (million/μL),
MCV: Mean Corpuscular Volume (fL), MCH: Mean Corpuscular Hemoglobin (pg), RDW: Red Cell
Table 3
Ferritin 30–300ng/mL
The above table provides normal reference ranges for various iron-related blood tests. These values are
used to assess an individual's iron status and help diagnose iron-deficiency anemia. Serum iron: This
measures the amount of iron circulating in the blood. (Men=75-150 mcg/dL, Women= 60-140
mcg/dL). Total iron-binding capacity (TIBC): This measures the maximum amount of iron that
transferrin, a protein in the blood, can bind to (Normal range: 250-450 mcg/dL). Ferritin: This is a
7
protein that stores iron in the body (Normal range: 30-300 ng/mL). Transferrin saturation: This is the
Table 4
βTT Group
8
18 11.0 5.50 62 20.0 13.6
9
The other 20 patients were diagnosed with βTT by doing CBC, Iron Profile and Hemoglobin
electrophoresis. The normal value of HBA2 is less than 3.2%, while 3.2% to 3.6% is considered
borderline, which warrants further investigations. Values are 3.6% to 7% are considered beta
thalassemia carriers. We build the score system by using the previously mentioned 5 formulas, all using
the 5 indices (Hb, R.B.Cs, MCV, MCH, and RDW) for evaluation.
The above table shows various hematological parameters for a group of individuals with Iron
Deficiency Anemia (βTT). Hb: Hemoglobin (g/dL), R.B.Cs: Red Blood Cell Count (million/μL),
MCV: Mean Corpuscular Volume (fL), MCH: Mean Corpuscular Hemoglobin (pg), RDW: Red Cell
Results:
In IDA patients, the mean value of Hb was 9.01±4.5, mean value of R.B.Cs 3.79±1.70 mean of MCV
65.2±23.77, MCH 23±11.60, and RDW 17.84 ±5.6. While in βTT, the mean value of Hb was 8.84±
4.00, mean of MCV57.15±12.00 and MCH 18.57±4.10 and RDW 16.29± 5.80.
Our results show that the mean Hb in IDA is slightly higher than βTT group, also other parameters of
MCV, MCH and RDW are higher in IDA than that of βTT the only exception is R.B.Cs count which
is more in βTT than IDA group. Table 5 and 6summarize the mean ±standard deviation of the various
The data was then used to calculate the 5 ratios outlined in the introduction, and the outputs were
recorded accordingly to each patient. A binary distribution system was then used to assign whether a
case was an IDA or βTT diagnosis as per each ratio, where cases of IDA were assigned the value of 1
The results were then used to create a score using the proposed scoring system, and each case was
assigned a test outcome. Example in Patient No.1 where Hb is 6.8, R.B.Cs 3.65, MCV58, MCH17.2
10
2- MCV-(Hb*5)-R.B.Cs -3.4 = if applied 58-(6.8X5)-3.65- 3.4 = 16.9 which is more than zero,
After summing all the scores above, the final score is 5. Using the same principle shown in the above
example, we applied the 5 equations to the 50 samples and get the result (table 7), the 30 patients.
Whose are diagnosed with IDA get a score more than 3 while in βTT group 15 patients are less than 3.
11
Data Analysis
The sample of 50 patients with confirmed cases of IDA and βTT were used in conducting this
analysis. The data consist of blood test results, with the descriptive statistics for each sample of
Table 5
Above table shows descriptive statistics for a group of individuals with Iron Deficiency Anemia (IDA).
Hb (Hemoglobin): The average hemoglobin level is 9.01 g/dL, which is below the normal range,
indicating anemia. The standard deviation is 1.40 g/dL, indicating a moderate spread of hemoglobin
levels in the group. RBCs (Red Blood Cell Count): The average red blood cell count is 3.79
million/μL, which is also below the normal range, indicating anemia. The standard deviation is 0.46
million/μL, suggesting a relatively small variation in red blood cell counts. MCV (Mean Corpuscular
Volume): The average red blood cell size is 65.20 fL, which is below the normal range, indicating
microcytosis (small red blood cells). The standard deviation is 6.78 fL, suggesting a moderate variation
12
in red blood cell sizes. MCH (Mean Corpuscular Hemoglobin): The average amount of hemoglobin in
each red blood cell is 23.77 pg, which is within the normal range. However, when combined with the
low MCV, it indicates a hypochromic anemia (pale red blood cells). The standard deviation is 2.89 pg,
suggesting a moderate variation in hemoglobin content per red blood cell. RDW (Red Cell Distribution
Width): The variation in red blood cell size is 17.84%, which is within the normal range. This suggests
that the anemia is primarily microcytic rather than anisocytic (having a wide variation in cell size).
Mode: The most frequent value in the data.8.70 g/dL (Hb), 3.78 million/μL (RBCs), 67.00 fL (MCV),
19.60 pg (MCH), 16.90% (RDW). Median: The middle value in the data when arranged in order.9.05
g/dL (Hb), 3.80 million/μL (RBCs), 65.00 fL (MCV), 24.30 pg (MCH), 17.90% (RDW). Range: The
difference between the largest and smallest values.4.50 g/dL (Hb), 1.70 million/μL (RBCs), 23.00 fL
(MCV), 11.60 pg (MCH), 5.60% (RDW.) Variance: The square of the standard deviation.1.95 g/dL²
(Hb), 0.21 million/μL² (RBCs), 46.03 fL² (MCV), 8.34 pg² (MCH), 1.60%² (RDW). The descriptive
statistics indicate that the individuals in this group have iron-deficiency anemia, characterized by low
Table 6
13
The above table shows descriptive statistics for a group of individuals, likely patients receiving Anti-
Tuberculosis Therapy (BTT). Hb (Hemoglobin): The average hemoglobin level is 8.84 g/dL, which is
slightly below the normal range, indicating mild anemia. The standard deviation is 1.20 g/dL,
indicating a moderate spread of hemoglobin levels in the group. RBCs (Red Blood Cell Count): The
average red blood cell count is 4.72 million/μL, which is within the normal range. The standard
deviation is 0.42 million/μL, suggesting a relatively small variation in red blood cell counts. MCV
(Mean Corpuscular Volume): The average red blood cell size is 57.15 fL, which is below the normal
range, indicating microcytosis (small red blood cells). The standard deviation is 3.70 fL, suggesting a
moderate variation in red blood cell sizes. MCH (Mean Corpuscular Hemoglobin): The average
amount of hemoglobin in each red blood cell is 18.57 pg, which is within the normal range. However,
when combined with the low MCV, it indicates a hypochromic anemia (pale red blood cells). The
standard deviation is 1.20 pg, suggesting a moderate variation in hemoglobin content per red blood
cell. RDW (Red Cell Distribution Width): The variation in red blood cell size is 16.29%, which is
within the normal range. This suggests that the anemia is primarily microcytic rather than anisocytic
(having a wide variation in cell size).Mode: The most frequent value in the data.7.90 g/dL (Hb), 4.97
million/μL (RBCs), 59.00 fL (MCV), 17.70 pg (MCH), 14.20% (RDW). Median: The middle value in
the data when arranged in order.8.65 g/dL (Hb), 4.74 million/μL (RBCs), 58.00 fL (MCV), 18.65 pg
(MCH), 16.15% (RDW). Range: The difference between the largest and smallest values.4.00 g/dL
(Hb), 1.53 million/μL (RBCs), 12.00 fL (MCV), 4.10 pg (MCH), 5.80% (RDW). Variance: The square
of the standard deviation.1.43 g/dL², 0.18 million/μL², 13.71 fL², 1.45 pg², 3.84%² (RDW). The
descriptive statistics indicate that the individuals in this BTT group have mild anemia with
14
Table 7
Tested
Actual Positive Negative
Positive 30 0
Negative 2 18
The tested out comes were then compared to actual conditions. The table shows a 2x2
contingency table, the table compares the actual condition of individuals to the results of a diagnostic
test. Actual: This column represents the true condition of the individuals, whether they are positive or
negative for the condition being tested. Tested: This column represents the results of the diagnostic test,
whether it was positive or negative. 30 individuals who were actually positive for the condition were
correctly identified as positive by the test (True Positive). 0 individuals who were actually positive
were incorrectly identified as negative by the test (False Negative). 2 individuals who were actually
negative were incorrectly identified as positive by the test (False Positive). 18 individuals who were
actually negative were correctly identified as negative by the test (True Negative).
Table 8
F1 Score
Precision 0.94
Recall 1.00
F1 0.97
15
To validate the results, two tests were conducted: the F1score, and the diagnostic odds ratio (DOR).
The F-Score uses the precision (ratio of true positives to all predicted positives) and the recall (ratio of
true positives to samples that were meant to be positive). The F1 score represents precision and recall
in one metric using the harmonic mean of both measures. As shown in above table, the results
displayed an F1 score of 0.97, which is on the higher end of precision, as a score of 1.0 indicates
The table presents the results of a diagnostic test, including the F1 score, precision, and recall.
These metrics are commonly used to evaluate the performance of classification models. Precision=
0.94, this indicates that 94% of the positive predictions made by the test were actually correct. In other
words, out of all the individuals the test predicted as positive, 94% truly had the condition. Recall=
1.00, this indicates that 100% of the individuals who actually had the condition were correctly
identified by the test. In other words, the test did not miss any positive cases. F1 Score= 0.97, the F1
score is a harmonic mean of precision and recall, providing a single metric that balances both
measures. In this case, the F1 score of 0.97 suggests that the test has good overall performance, with
both high precision and recall. The results suggest that the diagnostic test has high sensitivity (ability to
correctly identify positive cases) and specificity (ability to correctly identify negative cases). This is
indicated by the high recall and precision values, respectively. The F1 score further confirms the good
To support our findings, the diagnostic odds ratio was also calculated. Due to having 0 false
negatives, a value of 0.5 was added to all figures in the contingency (table7) which the reasoning for
16
Table 9
DOR Metrics
The above metrics were calculated to aid with formulating the DOR. Diagnostic Odds Ratio
(DOR) the diagnostic odds ratio is metric used to evaluate the performance of diagnostic tests. It is
calculated as the ratio of the odds of a positive test result in individuals with the condition to the odds
of a positive test result in individuals without the condition. A high DOR indicates a strong association
between the test result and the presence or absence of the condition. Dividing the ratio of true positive
to false positive by the ratio of false positive to false negatives yields a DOR of 451, which can be
interpreted as the scoring system proposed is effective at 451:1. To test for significance, a 95%
confidence interval was calculated which yielded a confidence interval of 9,929 to 21.
The table 9 shows various performance metrics for a diagnostic test. True Positive Rate (TPR)
= 0.98, this represents the proportion of individuals who actually have the condition and were correctly
identified by the test (sensitivity). A high TPR indicates that the test is good at detecting individuals
with the condition. False Negative Rate (FNR) = 0.02, this represents the proportion of individuals who
actually have the condition but were incorrectly identified as negative by the test. A low FNR indicates
that the test is good at avoiding false negatives. True Negative Rate (TNR) = 0.88, this represents the
17
proportion of individuals who do not have the condition and were correctly identified as negative by
the test (specificity). A high TNR indicates that the test is good at avoiding false positives. False
Positive Rate (FPR) = 0.12, this represents the proportion of individuals who do not have the condition
but were incorrectly identified as positive by the test. A low FPR indicates that the test is good at
avoiding false positives. Positive Predictive Value (PPV) = 0.92, this represents the probability that an
individual who tests positive actually has the condition. A high PPV indicates that a positive test result
is a strong predictor of the condition. False Discovery Rate (FDR) = 0.08, this represents the proportion
A low FDR indicates that the test is good at avoiding false positives. Negative Predictive Value
(NPV) = 0.97, this represents the probability that an individual who tests negative does not have the
condition. A high NPV indicates that a negative test result is a strong predictor of not having the
condition. False Omission Rate (FOR) = 0.03. This represents the proportion of individuals who have
the condition but were incorrectly identified as negative by the test. A low FOR indicates that the test is
good at avoiding false negatives. These metrics suggest that the diagnostic test has good performance
in terms of sensitivity, specificity, and predictive values. It is able to accurately identify both
individuals with and without the condition, with relatively low rates of false positives and false
negatives.
18
Table 10
20
48 11.o 5.5 62 20 13.6 βTT 0
21
Data Visualization
Figure 1
The bar graph Figure 1 compares the mean hemoglobin (Hb) levels between two groups: IDA
(Iron Deficiency Anemia) and BTT (Anti-Tuberculosis Therapy). Mean Hb in IDA Group = 9.007
g/dL. Mean Hb in BTT Group = 8.721 g/dL. The graph visually demonstrates that the mean
hemoglobin level is slightly higher in the IDA group compared to the BTT group. This suggests that
individuals in the IDA group may have a less severe anemia compared to those in the BTT group.
However, both groups have mean Hb levels below the normal range, indicating anemia in both.
Figure 2
22
The bar graph Figure 2 compares the mean red blood cell count (RBCs) between two groups:
IDA (Iron Deficiency Anemia) and BTT (Anti-Tuberculosis Therapy). Mean RBCs in IDA Group =
3.786 million/μL. Mean RBCs in BTT Group = 4.725 million/μL. The graph visually demonstrates that
the mean red blood cell count is higher in the BTT group compared to the IDA group. This suggests
that individuals in the BTT group may have a higher number of red blood cells than those in the IDA
group.
Figure 3
The bar graph Figure 3 compares the mean corpuscular volume (MCV) between two groups:
IDA (Iron Deficiency Anemia) and BTT (Anti-Tuberculosis Therapy). Mean MCV in IDA Group =
65.2 fL. Mean MCV in BTT Group = 57.15 fL. The graph visually demonstrates that the mean MCV is
higher in the IDA group compared to the BTT group. This suggests that the red blood cells in the IDA
Figure 4
23
The bar graph Figure 4 compares the mean corpuscular hemoglobin (MCH) between two
groups: IDA (Iron Deficiency Anemia) and BTT (Anti-Tuberculosis Therapy). Mean MCH in IDA
Group = 23.77 pg. Mean MCH in BTT Group = 18.57 pg. The graph visually demonstrates that the
mean MCH is higher in the IDA group compared to the BTT group. This suggests that the red blood
cells in the IDA group contain more hemoglobin than those in the BTT group.
Figure 5
24
The bar graph Figure 5 compares the mean red cell distribution width (RDW) between two
groups: IDA (Iron Deficiency Anemia) and BTT (Anti-Tuberculosis Therapy). Mean RDW in IDA
Group = 17.85%. Mean RDW in BTT Group = 16.29%. The graph visually demonstrates that the mean
RDW is slightly higher in the IDA group compared to the BTT group. This suggests that there is a
slightly greater variation in red blood cell size in the IDA group.
Limitations:
25
The sample size has proved so me limitations, mainly due to the lack of false negatives which
required an adjustment to the contingency table in order to make the DOR ratio meaningful (24).
Further, the confidence interval range can be perceived to be too wide to be representative.
However, the calculated DOR ratio after adjustment falls within the confidence interval, and since
the DOR ratio has no upper bounds, the result is still acceptable. To support the DOR ratio, the F1
score displayed 97% precision. Future studies can refine the results by incorporating a much larger
Discussion:
IDA and βTT are the most common types of hypochromic microcytic anemia present in the
Middle East, and the blood picture is so similar that make the deferential between them not easy,
and to discriminate between the pass through many investigations including SerumIon, TIBC and
ferritin level also we have to measure HBA2 by HPLCor hemoglobin electro phoresis or even use
molecular technology. All these methods are expensive and time consuming. Through history many
scientists tried to use different equations depending on R.B.Cs, Hb, MCV, MCH and RDW.
No equation was better or more accurate than other there was variation in sensitivity and
specificity of these equation, this study propose to make a score to differentiate between IDA and
βTT this score depend on the sum of the most sensitive equations made though many years ago to
discriminate between IDA and βTT. If the result of the equation is showing that the result is giving
the diagnose of IDA, we give it a score of 1 and if the result of the equation is showing that the
patient is βTT we give it a score of 0. Then, we apply this to the five 5 equations and if the result is 3
or more so this is a case of IDA and if the result is less than 3 so it is βTT.
The results for all 30 patients of IDA were 4 or more. While in, βTT group one patient got the score of 4, one
patient had a score of 3 and two patients showing result of 5. This may be due to the combination of IDA and
βTT. However, it is recommended to repeat this score for these 4 patients after treatment of iron deficiency.
26
These results confirm that this method and scoring system are showing high sensitivity and specificity more
than any individual equation and we recommend using this score to differentiate between IDA and βTT. For
future improvements to this system, we hope to use Artificial Intelligence (AI) to make the result of the score
Conclusion:
It was found statistically that the score system to discriminate IDA from βTT has a higher
sensitivity than the 5 equations when each equation is used alone, and we recommend using of this
27
References:
3. Zaini RG.Sickle-cell anemia and Consanguinity among the Saudi Arabian population. Arch
Med.2016;8 (3):3–15.
Mineral Nutrition Information System, 1993-2005. Public Health Nutrition 2009; 12:444-54.
ClinObstetGynaecol 2012;26:3-24.
9. Zhao J., Li J., Lai Q., Yu Y. (2020). Combined use of gap-PCR and next-generation sequencing
improves thalassaemia carrier screening among premarital adults in China. J. Clin. Pathol.
10.1136/jclinpath2019-206339
10. Rund D. Thalassemia 2016: modern medicine battles an ancient disease. Am J Hematol 2016;
91: 15– 21
The Iran Thalassemia prevention program: success or failure? Iran J Ped HematolOncol 2015;
28
5: 161–6
no.7808, p.882,1973.
13. Shineand, S.Lal,“A strategy to detect 𝛽-thalassaemia minor,” The Lancet ,vol.1,
14. J. M. England and P. M. Fraser, “Differentiation of iron deficiency from thalassaemia trait by
15. P.C.Srivastava,“Differentiation of thalassemia minor from iron deficiency, ”The Lancet,” vol.2,
16. R.Greenand, R.King, “A new red cell discriminant incorporating volume dispersion for
differentiating iron deficiency anemia from thalassemia minor, ”Blood Cells”, vol.15, no.3,
pp.481–495, 1989.
18. M.Sirdah, I.Tarazi, E.AlNajjar, and R.Al Haddad, “Evaluation of the diagnostic reliability of
different RBC indices and formulas in the differentiation of the 𝛽-thalassaemia minor from iron
discrimination between iron deficiency anemia and beta-thalassemia minor: results in 284
20. A. Telmissani, S. Khalil, and T. R. George, “Mean density of hemoglobin per liter of blood: a
Haematology”, vol.5,pp.149–152,1999.
29
Appendix
Informed Consent
Building A Score to Discriminate Between Iron Deficiency Anemia and Beta Thalassemia Trait
This research aims to analyze and compare the hematological data of patients with microcytic anemia
to better differentiate between Iron Deficiency Anemia (IDA) and Beta Thalassemia Trait (βTT) , we are
trying to build a score system to differentiate between IDA and βTT. This will contribute to the
development of a score system for improving the diagnostic accuracy of these conditions.
Procedures:
The study will collect data from patients’ previous medical records stored in the laboratory database.
No direct contact with patients will be involved, and no additional testing will be required from
participants. All data will be used anonymously, ensuring the confidentiality of personal information.
Voluntary Participation:
Participation is voluntary, and patients may choose not to have their data included in the study
without any impact on their medical care. Since this is a retrospective study, informed consent is
Confidentiality:
All data will be de-identified, and no information will be used to reveal the identity of the participants.
Only the research team will have access to the anonymized data.
There are no direct risks to participants as no new testing or procedures will be performed. The
potential benefit is the improvement in the diagnostic process for microcytic anemia.
Contact Information:
30
For any queries or concerns regarding the study, you may contact [Fatimah Suliman] at [Email:
[email protected]]. Alternatively, you can reach out to the Ethics Committee at Qassim University
Consent Statement:
I have read and understood the information provided above, and I agree to allow the researchers to
o Yes
o No
----------------- -----------------
We collected our data from laboratory in King Khalid University Hospital, in Riyadh. 30 patients of
them were diagnosed with IDA. Diagnosis was done after clinical and laboratory tests including CBC,
Data will be extracted from the laboratory records of King Khalid University Hospital, Riyadh. The
31
Additional parameters: Serum Iron, Total Iron Binding Capacity (TIBC), Ferritin, and Transferrin
32