LLMs-in-the-Loop Part 2: Expert Small AI Models for De-identification
Abstract
The rise of chronic diseases and pandemics like COVID-19 has emphasized the need for
effective patient data processing while ensuring privacy through anonymization and de-
identification of protected health information (PHI). Anonymized data facilitates research
without compromising patient confidentiality.
This paper introduces expert small AI models developed using the LLM-in-the-loop method-
ology to meet the demand for domain-specific de-identification NER models. These models
overcome the privacy risks associated with large language models (LLMs) used via APIs by
eliminating the need to transmit or store sensitive data. More importantly, they consistently
outperform LLMs in de-identification tasks, offering superior performance and reliability.
Our de-identification NER models, developed in eight languages — English, German, Italian, French, Romanian, Turkish, Spanish, and Arabic — achieved average F1-micro scores
of 0.966, 0.975, 0.976, 0.970, 0.964, 0.974, 0.978, and 0.953, respectively. These results es-
tablish them as the most accurate healthcare anonymization solutions available, surpassing
existing small models and even general-purpose LLMs such as GPT-4o.
While Part-1 of this series introduced the LLM-in-the-loop methodology for bio-medical
document translation, this second paper showcases its success in developing cost-effective
expert small NER models in de-identification tasks. Our findings lay the groundwork for fu-
ture healthcare AI innovations, including biomedical entity and relation extraction, demon-
strating the value of specialized models for domain-specific challenges.
1 Introduction
Patient data is essential for improving public health, expanding preventive health services, preventing diseases, and formulating necessary health policies. Recent studies show that almost all (99%) hospitals in the United States use electronic health records (EHRs) [1]. Similarly, Wales, Scotland, Denmark, and Sweden have adopted EHRs in the last few years, although nationally accessible health data is still lacking in the UK. The COVID-19 pandemic, in particular, has again highlighted the importance of EHR data [2]. Thanks to EHRs, disease trends can be examined, models can be built, and health policies can be developed.
As technology has grown more complex and evolved alongside medical practice, methods that protect patient privacy have become necessary [3]. With information security and information leakage gaining importance, breaches of patient data can have significant consequences that go beyond ethical violations and extend into fundamental and health law [4].
Personal data is sensitive information that can be associated with an individual and is protected by various laws [5]. In healthcare, such privacy-sensitive data is called PHI and includes private information such as a patient’s health history and the treatments they have received [6].
EHRs contain both valuable clinical information and PHI. While EHRs are a rich data source
for research, their usability is restricted due to the confidentiality of PHI [7-10]. For example,
the HIPAA law regulates the use of 18 types of PHI, such as names, phone numbers, and dates (Table 1) [11, 12]. Therefore, PHI must be extracted from the text before EHR data can be used. Automated de-identification systems are needed because manually extracting PHI is time-consuming and costly; in addition, consistency between annotators is an important consideration [13, 14]. While early approaches to de-identification relied on complex rules to detect PHI, recent developments use machine learning methods trained on expert-annotated records. Hybrid systems integrate rule-based outputs as features in statistical models such as conditional random fields (CRFs) [15].
Table 1: Protected Health Information Types [11]
No PHI Type
1 Names
2 All geographic subdivisions smaller than a state
3 Dates
4 Telephone numbers
5 Vehicle identifiers
6 Fax numbers
7 Device identifiers and serial numbers
8 Emails
9 URLs
10 Social security numbers
11 Medical record numbers
12 IP addresses
13 Biometric identifiers
14 Health plan beneficiary numbers
15 Full-face photographic images and any comparable images
16 Account numbers
17 Certificate/license numbers
18 Any other unique identifying number, characteristic, or code
De-identification makes it possible to use EHRs in research by removing confidential information [15]. Basic de-identification rules include removing direct identifiers such as names and dates. Advanced statistical methods anonymize the data, thus reducing the risk of re-identification [16]. However, new techniques may also introduce unknown privacy risks, so continuous evaluation and improvement efforts are necessary [17]. Advanced methods can enable extensive collections of EHRs to be used efficiently and securely in research.
According to HIPAA, there are two possible methods of identity masking. The “Expert Determination” method requires employing a domain expert who, using various statistical methods, verifies that there is only a small risk of identifying the individuals whose information is used; the expert must have sufficient experience and knowledge. The other method, “Safe Harbor”, involves de-identifying 18 pre-determined identifiers that must be removed and/or modified in the corpus [11, 18]. Studies using deep learning (DL) models follow the Safe Harbor method and de-identify the relevant PHI.
The lack of comprehensive data privacy frameworks can lead to vulnerabilities, leaving sensitive
patient information susceptible to breaches and misuse. Despite efforts to anonymize this data,
reidentification is still feasible through just a few spatiotemporal data points [19]. Recent ad-
vancements in privacy-preserving technologies have seen increased adoption [20], particularly
in artificial intelligence (AI) and big data analytics. These technologies are vital in addressing
major global health challenges by enhancing access to healthcare, promoting health, preventing
diseases, and improving the overall experience for healthcare professionals and patients. AI,
coupled with big data analytics, is the backbone for many innovations in digital health, driv-
ing improvements in care delivery and decision-making processes. Together, these domains are
supported by additional technologies like the Internet of Things (IoT), next-generation networks
(e.g., 5G), and privacy-preserving platforms such as blockchain [21, 22].
However, questions remain regarding accountability for AI and LLM outcomes. Since AI lacks
autonomy and sentience, it cannot hold moral responsibility, leaving uncertainty about who
should be accountable for its decisions and actions [23].
From our perspective, the “LLM-in-the-loop” methodology makes LLMs an integral part of the development process for expert small models, without relying on LLMs as the final solution.
Instead of directly using LLMs for tasks, we utilize them selectively at various stages, such
as synthetic data generation, rigorous evaluation, and agent orchestration, to improve the per-
formance of smaller, domain-specific models. This approach allows us to benefit from the
capabilities of LLMs while keeping the models efficient, focused, and specialized for specific
tasks.
Recently, there has been growing interest in work carried out within the scope of LLM-in-the-loop approaches. Studies have shown that LLMs perform well on tasks traditionally completed by humans [24-26], underscoring their potential for effective utilization. The “LLM-in-the-loop” approach is now being applied in a variety of fields. In one study conducted to analyze social media content and reveal hidden themes [27], the advanced capabilities of LLMs were leveraged to gain a deeper understanding of social media messages, discover thematic structures and nuances in texts, and effectively match texts to themes. Another study applied the LLM-in-the-loop technique to improve the performance of LLMs themselves, aiming to continuously improve model outputs through iterative feedback loops in the medical domain. The goal was to increase the accuracy and reliability of the model and reduce hallucinations. In that study, human experts evaluated the model outputs, provided feedback, and this feedback was used to retrain the model, with a focus on reducing model errors and obtaining more reliable results in medical question-answering and summarization tasks [28].
Another study, which examines the potential of LLMs to recognize and examine intertextual
relationships in biblical and Koine Greek texts, highlights how LLMs evaluate different in-
tertextual scenarios and how these models can detect direct quotations, allusions, and echoes
between texts. The study also mentions the ability of LLMs to generate intertextual observa-
tions and connections and the potential of these models to reveal new insights. However, it is
noted that the model has difficulties with long query texts and can create incorrect intertextual
connections, which reveals the importance of expert evaluation [29].
We first used the “LLMs-in-the-loop” method in the context of bio-medical document translation
[30]. In this work, we demonstrate its success in developing cost-effective expert small NER
models for de-identification tasks. Our findings lay the groundwork for future healthcare AI
innovations, including biomedical entity and relation extraction, and demonstrate the value of
specialized models for domain-specific challenges.
2 Background
De-identification models, typically framed as Named Entity Recognition (NER) classification models, can be considered under four headings [31]:
• Rule-based models
• Machine learning models
• Hybrid models
• Deep learning models
Techniques such as rule-based models and dictionaries can be easily implemented without la-
bels but are vulnerable to input errors [31-34]. ML methods such as Support Vector Machines
(SVM) and conditional random fields (CRF) can recognize complex patterns but require large
amounts of labelled data and feature engineering and are poor at generalization [35-37]. Hybrid
systems combine rule-based and ML models, providing high accuracy but requiring intensive
feature engineering [38, 39].
Considering the disadvantages of the first three approaches to de-identification system creation,
the latest state-of-the-art systems employ DL techniques to achieve better results than the best
hybrid systems without requiring a time-consuming feature engineering process. DL is an ML
subset using multilayered Artificial Neural Networks (ANN) and is very successful in most
Natural Language Processing (NLP) tasks. Recent advances in DL and NLP (especially in the
field of NER) enable the systems to outperform the winning hybrid system proposed by Yang
and Garibaldi [39] on the 2014 i2b2 de-identification challenge dataset [31, 35].
De-identifying unstructured data is a widely recognized problem [40] in NLP, involving two key
tasks: identifying PHI and replacing it through masking or obfuscation. Research has primarily
focused on PHI identification. Early de-identification approaches [41] and [42], especially in
healthcare, were rule-based, using regular expressions, syntactic rules, and specialized dictio-
naries to detect PHI, such as phone numbers and emails. However, they struggled with identi-
fying more complex entities like names and professions and required significant adjustments to
function in different datasets, limiting their flexibility. The 2014 i2b2 project [34] introduced
automatic de-identification, fueling the advancement of machine and deep learning models
for more accurate PHI detection. Early machine learning methods, such as Conditional Random
Fields (CRF) [43], used hand-crafted features and lexical rules [44], signaling a shift to more
adaptive and scalable approaches.
Work in the de-identification context has achieved human-level accuracy in de-identifying clin-
ical notes from research datasets, but challenges remain in scaling this success to large, real-
world environments. A hybrid context-based model outperformed traditional NER models by 10% on the i2b2-2014 benchmark and made significantly fewer errors (93% accuracy) than ChatGPT (60% accuracy) [45].
LLM-based methods have also been used to develop de-identification models. However, these are still at an early stage, and further development is needed to protect the privacy and security of health data [46]. The continued reliance on APIs for LLMs and the problem of storing patient data show that expert models are still needed.
3 Methodology
This section details the purpose of the research, the datasets employed, the methods for training
and testing, the data preparation process, and the modelling and evaluation phases. Key to this
study is the protection of personal data, adherence to legal regulations, and addressing the risks
associated with processing sensitive patient information.
Our LLM-in-the-loop methodology leverages LLMs at key stages such as synthetic data genera-
tion, labelling, and evaluation, focusing on the development of high-performance, expert small
models. To this end, we used a combination of proprietary closed-source data, open-source
datasets, and synthetic data, all annotated by our labelling team in accordance with i2b2 la-
belling logic. The incorporation of synthetic data and LLM-assisted labelling further enhanced
the scope and quality of our training datasets.
For English-language de-identification NER models, we utilized the entire dataset for training.
The i2b2 test dataset served as the exclusive test set for evaluation purposes, allowing us to
benchmark performance with high precision. For non-English languages, we applied an 80-
20 split for training and testing. Additionally, our medical translation models [30] were used
to translate the English datasets into non-English languages, generating high-quality parallel
datasets across multiple languages.
In the data pre-processing phase, we employed language-specific tools to ensure accurate de-identification across different languages. The Stanza library was used for Romanian-language tasks, while the NLTK library was used for the other languages. Word tokenization for all datasets was performed using the wordpunct tokenizer from the NLTK library.
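A minimal sketch of this pre-processing step is shown below; the helper function names are our own, and it assumes the required NLTK resources and the Stanza Romanian model have been downloaded.

# Minimal pre-processing sketch (illustrative; helper names are ours).
# Assumes: pip install nltk stanza, and that stanza.download("ro") has been run.
import nltk
import stanza
from nltk.tokenize import wordpunct_tokenize

nltk.download("punkt", quiet=True)  # newer NLTK versions may also need "punkt_tab"
ro_pipeline = stanza.Pipeline(lang="ro", processors="tokenize", verbose=False)

def split_sentences(text: str, lang: str) -> list[str]:
    """Sentence-split with Stanza for Romanian and NLTK for the other languages."""
    if lang == "ro":
        return [sentence.text for sentence in ro_pipeline(text).sentences]
    return nltk.sent_tokenize(text)

def tokenize_words(sentence: str) -> list[str]:
    """Word tokenization with NLTK's wordpunct tokenizer, as used for all datasets."""
    return wordpunct_tokenize(sentence)

for sent in split_sentences("Mrs. Linda Martinez was seen on 2023-05-10. She is 45.", lang="en"):
    print(tokenize_words(sent))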
For evaluation, we adopted the strict evaluation method, where both the chunk and the label had
to match to be considered a correct prediction. This rigorous approach ensures the accuracy and
reliability of our models, particularly in handling PHI.
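As an illustration of this strict criterion, the sketch below uses the seqeval library in strict IOB2 mode (our choice of tooling for the example; the paper's exact evaluation scripts are not shown): an entity counts as correct only when both its span and its label match exactly.

# Strict entity-level evaluation sketch with seqeval (illustrative tooling choice).
# An entity is counted as correct only when chunk boundaries and label both match.
from seqeval.metrics import classification_report, f1_score
from seqeval.scheme import IOB2

y_true = [["B-PATIENT", "I-PATIENT", "O", "B-DATE", "O"]]
y_pred = [["B-PATIENT", "I-PATIENT", "O", "B-DATE", "B-AGE"]]  # one spurious AGE entity

print(classification_report(y_true, y_pred, mode="strict", scheme=IOB2))
print("micro F1:", f1_score(y_true, y_pred, mode="strict", scheme=IOB2, average="micro"))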
The results in Table 4 and Table 7 were achieved using a structured and detailed prompt de-
signed to extract Protected Health Information (PHI) from clinical notes. The prompt provided
a comprehensive list of entity definitions, such as AGE, CITY, DEVICE, and ORGANIZA-
TION, along with examples for clarity. It instructed GPT-4o to identify and mark entities using
a consistent tagging format (e.g., BEGINER_LABEL CHUNK ENDNER) while preserving the
original text. Specific guidelines were included for nuanced cases, such as excluding titles (e.g.,
”Dr.”) from names and marking only actual dates for the DATE label. This rigorous approach
ensured precision in high-performing categories and highlighted areas for improvement in more
challenging entities. The prompt used in the study is presented in Appendix A.
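To score GPT-4o's output, the marked text has to be converted back into (label, chunk) pairs. The small parser below is an illustrative sketch of that step; the regular expression and the function name are our own and assume the tagging format shown in Appendix A.

# Illustrative parser for the BEGINER_LABEL ... ENDNER markup (names and regex are ours).
import re

TAG_PATTERN = re.compile(r"BEGINER_([A-Z\-]+)\s+(.*?)\s+ENDNER", re.DOTALL)

def extract_entities(marked_text: str) -> list[tuple[str, str]]:
    """Return (label, chunk) pairs found in the marked output."""
    return [(m.group(1), m.group(2)) for m in TAG_PATTERN.finditer(marked_text)]

marked = ("Mrs. BEGINER_PATIENT Linda Martinez ENDNER was seen on "
          "BEGINER_DATE 2023-05-10 ENDNER.")
print(extract_entities(marked))  # [('PATIENT', 'Linda Martinez'), ('DATE', '2023-05-10')]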
3.1 Datasets
“i2b2-2014” is a research project on de-identification and heart disease in clinical texts (https://portal.dbmi.hms.harvard.edu), and
its labelling logic was used in our study. For English-language de-identification NER mod-
els, we utilized a combination of mostly open-source and synthetic data, with 22% derived
from proprietary closed-source data. The i2b2 test dataset served as the exclusive test set for
evaluation, enabling us to benchmark performance with high precision. For non-English lan-
guages, we applied an 80-20 split for training and testing. Most of the non-English datasets
were generated through translation from the English dataset using our medical translation mod-
els [30], open-source and through synthetic data generation with LLM-assisted labelling, pro-
ducing high-quality parallel datasets across multiple languages.
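For the non-English corpora, the 80-20 split can be reproduced with a standard utility such as scikit-learn's train_test_split; the snippet below is only an illustration of the split, and the toy sentences and tag sequences are placeholders.

# Illustrative 80-20 train/test split (scikit-learn is our tooling choice for the example).
from sklearn.model_selection import train_test_split

sentences = ["Satz 1 ...", "Satz 2 ...", "Satz 3 ...", "Satz 4 ...", "Satz 5 ..."]
tag_sequences = [["O"], ["O"], ["O"], ["O"], ["O"]]  # BIO tag sequences, truncated for brevity

train_sents, test_sents, train_tags, test_tags = train_test_split(
    sentences, tag_sequences, test_size=0.20, random_state=42)
print(len(train_sents), "train /", len(test_sents), "test")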
Additionally, we utilized some NLP techniques and open-source third-party tools (LangTest by John Snow Labs: https://langtest.org/) to enhance and augment the training datasets.
Although the i2b2 2014 dataset was not utilized for training purposes, we provide relevant in-
formation and statistics here to offer a more comprehensive understanding of its role in our
evaluation process. i2b2/UTHealth is a dataset focused on identifying medical risk factors for
Coronary Artery Disease (CAD) in the medical records of diabetic patients, where risk factors
include hypertension, hyperlipidemia, obesity, smoking status, and family history, as well as
diabetes, CAD, and indicators suggestive of the presence of these diseases [47]. The i2b2 dataset consists of 1,304 progress notes of 296 diabetic patients. All PHI in the dataset was removed during the original study and replaced with random surrogates. The PHI in this dataset was first categorized into HIPAA categories and then into i2b2-PHI categories, as shown in Table 2. Overall, the i2b2 dataset contains 56,348 sentences with 984,723 individual tokens, of which 41,355 are individual PHI tokens representing 28,867 particular PHI instances [31].
A review of the literature shows that datasets for de-identification studies in languages other than English are relatively limited. As a result, only a few de-identification models have been developed for other languages. The de-identification models developed in this study for different languages will therefore contribute to the literature, to data scientists working on these models, and to the health institutions that will use them.
In the study, ten labels were used for the rule-based method and 18 labels for the deep learning method. The training dataset was augmented for ORGANIZATION, PROFESSION, and LOCATION-OTHER, since these entities gave low results in the first training run with the deep learning method. The augmentation stages were performed as follows (a minimal sketch is provided after the list):
• Firstly, a fake chunk data frame was created for each label in various formats.
• Each labelled chunk was removed and replaced with label abbreviations.
• The sentences were translated from English to the working language using our medical translation models [30].
• The label abbreviations in the new sentences were replaced with new chunks of those
labels from the fake data frame.
• This new dataset was converted to “beginning, inside, outside” (BIO) format and added
to the training dataset.
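The minimal sketch below illustrates these augmentation stages; the fake chunk values, the translate() placeholder standing in for our medical translation models [30], and all function names are our own simplifications rather than the production pipeline.

# Illustrative sketch of the augmentation stages above (names and values are ours).
# translate() stands in for the medical translation models [30]; it is not defined here.
import random

fake_chunks = {  # step 1: fake chunk "data frame" per label, in various formats
    "ORGANIZATION": ["Mercy General Hospital", "Nordstadt Klinik"],
    "PROFESSION": ["architect", "teacher"],
    "LOCATION-OTHER": ["Riverside Park", "Central Station"],
}

def mask_chunks(sentence: str, annotations: list[tuple[str, str]]) -> str:
    """Step 2: remove each labelled chunk and replace it with its label abbreviation."""
    for label, chunk in annotations:
        sentence = sentence.replace(chunk, f"<{label}>")
    return sentence

def fill_chunks(sentence: str) -> tuple[str, list[tuple[str, str]]]:
    """Step 4: replace label abbreviations with new fake chunks for those labels."""
    new_annotations = []
    for label, candidates in fake_chunks.items():
        placeholder = f"<{label}>"
        while placeholder in sentence:
            chunk = random.choice(candidates)
            sentence = sentence.replace(placeholder, chunk, 1)
            new_annotations.append((label, chunk))
    return sentence, new_annotations

masked = mask_chunks("She works as an engineer at City Hospital.",
                     [("PROFESSION", "engineer"), ("ORGANIZATION", "City Hospital")])
# translated = translate(masked, target_lang="de")   # step 3 (placeholder)
augmented, annotations = fill_chunks(masked)         # step 4
print(augmented, annotations)                        # step 5: convert to BIO, add to training data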
The performance of the model implemented with the DL method was tested on the i2b2-2014 test set. The model retrained on the dataset with augmented labels showed better classification results when evaluated on that test set [33].
For the English de-identification model trained with the DL method, training used a learning rate of 2e-5, a maximum sentence length of 512, a batch size of 2, and ten epochs. For the rule-based method, regexes suitable for each format were created for the selected labels (simplified examples are sketched below).
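The patterns below are simplified examples of the kind of format-specific regexes used for selected labels; they are illustrative only, not the exact expressions from the study.

# Simplified examples of format-specific regexes for selected labels (illustrative only).
import re

RULES = {
    "PHONE": re.compile(r"\b(?:\+?\d{1,3}[\s.-]?)?(?:\(\d{3}\)|\d{3})[\s.-]?\d{3}[\s.-]?\d{4}\b"),
    "DATE": re.compile(r"\b\d{4}-\d{2}-\d{2}\b|\b\d{1,2}/\d{1,2}/\d{4}\b"),
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
    "ZIP": re.compile(r"\b\d{5}(?:-\d{4})?\b"),
}

def rule_based_phi(text: str) -> list[tuple[str, str]]:
    """Return (label, match) pairs detected by the regex rules."""
    hits = []
    for label, pattern in RULES.items():
        hits.extend((label, m.group(0)) for m in pattern.finditer(text))
    return hits

print(rule_based_phi("Call 555-123-4567 or email [email protected] before 20/10/2023."))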
The training process was carried out with the resulting dataset; the 20% portion held out during the split was used as the test set. The dataset was preprocessed and converted into BIO format (a conversion sketch follows).
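The sketch below shows one minimal way to convert (label, chunk) annotations into BIO-tagged tokens using the same wordpunct tokenization as above; it assumes chunks can be located by exact token matching and is meant only as an illustration.

# Minimal BIO conversion sketch (illustrative; assumes chunks can be located by exact token match).
from nltk.tokenize import wordpunct_tokenize

def to_bio(sentence: str, annotations: list[tuple[str, str]]) -> list[tuple[str, str]]:
    """Tokenize a sentence and assign B-/I-/O tags from (label, chunk) annotations."""
    tokens = wordpunct_tokenize(sentence)
    tags = ["O"] * len(tokens)
    for label, chunk in annotations:
        chunk_tokens = wordpunct_tokenize(chunk)
        for i in range(len(tokens) - len(chunk_tokens) + 1):
            if tokens[i:i + len(chunk_tokens)] == chunk_tokens:
                tags[i] = f"B-{label}"
                for j in range(1, len(chunk_tokens)):
                    tags[i + j] = f"I-{label}"
                break
    return list(zip(tokens, tags))

print(to_bio("Mrs. Linda Martinez was discharged on 20/10/2023.",
             [("PATIENT", "Linda Martinez"), ("DATE", "20/10/2023")]))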
The augmentation stages for the other language models followed the same steps: a fake chunk data frame was created for each label in various formats from the English de-identification dataset, each labelled chunk was replaced with its label abbreviation, the sentences were translated from English to the working language with our medical translation models [30], the abbreviations were replaced with new chunks of those labels from the fake data frame, and the resulting dataset was converted to BIO format and added to the training set.
For the seven languages other than English, de-identification models were trained with the DL method using a learning rate of 2e-5, a maximum sentence length of 512, a batch size of 16 (2 for Romanian), and ten epochs (a fine-tuning sketch with these hyperparameters follows).
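A hedged sketch of this fine-tuning setup with the reported hyperparameters is shown below using the Hugging Face Trainer; the base checkpoint (bert-base-multilingual-cased), the label list, and the dataset objects are placeholders of our own, since the paper does not name its base model.

# Fine-tuning sketch with the reported hyperparameters (lr=2e-5, max length 512,
# batch size 16, 10 epochs). Checkpoint, labels, and datasets are placeholders.
from transformers import (AutoModelForTokenClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

label_list = ["O", "B-PATIENT", "I-PATIENT", "B-DATE", "I-DATE"]  # truncated for brevity
checkpoint = "bert-base-multilingual-cased"  # assumed; the paper does not name its base model

tokenizer = AutoTokenizer.from_pretrained(checkpoint, model_max_length=512)
model = AutoModelForTokenClassification.from_pretrained(checkpoint, num_labels=len(label_list))

args = TrainingArguments(
    output_dir="deid-ner",
    learning_rate=2e-5,
    per_device_train_batch_size=16,   # 2 for Romanian and for the English model
    num_train_epochs=10,
)

# train_dataset / eval_dataset are assumed to be tokenized BIO datasets prepared as above.
# trainer = Trainer(model=model, args=args, tokenizer=tokenizer,
#                   train_dataset=train_dataset, eval_dataset=eval_dataset)
# trainer.train()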
4 Results
The results obtained from the de-identification NER models are shown in Table 4. The same table also includes the results obtained with GPT-4o and, for comparison, the results of other studies that used the same dataset.
As seen in Table 4, the model developed in this study covers PHI types not used in other studies, and satisfactory results were obtained. When the performance of prior studies is compared with the results of this study, new SOTA values are achieved. Although training was performed with 18 PHI labels (the DEVICE and LOCATION-OTHER labels were not used in other studies) and some labels did not reach high scores, the F1 macro score obtained in this study (0.931) was higher than that of the other models, establishing a new SOTA value.
GPT-4o performs well in classes such as CITY, COUNTRY, ZIP, and STATE, achieving high
precision, recall, and F1-scores. However, it struggles significantly with IDNUM, LOCATION-
OTHER, ORGANIZATION, EMAIL, FAX, and DEVICE, where the scores are notably low.
The macro average (0.5757) indicates that the model’s performance varies significantly across
classes, with weaker performance in certain categories. On the other hand, the micro average
(0.5907) is slightly higher, reflecting the model’s stronger performance in more frequent classes,
but overall, the scores are low.
As a result of the de-identification study in seven different languages, the results obtained for
13 labels in German, Italian, and French are shown in Table 5, while the results obtained for
Turkish (13 labels), Spanish (14 labels), and Romanian (14 labels) are shown in Table 6.
Table 5 presents F1-scores for de-identification tasks across the German, Italian, and French
datasets. Overall, the German model achieves the highest macro-average (0.960), followed by
Italian (0.955) and French (0.937). DATE and PHONE categories exhibit consistently strong
performance across all languages, achieving nearly perfect scores (≥ 0.995). In contrast, the
ORGANIZATION category shows notable variability, with the French model scoring signifi-
cantly lower (0.699). These results highlight the robustness of the models in categories such
as AGE, IDNUM, and ZIP while identifying areas for improvement in language-specific chal-
lenges, particularly for underperforming categories like ORGANIZATION in French (Table 5).
However, since no published benchmarks could be found for these languages, the scores obtained in this study could not be directly compared.
Table 6: Turkish, Spanish, Romanian, and Arabic de-identification Model Outputs
(F1-Score)
Table 6 highlights strong performances for Turkish (macro-avg 0.963) and Spanish (0.957)
models, followed by Romanian (0.930) and Arabic (0.922). Categories like DATE, PHONE,
and MEDICAL RECORD achieve near-perfect scores across languages, demonstrating model
robustness. Lower scores are observed for CITY and ORGANIZATION in Romanian and Ara-
bic, indicating room for improvement. Missing or language-specific labels (e.g., EMAIL, SSN)
show variability in evaluation, reflecting dataset differences. Turkish and Spanish excel in most
categories, with consistent performance across diverse labels.
Table 7: i2b2 Test Set Scores (IOB Token Level) Using GPT-4o
Table 7, which evaluates the B- (Beginning) and I- (Inside) tags separately, shows that the model achieves high overall accuracy (0.9672). Classes like B-STATE, I-CITY, and I-COUNTRY perform very well, while B-EMAIL, B-FAX, and I-LOCATION have lower precision and recall, indicating challenges in identifying these entities. The macro average (0.5775) is lower than the weighted average (0.968), suggesting that less frequent or more difficult classes pull down the macro scores, whereas the model is quite successful at predicting the more common entities.
The low scores are attributed to several issues. The model struggles to identify patient and doctor names located in the middle of the text, even though it can find those at the beginning and end. Some hospital names are only partially labelled, which affects the overall precision and recall. Occasionally, the model includes extra tokens within the labels, leading to incorrect annotations. Despite the prompt specifying which labels to use, the model sometimes incorrectly adds labels that were not meant to be included, such as time. The model also confuses some labels or fails to identify them altogether, contributing to the lower scores.
5 Conclusion
This study underscores the importance of de-identification as a key method for safeguarding pa-
tient/personal health information and ensuring its ethical use in scientific research. By remov-
ing identifiable details through techniques like anonymization, generalization, and differential
privacy, de-identification allows data to be used for diverse scientific applications, including
epidemiological studies, disease modelling, and artificial intelligence development while main-
taining patient privacy.
Recent advancements have demonstrated the potential of LLMs in de-identification tasks, yet
challenges remain, particularly around issues of patient data security, API dependencies, and
the need for domain-specific expertise in handling EHRs. Our ”LLMs-in-the-loop” approach
addresses these concerns by integrating small, specialized models tailored to the medical field.
This method enhances both privacy and reliability, enabling the secure use of data without rely-
ing on external APIs or compromising sensitive patient information.
The multilingual nature of this research, spanning several languages, shows the adaptability
and robustness of our models across diverse healthcare environments. While there are inherent
risks associated with data anonymization, this study demonstrates that when properly applied,
de-identification models can strike a delicate balance between protecting individual privacy and
maximizing the utility of health data.
Ultimately, the findings of this study highlight the potential of expert small models devel-
oped through the LLMs-in-the-loop methodology to meet the evolving demands of health-
care research. The models presented here offer a reliable and scalable solution for future de-
identification applications, advancing the capabilities of AI in healthcare while safeguarding
patient privacy.
Future research should focus on further refining and expanding de-identification models to
cover a wider range of languages and healthcare contexts. One of the primary challenges is the
scarcity of high-quality, annotated datasets in languages other than English, which limits the de-
velopment of robust models for non-English speaking regions. Addressing this gap will require
collaborative efforts to create and share multilingual datasets, ensuring more comprehensive
language coverage. Additionally, future studies could explore more advanced augmentation
techniques and develop models capable of handling increasingly complex medical data types,
such as clinical narratives and imaging reports. Continuous innovation in privacy-preserving
methods, such as federated learning, may also prove valuable in safeguarding sensitive patient
information while advancing the performance and applicability of de-identification technolo-
gies across diverse healthcare systems.
References
[1] Ahmed, T., M.M.A. Aziz, and N. Mohammed, De-identification of electronic health
record using neural network, Sci Rep, 2020, 10(1): p. 18600.
[2] Wood, A., et al., Linked electronic health records for research on a nationwide cohort of
more than 54 million people in England: data resource, BMJ, 2021, 373: p. n826.
[3] Gungoren, M., F. Orhan, and N. Kurutkan, Mikro Rekabetçilikte Yeni Yaklaşımlar:
Hastanelerde Oluşan Etik İklimin Kalite ve Akreditasyon Açısından Değerlendirilmesi,
Süleyman Demirel Üniversitesi İktisadi ve İdari Bilimler Fakültesi Dergisi, 2013, 18(1):
p. 221-241.
[4] Varol, Ş., et al., Sağlık kurumlarında bilgi güvenliği bağlamında biyometrik sistemler,
Sağlık Akademisyenleri Dergisi, 2016, 3(4): p. 155-162.
[5] Yilmaz, D., E. Erguner Ozkoc, and G. Ogutcu Ulas, Elektronik Sağlık Kayıtlarında
Farkındalık, 24, 2023.
[6] healthITSecurity, De-Identification of PHI According to the HIPAA Privacy Rule, 2023,
April 13, 2023; Available from: https://healthitsecurity.com/features/de-identification-of-
phi-according-to-the-hipaa-privacy-rule.
[7] Act, A., Health insurance portability and accountability act of 1996, Public law, 1996,
104: p. 191.
[8] Fernandez-Aleman, J.L., et al., Security and privacy in electronic health records: a sys-
tematic literature review, J Biomed Inform, 2013, 46(3): p. 541-62.
[9] Office for Civil Rights, H., Standards for privacy of individually identifiable health infor-
mation. Final rule, Federal register, 2002, 67(157): p. 53181-53273.
[10] Toscano, F., et al., Electronic health records implementation: can the European Union
learn from the United States?, European Journal of Public Health, 2018, 28(suppl 4): p.
cky213. 401.
[11] hhs.gov, Guidance on De-identification of Protected Health Information (hhs_deid_guidance.pdf), 2012; [cited 2023 July 17]; Available from: https://www.hhs.gov/sites/default/files/ocr/privacy/hipaa/understanding/coveredentities/De-identification/hhs_deid_guidance.pdf.
[12] hhs.gov, Standards for Privacy of Individually Identifiable Health Info — HHS.gov,
2013; [cited 2023 July 17]; Available from: https://www.hhs.gov/hipaa/for-
professionals/privacy/guidance/standards-privacy-individually-identifiable-health-
information/index.html.
[13] Neamatullah, I., et al., Automated de-identification of free-text medical records, BMC Med
Inform Decis Mak, 2008, 8: p. 32.
[14] Paul, T., et al., Investigation of the Utility of Features in a Clinical De-identification
Model: A Demonstration Using EHR Pathology Reports for Advanced NSCLC Patients,
Front Digit Health, 2022, 4: p. 728922.
[16] Wu, H., et al., SemEHR: A general-purpose semantic search system to surface semantic
data from clinical notes for tailored care, trial recruitment, and clinical research, J Am
Med Inform Assoc, 2018, 25(5): p. 530-537.
[17] Stubbs, A. and O. Uzuner, Annotating risk factors for heart disease in clinical narratives
for diabetic patients, J Biomed Inform, 2015, 58 Suppl(Suppl): p. S78-S91.
[18] Catelli, R., et al., A Novel COVID-19 Data Set and an Effective Deep Learning Approach
for the De-Identification of Italian Medical Records, IEEE Access, 2021, 9: p. 19097-
19110.
[19] Reddy, S., et al., A governance model for the application of AI in health care, J Am Med
Inform Assoc, 2020, 27(3): p. 491-497.
[20] Ong, J.C.L., et al., Artificial intelligence, ChatGPT, and other large language models for
social determinants of health: Current state and future directions, Cell Rep Med, 2024,
5(1): p. 101356.
[21] Gunasekeran, D.V., et al., Digital health during COVID-19: lessons from operationalising
new models of care in ophthalmology, Lancet Digit Health, 2021, 3(2): p. e124-e134.
[22] Ting, D.S.W., et al., Digital technology and COVID-19, Nat Med, 2020, 26(4): p. 459-461.
[23] Verdicchio, M. and A. Perin, When Doctors and AI Interact: on Human Responsibility for
Artificial Risks, Philos Technol, 2022, 35(1): p. 11.
[24] Dai, S.-C., A. Xiong, and L.-W. Ku, LLM-in-the-loop: Leveraging large language model
for thematic analysis, arXiv preprint arXiv:2310.15100, 2023.
[25] De Paoli, S., Can Large Language Models emulate an inductive Thematic Analysis of
semi-structured interviews? An exploration and provocation on the limits of the approach
and the model, arXiv preprint arXiv:2305.13014, 2023.
[26] Gilardi, F., M. Alizadeh, and M. Kubli, ChatGPT outperforms crowd workers for text-
annotation tasks, Proc Natl Acad Sci U S A, 2023, 120(30): p. e2305016120.
[27] Islam, T. and D. Goldwasser, Discovering latent themes in social media messaging: A
machine-in-the-loop approach integrating llms, arXiv preprint arXiv:2403.10707, 2024.
[28] Pham, D.K. and B.Q. Vo, Towards Reliable Medical Question Answering: Tech-
niques and Challenges in Mitigating Hallucinations in Language Models, arXiv preprint
arXiv:2408.13808, 2024.
[29] Umphrey, R., J. Roberts, and L. Roberts, Investigating Expert-in-the-Loop LLM Discourse
Patterns for Ancient Intertextual Analysis, arXiv preprint arXiv:2409.01882, 2024.
[30] Keles, B., M. Gunay, and S.I. Caglar, LLMs-in-the-loop Part-1: Expert Small AI Models
for Bio-Medical Text Translation, arXiv preprint arXiv:2407.12126, 2024.
[31] Khin, K., P. Burckhardt, and R. Padman, A Deep Learning Architecture for De-
identification of Patient Notes: Implementation and Evaluation, arXiv pre-print server,
2018.
[32] Morrison, F.P., S. Sengupta, and G. Hripcsak, Using a pipeline to improve de-identification performance, AMIA Annu Symp Proc, 2009: p. 447-51.
[33] Stubbs, A., C. Kotfila, and O. Uzuner, Automated systems for the de-identification of lon-
gitudinal clinical narratives: Overview of 2014 i2b2/UTHealth shared task Track 1, J
Biomed Inform, 2015, 58 Suppl(Suppl): p. S11-S19.
[34] Uzuner, O., Y. Luo, and P. Szolovits, Evaluating the state-of-the-art in automatic de-
identification, J Am Med Inform Assoc, 2007, 14(5): p. 550–63.
[35] Dernoncourt, F., et al., De-identification of patient notes with recurrent neural networks, J
Am Med Inform Assoc, 2017, 24(3): p. 596–606.
[36] Ferrandez, O., et al., Evaluating current automatic de-identification methods with Vet-
eran’s health administration clinical documents, BMC Med Res Methodol, 2012, 12: p.
109.
[37] Meystre, S.M., et al., Automatic de-identification of textual documents in the electronic
health record: a review of recent research, BMC Med Res Methodol, 2010, 10: p. 70.
[38] Liu, Z., et al., Automatic de-identification of electronic medical records using token-level
and character-level conditional random fields, J Biomed Inform, 2015, 58 Suppl(Suppl):
p. S47-S52.
[39] Yang, H. and J.M. Garibaldi, Automatic detection of protected health information from
clinic narratives, J Biomed Inform, 2015, 58 Suppl(Suppl): p. S30-S38.
[40] Nadkarni, P.M., L. Ohno-Machado, and W.W. Chapman, Natural language processing:
an introduction, J Am Med Inform Assoc, 2011, 18(5): p. 544-51.
[41] Sweeney, L., Replacing personally-identifying information in medical records, the Scrub
system, Proc AMIA Annu Fall Symp, 1996: p. 333-7.
[42] Gupta, D., M. Saul, and J. Gilbertson, Evaluation of a deidentification (De-Id) software
engine to share pathology reports and clinical documents for research, Am J Clin Pathol,
2004, 121(2): p. 176-86.
[43] He, B., et al., CRFs based de-identification of medical records, J Biomed Inform, 2015,
58 Suppl(Suppl): p. S39-S46.
[44] Lafferty, J., A. McCallum, and F. Pereira, Conditional random fields: Probabilistic models
for segmenting and labeling sequence data, in Icml. 2001. Williamstown, MA.
[45] Kocaman, V., D. Talby, and H.U. Hak, Beyond Accuracy: Automated De-Identification of
Large Real-World Clinical Text Datasets, Value in Health, 2023, 26(12): p. S532.
[46] Liu, Z., et al., Deid-gpt: Zero-shot medical text de-identification by gpt-4, arXiv preprint
arXiv:2303.11032, 2023.
[47] Stubbs, A., et al., Identifying risk factors for heart disease over time: Overview of 2014
i2b2/UTHealth shared task Track 2, J Biomed Inform, 2015, 58 Suppl(Suppl): p. S67-S77.
Appendix A: The Prompt Used to Obtain Benchmarks with GPT-4o

prompt = f"""You are tasked with extracting Protected Health Information (PHI) from clinical notes. Your job is to identify and mark specific entities within the text. Here are the entities you need to look for:

[... entity definitions omitted from this excerpt ...]

{clinical_note}

[...]

* Identify any text that matches one of the PHI entity types listed above.
* For each identified PHI entity, mark the beginning and end of the relevant text chunk using the following format:
BEGINER_LABEL CHUNK ENDNER, where LABEL is one of the entity types from the list and CHUNK is the actual text containing the PHI.
* While marking, DO NOT EDIT OR CHANGE the original clinical text, only put marks described above.

[...]

Original text:
Mrs. Linda Martinez, a 45-year-old architect, having MR #: 2775283 for an evaluation on 2023-05-10. Her insulin pump model ZX900 was assessed by Dr. Michael Brown, M.D. The patient's condition has improved since the 1990s, but she mentioned feeling unwell for past 6 months. MF381/1183 was referenced during her visit, which lasted approximately 5 hours and concluded at 10:05:03. She was discharged on 20/10/2023.

Marked text:
Mrs. BEGINER_PATIENT Linda Martinez ENDNER, a BEGINER_AGE 45 ENDNER year-old BEGINER_PROFESSION architect ENDNER, having MR #: BEGINER_MEDICALRECORD 2775283 ENDNER for an evaluation on BEGINER_DATE 2023-05-10 ENDNER. Her insulin pump model BEGINER_DEVICE ZX900 ENDNER was assessed by Dr. BEGINER_DOCTOR Michael Brown ENDNER, M.D. The patient's condition has improved since the BEGINER_DATE 1990s ENDNER, but she mentioned feeling unwell for past 6 months. BEGINER_IDNUM MF381/1183 ENDNER was referenced during her visit, which lasted approximately 5 hours and concluded at 10:05:03. She was discharged on BEGINER_DATE 20/10/2023 ENDNER.

Important notes:
* Be sure to process the entire clinical note and mark all instances of PHI entities.
* If a chunk of text could belong to multiple entity types, choose the most specific or appropriate one.
* Do not mark information that is not part of the specified PHI entity types.
* Preserve the original text exactly as it appears, including any spelling errors or formatting.
* Label the data, ensuring that professional titles or suffixes such as 'M.D.', 'Ph.D.', or similar are not removed. These titles must be preserved exactly as they appear in the text, without alteration or omission, and should NEVER be inside the label.
* Apostrophe 's' ('s) should not be included within the label when associated with Names. Only the person's name should be inside the label, and the apostrophe 's' should remain outside the marked text. However, apostrophe 's' is allowed within the DATE label when referring to a decade (e.g., 80's).
* Mark only specific calendar dates as DATE. Do not mark relative time expressions like "6 months," "1 year ago," "5 weeks," "5 wks," "yesterday," "today," "days," or similar units of time (months, years, weeks), as they do not represent actual dates.
* Mark only actual dates as DATE. Do not mark time-related expressions such as "10:05:03," "10 am," or durations like "5 hours" as DATE, since they refer to times or durations rather than specific calendar dates.
* Fax numbers should be treated as PHONE entities and marked the same way as phone numbers.
Please process the provided clinical note and return it with all PHI entities appropriately marked.
"""