0% found this document useful (0 votes)

25 views7 pages

Understanding The Competition Rsna

The document summarizes work done on the RSNA 2023 Abdominal Trauma Detection competition dataset. It includes: 1. Loading and exploring the train CSV file containing patient IDs and labels. 2. Visualizing the distribution of injuries in the data using a bar plot. 3. Reading DICOM image files, displaying an animation of sample images. 4. Preprocessing a subset of the DICOM images to save them in NPY format for further analysis. The dataset contains CT images of patients labeled for abdominal trauma and injury location. The document explores the provided CSV metadata, visualizes injury distributions, reads DICOM files into NumPy arrays, and preprocesses a subset of images for

Uploaded by

asoedjfanush

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

25 views7 pages

Understanding The Competition Rsna

Uploaded by

asoedjfanush

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 7

RSNA : RadioLogical Society of North America

import warnings
warnings.filterwarnings("ignore")

The R S N A 2023 A b d o mi n a l T r a u m a D e t e c t io n competition is a machine learning

challenge that aims to develop algorithms to detect abdominal trauma in
C o m p ut e d T o mo g r a p h y ( C T ) images. The competition is hosted by the R a d io l o g ic a l
S o c i e t y of N o r t h A m e r ic a ( R S N A )
I couldnt find a good meme related to R S N A , so here is L i g h t n in g M c Q u e e n to cheer ¿
K a Cho ww ¿

Thanks to Theo Viel for providing small chunks of data to experiment

K U DO S ! ! !

1 | Data 📊
import pandas as pd
import pydicom

Our main data is situated in a CSV file −− −> ¿ train.csv

train_csv = pd.read_csv("/kaggle/input/rsna-2023-abdominal-trauma-
detection/train.csv")
print(train_csv.shape)
train_csv.head()

(3147, 15)

patient_id bowel_healthy bowel_injury extravasation_healthy \

0 10004 1 0 0
1 10005 1 0 1
2 10007 1 0 1
3 10026 1 0 1
4 10051 1 0 1

extravasation_injury kidney_healthy kidney_low kidney_high \

0 1 0 1 0
1 0 1 0 0
2 0 1 0 0
3 0 1 0 0
4 0 1 0 0
liver_healthy liver_low liver_high spleen_healthy spleen_low \
0 1 0 0 0 0
1 1 0 0 1 0
2 1 0 0 1 0
3 1 0 0 1 0
4 1 0 0 0 1

spleen_high any_injury
0 1 1
1 0 0
2 0 0
3 0 0
4 0 1

This csv file contains the patient_id and further explanatory columns

• P a t i e n t I D patient_id - THis column contains the ids of 3 , 147 patients.

Further columns denote what symptons were shown by the specific patient. We can use the
patient_id to get information from train_images Folder.

T r a i n I m a g e s is a big 3 l e v e l nested - directory, contianing images for the specific patient .

There are different folders in the Direcotry that denote the patient_id

The images are labeled with the presence or absence of abdominal trauma, as well
as the location of any injuries. The data is provided in DICOM format, which is a
standard format for medical images.

The type of data in the competition dataset is C T images of patients with abdominal
trauma. The images are labeled with the presence/absence of abdominal trauma, as well
as the location of any injuries. The data is provided in DICOM format, which is a
standard format for medical images.

We can read the DCIM file through pydicom

image_file =
"/kaggle/input/rsna-2023-abdominal-trauma-detection/train_images/10004
/21057/1001.dcm"
ds = pydicom.read_file(image_file)

Dataset.file_meta -------------------------------
(0002, 0001) File Meta Information Version OB: b'\x00\x01'
(0002, 0002) Media Storage SOP Class UID UI: CT Image Storage
(0002, 0003) Media Storage SOP Instance UID UI:
1.2.123.12345.1.2.3.10004.1.1001
(0002, 0010) Transfer Syntax UID UI: RLE Lossless
(0002, 0012) Implementation Class UID UI:
1.2.3.123456.4.5.1234.1.12.0
(0002, 0013) Implementation Version Name SH: 'PYDICOM 2.4.0'
-------------------------------------------------
(0008, 0018) SOP Instance UID UI:
1.2.123.12345.1.2.3.10004.1.1001
(0008, 0023) Content Date DA: '20230721'
(0008, 0033) Content Time TM: '232531.439438'
(0010, 0020) Patient ID LO: '10004'
(0018, 0050) Slice Thickness DS: '1.0'
(0018, 0060) KVP DS: '90.0'
(0018, 5100) Patient Position CS: 'HFS'
(0020, 000d) Study Instance UID UI:
1.2.123.12345.1.2.3.10004
(0020, 000e) Series Instance UID UI:
1.2.123.12345.1.2.3.10004.21057
(0020, 0011) Series Number IS: '16'
(0020, 0013) Instance Number IS: '1001'
(0020, 0032) Image Position (Patient) DS: [-240.55273, -
378.05273, -1601.4]
(0020, 0037) Image Orientation (Patient) DS: [1.0, 0.0, 0.0,
0.0, 1.0, 0.0]
(0020, 0052) Frame of Reference UID UI:
1.2.826.0.1.3680043.8.498.61841901354930484747532163888112046276
(0028, 0002) Samples per Pixel US: 1
(0028, 0004) Photometric Interpretation CS: 'MONOCHROME2'
(0028, 0010) Rows US: 512
(0028, 0011) Columns US: 512
(0028, 0030) Pixel Spacing DS: [0.89453125,
0.89453125]
(0028, 0100) Bits Allocated US: 16
(0028, 0101) Bits Stored US: 12
(0028, 0102) High Bit US: 11
(0028, 0103) Pixel Representation US: 0
(0028, 1050) Window Center DS: '50.0'
(0028, 1051) Window Width DS: '400.0'
(0028, 1052) Rescale Intercept DS: '-1024.0'
(0028, 1053) Rescale Slope DS: '1.0'
(0028, 1054) Rescale Type LO: 'HU'
(7fe0, 0010) Pixel Data OB: Array of 267568
elements

2 | Visualization 🔬
import numpy as np
import tqdm
import os

import matplotlib.pyplot as plt

import matplotlib.animation as animation
import seaborn as sns
from IPython.display import HTML

Thanks to Yuanjian Li=>EDA with Animation BeginnerFriendly/Jocelyn Dumlao=>Unleashing

the Healing Potential: Abdominal Trauma for providing great C S V − E D A Do checkout

organ_columns = ['bowel', 'extravasation', 'kidney', 'liver',

'spleen']

organ_counts = pd.DataFrame()
organ_counts['Organ'] = train_csv.columns[1:]
organ_counts["count"] = [0 for _ in range(organ_counts.shape[0])]
for index , column in enumerate(train_csv.columns[1:]):
organ_counts['count'][index] = train_csv[column].sum()

plt.figure(figsize=(10, 3))
sns.barplot(data=organ_counts.sort_values(by=['count']), x='Organ',
y='count')
plt.xticks(rotation=90)
plt.title("Distribution of Injury")
plt.xlabel("Injury --->")
plt.ylabel("Count --->")
plt.show()

Thanks to Franklin Shih0617=>RSNA Abdominal Trauma Detect EDA animation for providing
great A n im a t i o n s . Do checkout

file_1 =
['/kaggle/input/rsna-2023-abdominal-trauma-detection/train_images/
10004/21057/' + file
for file in os.listdir('/kaggle/input/rsna-2023-abdominal-
trauma-detection/train_images/10004/21057')]
file_2 =
['/kaggle/input/rsna-2023-abdominal-trauma-detection/train_images/
10004/51033/' + file
for file in os.listdir('/kaggle/input/rsna-2023-abdominal-
trauma-detection/train_images/10004/51033')]
sample_files = file_1 + file_2

sample_vid = [pydicom.dcmread(file).pixel_array for file in

tqdm.tqdm(sample_files , total = len(sample_files))]

fig, ax = plt.subplots()
im = ax.imshow(sample_vid[0], cmap=plt.cm.bone)

update = lambda i : im.set_array(sample_vid[i])

ani = animation.FuncAnimation(fig, update,

frames=range(len(sample_vid)), repeat=True)

HTML(ani.to_jshtml())

100%|██████████| 2066/2066 [00:45<00:00, 45.79it/s]

<IPython.core.display.HTML object>
3 | Preprocessing 🔨
Our data is in Dicom File, but we want them in NPY format, which can take a lot of time, but lets
try for the first 80 inputs

train_list = sorted(os.listdir('/kaggle/input/rsna-2023-abdominal-
trauma-detection/train_images'))[:79]

for folder_1 in tqdm.tqdm(train_list , total = len(train_list)):

folder_1_list = sorted(os.listdir('/kaggle/input/rsna-2023-
abdominal-trauma-detection/train_images/' + folder_1))

os.makedirs('/kaggle/working/Inputs/' + folder_1 + '/')

lis = list()

for folder_2 in folder_1_list:

folder_2_list = sorted(os.listdir('/kaggle/input/rsna-2023-
abdominal-trauma-detection/train_images/' + folder_1 + '/' +
folder_2))

for files in folder_2_list:

file = pydicom.read_file('/kaggle/input/rsna-2023-
abdominal-trauma-detection/train_images/' + folder_1 + '/' + folder_2
+ '/' + files)

arr = file.pixel_array
arr = np.resize(arr , new_shape = (512 , 512))
lis.append(arr)

np.save('/kaggle/working/Inputs/' + folder_1 + '/' + 'file',

np.stack(lis , -1))

100%|██████████| 79/79 [21:17<00:00, 16.17s/it]

4 | Ending ☑️
THAT IT FOR TODAY GUYS

WE WILL GO DEEPER INTO THE DATA IN THE UPCOMING VERSIONS

PLEASE COMMENT YOUR THOUGHTS, HIHGLY APPRICIATED

DONT FORGET TO MAKE AN UPVOTE, IF YOU LIKED MY WORK :)

PEACE OUT !!!! :)

Manual For 3D Nls PDF
100% (4)
Manual For 3D Nls PDF
73 pages
DICOM Processing and Segmentation in Python
No ratings yet
DICOM Processing and Segmentation in Python
18 pages
SAP - ABAP CDS Development User Guide: Warning
No ratings yet
SAP - ABAP CDS Development User Guide: Warning
91 pages
Notebooklien 1
No ratings yet
Notebooklien 1
1 page
Breast Cancer Prdiction
No ratings yet
Breast Cancer Prdiction
16 pages
Cancer 241029 150515
No ratings yet
Cancer 241029 150515
99 pages
Practical 1
No ratings yet
Practical 1
7 pages
222AX066 DSHS Expt-4
No ratings yet
222AX066 DSHS Expt-4
5 pages
Breastcancer
No ratings yet
Breastcancer
13 pages
ML Project - Binary - Colaboratory
No ratings yet
ML Project - Binary - Colaboratory
7 pages
Experiment - 12: Random Forest in Python
No ratings yet
Experiment - 12: Random Forest in Python
3 pages
Script Group8
No ratings yet
Script Group8
19 pages
Diabetes - Prediction - Project - Ipynb - Colab
No ratings yet
Diabetes - Prediction - Project - Ipynb - Colab
11 pages
Vertopal.com Heart Failure Prediction With Detailed Headings
No ratings yet
Vertopal.com Heart Failure Prediction With Detailed Headings
12 pages
Artificial Neural Network (Ann)
No ratings yet
Artificial Neural Network (Ann)
1 page
Image Classification With The MNIST Dataset: Objectives
No ratings yet
Image Classification With The MNIST Dataset: Objectives
21 pages
KNN For Classification
No ratings yet
KNN For Classification
4 pages
Shailesh020902@gmail - Com 1
No ratings yet
Shailesh020902@gmail - Com 1
1 page
3 Par
No ratings yet
3 Par
102 pages
MRI Tumor Detection
No ratings yet
MRI Tumor Detection
82 pages
Support Vector Machines com Python
No ratings yet
Support Vector Machines com Python
13 pages
Assignment Instructions:: Import As
No ratings yet
Assignment Instructions:: Import As
1 page
Shark Tank Deal Prediction - Uudhya - Dec 2019
No ratings yet
Shark Tank Deal Prediction - Uudhya - Dec 2019
16 pages
45B AIML Practical 08
No ratings yet
45B AIML Practical 08
10 pages
Codigo Dicom
No ratings yet
Codigo Dicom
2 pages
Dataset Source Kaggle-1
No ratings yet
Dataset Source Kaggle-1
4 pages
churn_V2
No ratings yet
churn_V2
15 pages
How To Work With Nls Devices: The First Level - Visually
No ratings yet
How To Work With Nls Devices: The First Level - Visually
9 pages
Data Science Code
No ratings yet
Data Science Code
29 pages
Cáncer de Mama Con 8 Componentes Principales - Jupyter Notebook
No ratings yet
Cáncer de Mama Con 8 Componentes Principales - Jupyter Notebook
7 pages
Correlation: Import As Import As Import As Import As From Import From Import Import Matplotlib Import
No ratings yet
Correlation: Import As Import As Import As Import As From Import From Import Import Matplotlib Import
1 page
MNIST Digit Classification Using NN
No ratings yet
MNIST Digit Classification Using NN
16 pages
LungCT-Diagnosis SurvivalData Journal - Pone.0118261.s010
No ratings yet
LungCT-Diagnosis SurvivalData Journal - Pone.0118261.s010
4 pages
DL EXP2.ipynb - Colaboratory
No ratings yet
DL EXP2.ipynb - Colaboratory
6 pages
1FsWES7YJDERHD-bZ2ujFakbQyzi6 Yin
No ratings yet
1FsWES7YJDERHD-bZ2ujFakbQyzi6 Yin
9 pages
AI Medical Diagnosis Week 01
No ratings yet
AI Medical Diagnosis Week 01
5 pages
Clinical Data
No ratings yet
Clinical Data
22 pages
Hcin620 m6 Lab6 Hanifahmutesi-Finalproject
No ratings yet
Hcin620 m6 Lab6 Hanifahmutesi-Finalproject
5 pages
Arturo
No ratings yet
Arturo
7 pages
45 AIML Practical 09
No ratings yet
45 AIML Practical 09
6 pages
IndahAgustienML 7 1-CNN
No ratings yet
IndahAgustienML 7 1-CNN
11 pages
My Code
No ratings yet
My Code
7 pages
Entropy: Symbolic Entropy Analysis and Its Applications
No ratings yet
Entropy: Symbolic Entropy Analysis and Its Applications
4 pages
Implement SOFM For Character Recognition - Watermark
No ratings yet
Implement SOFM For Character Recognition - Watermark
9 pages
Pima Indian Diabetes Prediction
No ratings yet
Pima Indian Diabetes Prediction
22 pages
Bio-Signal Analysis For Smoking
No ratings yet
Bio-Signal Analysis For Smoking
1 page
CT_LIVER_FINAL_PPT
No ratings yet
CT_LIVER_FINAL_PPT
30 pages
LP Practical ! Jupyter Notebook
No ratings yet
LP Practical ! Jupyter Notebook
6 pages
AIML Report.
No ratings yet
AIML Report.
12 pages
Yes/No Yes/No Yes/No Yes/No Yes/No Yes/No Yes Yes Yes Yes Yes Yes
No ratings yet
Yes/No Yes/No Yes/No Yes/No Yes/No Yes/No Yes Yes Yes Yes Yes Yes
162 pages
BRAIN TUMOR DETECTION
No ratings yet
BRAIN TUMOR DETECTION
23 pages
A2 - Jupyter Notebook PDF
No ratings yet
A2 - Jupyter Notebook PDF
8 pages
RX
No ratings yet
RX
2 pages
Exp 4
No ratings yet
Exp 4
21 pages
KNN - Jupyter Notebook (1)
No ratings yet
KNN - Jupyter Notebook (1)
7 pages
Diabetis Project
No ratings yet
Diabetis Project
7 pages
Dovdush_KN-305_lab3
No ratings yet
Dovdush_KN-305_lab3
2 pages
# Import Plotting Libraries: in (1) : Import Pandas As PD
No ratings yet
# Import Plotting Libraries: in (1) : Import Pandas As PD
13 pages
Project 3 - Diabetes Prediction.ipynb - Colab
No ratings yet
Project 3 - Diabetes Prediction.ipynb - Colab
4 pages
breast cancer-1
No ratings yet
breast cancer-1
16 pages
Anatomy and Physiology for Nurses: Essential Principles
From Everand
Anatomy and Physiology for Nurses: Essential Principles
Sterling Education
No ratings yet
YOLO You Only Look Once For Object
No ratings yet
YOLO You Only Look Once For Object
1 page
Anime Gan
No ratings yet
Anime Gan
1 page
Understanding The Transformer Archi
No ratings yet
Understanding The Transformer Archi
2 pages
Sample 5
No ratings yet
Sample 5
105 pages
Ayush Singhal Resume
No ratings yet
Ayush Singhal Resume
2 pages
Understanding The Competition Commonlit
No ratings yet
Understanding The Competition Commonlit
37 pages
jl168
No ratings yet
jl168
12 pages
Hard Disk Drive Destruction Devices: NSA/CSS Evaluated Products List For
No ratings yet
Hard Disk Drive Destruction Devices: NSA/CSS Evaluated Products List For
4 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
S2 - Introduction To ECPS
No ratings yet
S2 - Introduction To ECPS
52 pages
Chapter Address - RE Kolkata
No ratings yet
Chapter Address - RE Kolkata
1 page
Pendon Group
No ratings yet
Pendon Group
19 pages
Joint Khmer Word Segmentation and Part-Of-Speech T
No ratings yet
Joint Khmer Word Segmentation and Part-Of-Speech T
12 pages
Get Our Special Grand Bundle PDF Course For All Upcoming Bank Exams
No ratings yet
Get Our Special Grand Bundle PDF Course For All Upcoming Bank Exams
13 pages
Manual Eng H-Radio 4.16
No ratings yet
Manual Eng H-Radio 4.16
24 pages
150a Single Phase Series Multiple Dual Voltage Switches Catalog Ca800007en
No ratings yet
150a Single Phase Series Multiple Dual Voltage Switches Catalog Ca800007en
8 pages
Etap
No ratings yet
Etap
512 pages
design programing logic final exam
No ratings yet
design programing logic final exam
8 pages
Experiment 4 WDM Fiber Optic Link: Aim Components Used
No ratings yet
Experiment 4 WDM Fiber Optic Link: Aim Components Used
3 pages
A Novel Pipeline Leak Detection Approach Independent of Prior Failure Information
No ratings yet
A Novel Pipeline Leak Detection Approach Independent of Prior Failure Information
12 pages
RP PLC PendingTask01062024
No ratings yet
RP PLC PendingTask01062024
1,273 pages
ESD TR3.0-01-02 (13-02) Alternate Techniques for Measuring Ionizer
No ratings yet
ESD TR3.0-01-02 (13-02) Alternate Techniques for Measuring Ionizer
13 pages
GT100 Turbojet Datasheet
No ratings yet
GT100 Turbojet Datasheet
3 pages
Dirichlet Theorem
No ratings yet
Dirichlet Theorem
13 pages
Java 11 Web Applications and Java Ee
No ratings yet
Java 11 Web Applications and Java Ee
212 pages
CD40106BM/CD40106BC Hex Schmitt Trigger: General Description Features
No ratings yet
CD40106BM/CD40106BC Hex Schmitt Trigger: General Description Features
8 pages
Huawei Nova 17i
No ratings yet
Huawei Nova 17i
3 pages
HTML
No ratings yet
HTML
94 pages
Week-04Assignment MCQ
No ratings yet
Week-04Assignment MCQ
5 pages
Automated Essay Writing An AIED Opinion
No ratings yet
Automated Essay Writing An AIED Opinion
8 pages
SAP FICO Supporter Handbook
100% (1)
SAP FICO Supporter Handbook
35 pages
Fast Steering Mirror Control Using Embedded Self Learning Fuzzy Controller
No ratings yet
Fast Steering Mirror Control Using Embedded Self Learning Fuzzy Controller
14 pages
Lund University EIEN50 - Automation Simulation 1
No ratings yet
Lund University EIEN50 - Automation Simulation 1
18 pages
Mindmanager End User License Agreement
No ratings yet
Mindmanager End User License Agreement
11 pages
How To Use Vivo Rewards: A Student Guide
No ratings yet
How To Use Vivo Rewards: A Student Guide
12 pages

Understanding The Competition Rsna

Uploaded by

Understanding The Competition Rsna

Uploaded by

RSNA : RadioLogical Society of North America

The R S N A 2023 A b d o mi n a l T r a u m a D e t e c t io n competition is a machine learning

Thanks to Theo Viel for providing small chunks of data to experiment

Our main data is situated in a CSV file −− −> ¿ train.csv

patient_id bowel_healthy bowel_injury extravasation_healthy \

extravasation_injury kidney_healthy kidney_low kidney_high \

• P a t i e n t I D patient_id - THis column contains the ids of 3 , 147 patients.

T r a i n I m a g e s is a big 3 l e v e l nested - directory, contianing images for the specific patient .

We can read the DCIM file through pydicom

import matplotlib.pyplot as plt

Thanks to Yuanjian Li=>EDA with Animation BeginnerFriendly/Jocelyn Dumlao=>Unleashing

organ_columns = ['bowel', 'extravasation', 'kidney', 'liver',

sample_vid = [pydicom.dcmread(file).pixel_array for file in

update = lambda i : im.set_array(sample_vid[i])

ani = animation.FuncAnimation(fig, update,

100%|██████████| 2066/2066 [00:45<00:00, 45.79it/s]

for folder_1 in tqdm.tqdm(train_list , total = len(train_list)):

os.makedirs('/kaggle/working/Inputs/' + folder_1 + '/')

for folder_2 in folder_1_list:

for files in folder_2_list:

np.save('/kaggle/working/Inputs/' + folder_1 + '/' + 'file',

100%|██████████| 79/79 [21:17<00:00, 16.17s/it]

WE WILL GO DEEPER INTO THE DATA IN THE UPCOMING VERSIONS

PLEASE COMMENT YOUR THOUGHTS, HIHGLY APPRICIATED

DONT FORGET TO MAKE AN UPVOTE, IF YOU LIKED MY WORK :)

You might also like