0% found this document useful (0 votes)
74 views

Biostatistica: Catalin Negoita C.negoita@ugal

The document provides information about biostatistics including descriptive statistics for quantitative and qualitative variables, data formats, and links to datasets including a heart disease dataset and Indian liver patient dataset. It discusses measures of central tendency, dispersion, symmetry, and localization for quantitative variables and proportions, rates, and percentages for qualitative variables. Formats for CSV and Excel files are also listed along with notes on using the datasets in Excel.

Uploaded by

Suruianu Florina
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
74 views

Biostatistica: Catalin Negoita C.negoita@ugal

The document provides information about biostatistics including descriptive statistics for quantitative and qualitative variables, data formats, and links to datasets including a heart disease dataset and Indian liver patient dataset. It discusses measures of central tendency, dispersion, symmetry, and localization for quantitative variables and proportions, rates, and percentages for qualitative variables. Formats for CSV and Excel files are also listed along with notes on using the datasets in Excel.

Uploaded by

Suruianu Florina
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 17

BIOSTATISTICA

Catalin Negoita
[email protected]
 Prezenta obligatorie

 E-mail: Subiect/Continut/Semnatura

 Fisier atasat: nume_prenume_grupa_specializarea_an-studiu_subiect


STATISTICA DESCRIPTIVA
 Variabile calitative: • Media aritmetică
• Mediana
 Proportia Centralitate
• Modulul
 Raportul • Valoarea centrală

 Prevalenta
• Amplitudinea
 Rata
• Varianța • Cvartile
 Incidenta • Deviația standard Dispersie Localizare
• Percentile
• Coeficientul de variație
 Riscul
• Eroarea standard

• Asimetria
Simetrie
• Boltirea
 Variabile cantitative :
 Măsusi de centralitate
 Măsuri de dispersie/împrăştiere
 Măsuri de simetrie
 Măsuri de localizare
DATASET
 *.csv \n new line (LF)
\r carriage return (CR)
\t tab character
 *.xls / *.xlsx \0 null character
\xddd special character with code ddd
 tab/space/cr/ls/

https://en.wikipedia.org/wiki/Newline

 https://archive.ics.uci.edu/ml/datasets/Heart+Disease
 http://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/
HEART DISEASE DATA SET
 Source
 Data Set Information

 Attribute Information
 Data Folder - ungarian.data
 Data Set Description heart-disease.names

attribute_1 attribute_2 attribute_3 attribute_4 attribute_5 attribute_6 1254 0 40 1 1 0 0 -9 2 140 0 289 -9 -9 -9 0 -9 -9 0


value_1 value_1 value_1 value_1 value_1 value_1 12 16 84 0 0 0 0 0 150 18 -9 7 172 86 200 110 140
value_2 value_2 value_2 value_2 value_2 value_2 86 0 0 0 -9 26 20 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 12
value_3 value_3 value_3 value_3 value_3 value_3 20 84 0 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 1 1 1 1 1 -9. -9.
value_4 value_4 value_4 value_4 value_4 value_4 name 1255 0 49 0 1 0 0 -9 3 160 1 180 -9 -9 -9 0 -9
value_5 value_5 value_5 value_5 value_5 value_5 -9 0 11 16 84 0 0 0 0 0 -9 10 9 7 156 100 220 106
value_6 value_6 value_6 value_6 value_6 value_6 160 90 0 0 1 2 14 13 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9
value_7 value_7 value_7 value_7 value_7 value_7 11 20 84 1 -9 -9 2 -9 -9 -9 -9 -9 -9 -9 1 1 1 1 1 -9.
value_8 value_8 value_8 value_8 value_8 value_8 -9. name
value_9 value_9 value_9 value_9 value_9 value_9
value_10 value_10 value_10 value_10 value_10 value_10
value_11 value_11 value_11 value_11 value_11 value_11
value_12 value_12 value_12 value_12 value_12 value_12

https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/heart-disease.names
https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/hungarian.data
Notepad++
1

1254 0 40 1 1 0 0-9 2 140 0 289 -9 -9 -90 -9 -9 0 12 16 84 00 0 0 0 150 18 -9 7172 86 200 110 140 86 0 00 -9 26 20 -9 -9 -9 -9-9 -9 -9 -9 -9 -9 -9 1220 84 0 -9 -9 -9 -9 -9-9 -9 -9 -9 -9 1 1 11 1 -9. -9. name1255 0 4

1254 0 40 1 1 0 0-9 2 140 0 289 -9 -9 -90 -9 -9 0 12 16 84 00 0 0 0 150 18 -9 7172 86 200 110 140 86 0 00 -9 26 20 -9 -9 -9 -9-9 -9 -9 -9 -9 -9 -9 1220 84 0 -9 -9 -9 -9 -9-9 -9 -9 -9 -9 1 1 11 1 -9. -9. name
1255 0 49 0 1 0 0-9 3 160 1 180 -9 -9 -90 -9 -9 0 11 16 84 00 0 0 0 -9 10 9 7156 100 220 106 160 90 0 01 2 14 13 -9 -9 -9 -9-9 -9 -9 -9 -9 -9 -9 1120 84 1 -9 -9 2 -9 -9-9 -9 -9 -9 -9 1 1 11 1 -9. -9. name
1256 0 37 1 1 0 0-9 2 130 0 283 -9 -9 -90 -9 -9 1 11 21 84 00 0 0 0 100 10 -9 598 58 180 100 130 80 0 00 -9 17 14 -9 -9 -9 -9-9 -9 -9 -9 -9 -9 -9 1126 84 0 -9 -9 -9 -9 -9-9 -9 -9 -9 -9 1 1 11 1 -9. -9. name
1257 0 48 0 1 1 1-9 4 138 0 214 -9 -9 -90 -9 -9 0 9 21 84 00 0 0 0 50 5 4 4108 54 210 106 138 86 1 01.5 2 19 22 -9 -9 -9 -9-9 -9 -9 -9 -9 -9 -9 930 84 3 -9 2 -9 -9 2-9 -9 -9 2 -9 1 1 11 1 -9. -9. name
3 File -> Save As
4 Excel -> Open File
4 Excel -> Open File
5 Convert text to columns
6 File -> Save As

Save As type: Excel Workbook(*.xlsx)


TASK
 Introduceti numele atributelor din fisierul heart-disease.names in
tabelul generat in excel pe primul rand.

Variabile calitative

Raportul x/y Grupa 30 / 10 Barbati


Proportia x/x+y
1 Select Column Gender -> Copy / Paste -> New Worksheet

2 Insert –> New Row(1) -> Gender

3 Select Column A -> Insert -> Pivot Table


4 Add Gender to Row Labels / Values

5 Insert Column Chart

Total

250

200
Total
150

100

50

0
0 1 (blank)
HISTOGRAMA
 Indian Liver Patient Dataset (ILPD).csv
 Insert column Age(A) into a new sheet
 Min value: =MIN(A:A)
 Max value: =MAX(A:A)
 Bin range: =min+10 (…)
HISTOGRAMA
 Indian Liver Patient Dataset (ILPD).csv
 Insert first new line

 Insert ->pivot table

 Insert->Chart->3 D column

100
80
60
40 Female
20 (blank)
0
)
nk 13 3
la 4- 4-2 4-33 -43 53 3 Female
(b - 3
1 2 34 44 54-6 4-7 -83 -93
6 7 4 84

You might also like