Biostatistics: Catalin Negoita C.negoita@ugal
Attendance is mandatory
E-mail: Subject/Body/Signature
Prevalence
Rate
Incidence
Risk
Dispersion
• Range
• Variance
• Standard deviation
• Coefficient of variation
• Standard error
Location
• Quartiles
• Percentiles
Symmetry
• Skewness
• Kurtosis
Quantitative variables:
Measures of central tendency
Measures of dispersion/spread
Measures of symmetry
Measures of location
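As a minimal sketch, the four groups of measures for a quantitative variable can be computed with Python's standard library. The function name `describe` and the sample values are illustrative assumptions, not part of the course material. Skewness and kurtosis use the simple moment-based (population) formulas, which may differ slightly from the sample-corrected values a spreadsheet reports.

```python
import math
import statistics


def describe(values):
    """Return common descriptive measures for a list of numbers."""
    n = len(values)
    mean = statistics.mean(values)
    sd = statistics.stdev(values)  # sample standard deviation (n - 1)
    q1, q2, q3 = statistics.quantiles(values, n=4)  # quartile cut points

    # Central moments, used for moment-based skewness and excess kurtosis
    m2 = sum((x - mean) ** 2 for x in values) / n
    m3 = sum((x - mean) ** 3 for x in values) / n
    m4 = sum((x - mean) ** 4 for x in values) / n

    return {
        # central tendency
        "mean": mean,
        "median": statistics.median(values),
        # dispersion
        "range": max(values) - min(values),
        "variance": statistics.variance(values),  # sample variance
        "sd": sd,
        "cv": sd / mean,            # coefficient of variation
        "se": sd / math.sqrt(n),    # standard error of the mean
        # location
        "q1": q1,
        "q3": q3,
        # symmetry
        "skewness": m3 / m2 ** 1.5,
        "kurtosis": m4 / m2 ** 2 - 3,  # excess kurtosis
    }


# Illustrative sample (hypothetical ages)
ages = [40, 49, 37, 48]
summary = describe(ages)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```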
DATASET
File types: *.csv, *.xls / *.xlsx
Special characters:
\n new line (LF)
\r carriage return (CR)
\t tab character
\0 null character
\xddd special character with code ddd
Separators: tab/space/cr/ls/
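Before importing a text file it can help to check which line-ending convention it uses. The following is a small sketch under the assumption that the file is read as raw bytes; the function name `count_line_endings` and the sample content are hypothetical.

```python
def count_line_endings(data: bytes) -> dict:
    """Count CRLF, bare LF and bare CR line endings in raw bytes."""
    crlf = data.count(b"\r\n")
    lf = data.count(b"\n") - crlf   # LF not preceded by CR
    cr = data.count(b"\r") - crlf   # CR not followed by LF
    return {"CRLF": crlf, "LF": lf, "CR": cr}


# Hypothetical sample: two Windows-style line endings and one Unix-style
sample = b"age,sex\r\n40,1\r\n49,0\n"
counts = count_line_endings(sample)
print(counts)
```

For a real file, the bytes could be obtained with `open(path, "rb").read()`.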
https://en.wikipedia.org/wiki/Newline
https://archive.ics.uci.edu/ml/datasets/Heart+Disease
http://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/
HEART DISEASE DATA SET
Source
Data Set Information
Attribute Information
Data Folder - hungarian.data
Data Set Description - heart-disease.names
https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/heart-disease.names
https://archive.ics.uci.edu/ml/machine-learning-databases/heart-disease/hungarian.data
Notepad++
1
1254 0 40 1 1 0 0 -9 2 140 0 289 -9 -9 -9 0 -9 -9 0 12 16 84 0 0 0 0 0 150 18 -9 7 172 86 200 110 140 86 0 0 0 -9 26 20 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 12 20 84 0 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 1 1 1 1 1 -9. -9. name
1255 0 49 0 1 0 0 -9 3 160 1 180 -9 -9 -9 0 -9 -9 0 11 16 84 0 0 0 0 0 -9 10 9 7 156 100 220 106 160 90 0 0 1 2 14 13 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 11 20 84 1 -9 -9 2 -9 -9 -9 -9 -9 -9 -9 1 1 1 1 1 -9. -9. name
1256 0 37 1 1 0 0 -9 2 130 0 283 -9 -9 -9 0 -9 -9 1 11 21 84 0 0 0 0 0 100 10 -9 5 98 58 180 100 130 80 0 0 0 -9 17 14 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 11 26 84 0 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 1 1 1 1 1 -9. -9. name
1257 0 48 0 1 1 1 -9 4 138 0 214 -9 -9 -9 0 -9 -9 0 9 21 84 0 0 0 0 0 50 5 4 4 108 54 210 106 138 86 1 0 1.5 2 19 22 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 -9 9 30 84 3 -9 2 -9 -9 2 -9 -9 -9 2 -9 1 1 1 1 1 -9. -9. name
3 File -> Save As
4 Excel -> Open File
5 Convert text to columns
6 File -> Save As
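The manual Notepad++/Excel steps above can also be approximated in a short script. This is only a sketch under stated assumptions: each record is taken to be a run of whitespace-separated values that ends with the token `name`, and `-9` (or `-9.`) is treated as a missing-value code. The function names and the line layout of the sample text are hypothetical.

```python
import csv
import io

MISSING = "-9"  # assumed missing-value code


def records_from_text(text, terminator="name"):
    """Group whitespace-separated tokens into records ending at `terminator`."""
    records, current = [], []
    for token in text.split():
        if token == terminator:
            records.append(current)
            current = []
        else:
            current.append(token)
    return records


def to_csv(records):
    """Write records as CSV text, replacing the missing-value code with blanks."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    for rec in records:
        # "-9." also counts as missing; values such as "1.5" are unaffected
        writer.writerow("" if t.rstrip(".") == MISSING else t for t in rec)
    return buf.getvalue()


# One record, spread over several lines (assumed layout)
raw = """1254 0 40 1 1 0 0
-9 2 140 0 289 -9 -9 -9
0 -9 -9 0 12 16 84 0
0 0 0 0 150 18 -9 7
172 86 200 110 140 86 0 0
0 -9 26 20 -9 -9 -9 -9
-9 -9 -9 -9 -9 -9 -9 12
20 84 0 -9 -9 -9 -9 -9
-9 -9 -9 -9 -9 1 1 1
1 1 -9. -9. name
"""

records = records_from_text(raw)
csv_text = to_csv(records)
print(len(records), "record(s),", len(records[0]), "fields")
```

The resulting text can be saved with a `.csv` extension and opened directly in Excel, which skips the text-to-columns step.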
Qualitative variables
[Column chart: Total count per category (0, 1, (blank)); vertical axis 0 to 250]
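A qualitative variable is usually summarised by counting how often each category occurs. The sketch below builds such a frequency table; the function name `frequency_table` and the sample values are hypothetical, and empty cells are grouped under "(blank)" as a spreadsheet pivot table would do.

```python
from collections import Counter


def frequency_table(values):
    """Return {category: (count, relative frequency)} for categorical values."""
    counts = Counter("(blank)" if v in ("", None) else str(v) for v in values)
    total = sum(counts.values())
    return {category: (n, n / total) for category, n in counts.items()}


# Hypothetical coded column with one empty cell
column = ["0", "1", "1", "", "0", "1"]
table = frequency_table(column)
for category, (n, share) in sorted(table.items()):
    print(f"{category}: {n} ({share:.1%})")
```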
HISTOGRAM
Indian Liver Patient Dataset (ILPD).csv
Insert column Age(A) into a new sheet
Min value: =MIN(A:A)
Max value: =MAX(A:A)
Bin range: =min+10 (…)
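The same min/max/bin logic can be sketched in code. This assumes integer ages and bins of width 10 that start at the minimum value, which is one reading of the bin-range step above; the function name `histogram` and the sample ages are hypothetical.

```python
def histogram(values, width=10):
    """Count integer values in consecutive bins of `width`, starting at min."""
    lo, hi = min(values), max(values)

    # Create every bin from the minimum up to the one containing the maximum
    counts = {}
    start = lo
    while start <= hi:
        counts[(start, start + width - 1)] = 0
        start += width

    # Assign each value to its bin
    for v in values:
        bin_start = lo + ((v - lo) // width) * width
        counts[(bin_start, bin_start + width - 1)] += 1
    return counts


# Hypothetical ages
ages = [4, 17, 25, 33, 45, 46, 60, 72, 90]
bins = histogram(ages)
for (low, high), n in bins.items():
    print(f"{low}-{high}: {'#' * n}")
```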
HISTOGRAM
Indian Liver Patient Dataset (ILPD).csv
Insert first new line
Insert -> Chart -> 3-D Column
[3-D column chart: count per age bin (4-13, 14-23, 24-33, 34-43, 44-53, 54-63, 64-73, 74-83, 84-93, (blank)); legend: Female, (blank); vertical axis 0 to 100]