EDA report

Mode: Compare

comparison

Overview

Snapshot
Dataset
left
Rows500
Columns24
Duplicates0
Dataset
right
Rows809
Columns24
Duplicates0

Dataset Summary

Composition

left composition

Rows 500
Columns 24
Duplicate rows 0
Memory 121.8K
Type counts
Numeric 8
Categorical 6
Text 4
Datetime 2
Boolean 1

right composition

Rows 809
Columns 24
Duplicate rows 0
Memory 180.7K
Type counts
Numeric 8
Categorical 7
Text 3
Datetime 2
Boolean 1

Variables

Profiles

pclass

Compare column

numeric
left
Values500
Missing0 (0.0000)
Distinct3
Zeroes0
Stats
Min1
Max3
Mean1.9620
Median2
Std0.8721
Variance0.7606
IQR2
Outlier rate0
Quantiles
0.25001
0.50002
0.75003
Most frequent values
1200
3181
2119
Smallest values
1200
2119
3181
Largest values
3181
2119
1200
right
Values809
Missing0 (0.0000)
Distinct3
Zeroes0
Stats
Min1
Max3
Mean2.5006
Median3
Std0.7444
Variance0.5541
IQR1
Outlier rate0
Quantiles
0.25002
0.50003
0.75003
Most frequent values
3528
2158
1123
Smallest values
1123
2158
3528
Largest values
3528
2158
1123
Histogram

survived

Compare column

boolean
left
Values500
Missing0 (0.0000)
Distinct1
Zeroes0
Top categories
True500
right
Values809
Missing0 (0.0000)
Distinct1
Zeroes0
Top categories
False809
Histogram

name

Compare column

text
left
Values500
Missing0 (0.0000)
Distinct500
Zeroes0
Most frequent values
Abbott, Mrs. Stanton (Rosa Hunt)
Count: 1
Abelseth, Miss. Karen Marie
Count: 1
Abelseth, Mr. Olaus Jorgensen
Count: 1
Abelson, Mrs. Samuel (Hannah Wizosky)
Count: 1
Abrahamsson, Mr. Abraham August Johannes
Count: 1
Abrahim, Mrs. Joseph (Sophie Halaut Easu)
Count: 1
Aks, Master. Philip Frank
Count: 1
Aks, Mrs. Sam (Leah Rosen)
Count: 1
Albimona, Mr. Nassef Cassem
Count: 1
Allen, Miss. Elisabeth Walton
Count: 1
Text length stats
Mean31.392
Median29
Min12
Max82
right
Values809
Missing0 (0.0000)
Distinct808
Zeroes0
Most frequent values
Kelly, Mr. James
Count: 2
Abbing, Mr. Anthony
Count: 1
Abbott, Master. Eugene Joseph
Count: 1
Abbott, Mr. Rossmore Edward
Count: 1
Abelson, Mr. Samuel
Count: 1
Adahl, Mr. Mauritz Nils Martin
Count: 1
Adams, Mr. John
Count: 1
Ahlin, Mrs. Johan (Johanna Persdotter Larsson)
Count: 1
Aldworth, Mr. Charles Augustus
Count: 1
Alexander, Mr. William
Count: 1
Text length stats
Mean24.497
Median24
Min12
Max57
Histogram

sex

Compare column

categorical
left
Values500
Missing0 (0.0000)
Distinct2
Zeroes0
Top categories
female339
male161
right
Values809
Missing0 (0.0000)
Distinct2
Zeroes0
Top categories
male682
female127
Histogram

age

Compare column

numeric
left
Values427
Missing73 (0.1460)
Distinct71
Zeroes0
Stats
Min0.1700
Max80.000
Mean28.918
Median28.000
Std15.044
Variance226.32
IQR18.000
Outlier rate0.0047
Quantiles
0.250020.000
0.500028.000
0.750038.000
Most frequent values
24.00022
22.00020
30.00015
18.00014
36.00014
45.00014
27.00013
29.00013
35.00013
31.00012
Smallest values
0.17001
0.42001
0.67001
0.75002
0.83003
Largest values
80.0001
76.0001
64.0002
63.0002
62.0002
right
Values619
Missing190 (0.2349)
Distinct90
Zeroes0
Stats
Min0.3300
Max74.000
Mean30.545
Median28.000
Std13.911
Variance193.52
IQR18.000
Outlier rate0.0113
Quantiles
0.250021.000
0.500028.000
0.750039.000
Most frequent values
21.00030
18.00025
24.00025
30.00025
28.00024
22.00023
25.00023
26.00019
19.00018
27.00017
Smallest values
0.33001
0.75001
1.00003
2.00008
3.00002
Largest values
74.0001
71.0002
70.5001
70.0002
67.0001
Histogram

sibsp

Compare column

numeric
left
Values500
Missing0 (0.0000)
Distinct5
Zeroes309
Stats
Min0
Max4
Mean0.4620
Median0
Std0.6845
Variance0.4686
IQR1
Outlier rate0.0180
Quantiles
0.25000
0.50000
0.75001
Most frequent values
0309
1163
219
36
43
Smallest values
0309
1163
219
36
43
Largest values
43
36
219
1163
0309
right
Values809
Missing0 (0.0000)
Distinct7
Zeroes582
Stats
Min0
Max8
Mean0.5216
Median0
Std1.2097
Variance1.4634
IQR1
Outlier rate0.0593
Quantiles
0.25000
0.50000
0.75001
Most frequent values
0582
1156
223
419
314
89
56
Smallest values
0582
1156
223
314
419
Largest values
89
56
419
314
223
Histogram

parch

Compare column

numeric
left
Values500
Missing0 (0.0000)
Distinct6
Zeroes336
Stats
Min0
Max5
Mean0.4760
Median0
Std0.7755
Variance0.6014
IQR1
Outlier rate0.0140
Quantiles
0.25000
0.50000
0.75001
Most frequent values
0336
1100
257
35
41
51
Smallest values
0336
1100
257
35
41
Largest values
51
41
35
257
1100
right
Values809
Missing0 (0.0000)
Distinct8
Zeroes666
Stats
Min0
Max9
Mean0.3288
Median0
Std0.9118
Variance0.8313
IQR0
Outlier rate0
Quantiles
0.25000
0.50000
0.75000
Most frequent values
0666
170
256
45
55
33
62
92
Smallest values
0666
170
256
33
45
Largest values
92
62
55
45
33
Histogram

ticket

Compare column

text
left
Values500
Missing0 (0.0000)
Distinct347
Zeroes0
Most frequent values
1601
Count: 6
PC 17608
Count: 6
16966
Count: 5
113760
Count: 4
19950
Count: 4
230136
Count: 4
24160
Count: 4
2666
Count: 4
PC 17755
Count: 4
110152
Count: 3
Text length stats
Mean6.6360
Median6
Min4
Max18
right
Values809
Missing0 (0.0000)
Distinct668
Zeroes0
Most frequent values
CA. 2343
Count: 11
CA 2144
Count: 8
3101295
Count: 7
347082
Count: 7
S.O.C. 14879
Count: 7
347088
Count: 6
382652
Count: 6
349909
Count: 5
4133
Count: 5
W./C. 6608
Count: 5
Text length stats
Mean6.8863
Median6
Min3
Max18
Histogram

fare

Compare column

numeric
left
Values500
Missing0 (0.0000)
Distinct183
Zeroes2
Stats
Min0.0000
Max512.33
Mean49.361
Median26.000
Std68.580
Variance4703.2
IQR46.617
Outlier rate0.0940
Quantiles
0.250011.133
0.500026.000
0.750057.750
Most frequent values
26.00019
7.750017
13.00017
10.50012
26.55010
7.92509
8.05009
7.22927
7.77507
23.0007
Smallest values
0.00002
3.17081
6.95001
6.97501
7.00001
Largest values
512.334
263.004
262.386
247.522
227.533
right
Values808
Missing1 (0.0012)
Distinct210
Zeroes15
Stats
Min0.0000
Max263.00
Mean23.354
Median10.500
Std34.124
Variance1164.4
IQR18.146
Outlier rate0.0879
Quantiles
0.25007.8542
0.500010.500
0.750026.000
Most frequent values
8.050051
7.895848
13.00042
7.750038
26.00031
10.50023
8.662520
7.775019
7.229217
7.250017
Smallest values
0.000015
4.01251
5.00001
6.23751
6.43753
Largest values
263.002
262.381
247.521
227.532
221.783
Histogram

cabin

Compare column

left: text
right: categorical
left
Values193
Missing307 (0.6140)
Distinct130
Zeroes0
Most frequent values
B57 B59 B63 B66
Count: 4
B96 B98
Count: 4
C23 C25 C27
Count: 4
F33
Count: 4
F4
Count: 4
A34
Count: 3
C101
Count: 3
C78
Count: 3
E101
Count: 3
E34
Count: 3
Text length stats
Mean3.7772
Median3
Min1
Max15
right
Values102
Missing707 (0.8739)
Distinct91
Zeroes0
Top categories
C22 C263
C1242
C23 C25 C272
C55 C572
C62
D2
D262
E462
F G732
F22

embarked

Compare column

categorical
left
Values498
Missing2 (0.0040)
Distinct4
Zeroes0
Top categories
S304
C150
Q44
right
Values809
Missing0 (0.0000)
Distinct3
Zeroes0
Top categories
S610
C120
Q79
Histogram

boat

Compare column

categorical
left
Values477
Missing23 (0.0460)
Distinct28
Zeroes0
Top categories
1339
1537
C37
1432
431
1029
527
326
1125
925
right
Values9
Missing800 (0.9889)
Distinct7
Zeroes0
Top categories
A4
121
141
B1
C1
D1
Histogram

body

Compare column

numeric
left
Values0
Missing500 (1.0000)
Distinct1
Zeroes0
Stats
Min0
Max0
Mean0
Median0
Std0
Variance0
IQR0
Outlier rate0
Quantiles
Most frequent values
Smallest values
Largest values
right
Values121
Missing688 (0.8504)
Distinct122
Zeroes0
Stats
Min1
Max328
Mean160.81
Median155
Std97.292
Variance9465.8
IQR184
Outlier rate0
Quantiles
0.250072
0.5000155
0.7500256
Most frequent values
11
41
71
91
141
151
161
171
181
191
Smallest values
11
41
71
91
141
Largest values
3281
3271
3221
3141
3121
Histogram

home.dest

Compare column

text
left
Values347
Missing153 (0.3060)
Distinct190
Zeroes0
Most frequent values
New York, NY
Count: 40
Cornwall / Akron, OH
Count: 8
Paris, France
Count: 8
Brooklyn, NY
Count: 5
London
Count: 5
Bryn Mawr, PA
Count: 4
Guntur, India / Benton Harbour, MI
Count: 4
Haverford, PA / Cooperstown, NY
Count: 4
Montreal, PQ
Count: 4
St Louis, MO
Count: 4
Text length stats
Mean18.934
Median16
Min5
Max50
right
Values398
Missing411 (0.5080)
Distinct260
Zeroes0
Most frequent values
New York, NY
Count: 24
London
Count: 9
Wiltshire, England Niagara Falls, NY
Count: 8
Belfast
Count: 7
Sweden Winnipeg, MN
Count: 7
Montreal, PQ
Count: 6
Bulgaria Chicago, IL
Count: 5
Philadelphia, PA
Count: 5
Rotherfield, Sussex, England Essex Co, MA
Count: 5
Austria
Count: 4
Text length stats
Mean19.367
Median18.500
Min5
Max49
Histogram

noon_time

Compare column

datetime
left
Values500
Missing0 (0.0000)
Distinct1
Zeroes0
Datetime range
Min12:00:00
Max12:00:00
Most frequent values
12:00:00500
right
Values809
Missing0 (0.0000)
Distinct1
Zeroes0
Datetime range
Min12:00:00
Max12:00:00
Most frequent values
12:00:00809
Histogram

always_null

Compare column

categorical
left
Values0
Missing500 (1.0000)
Distinct1
Zeroes0
right
Values0
Missing809 (1.0000)
Distinct1
Zeroes0

birthdate

Compare column

datetime
left
Values427
Missing73 (0.1460)
Distinct71
Zeroes0
Datetime range
Min1832-04-13
Max1912-02-11
Most frequent values
1888-04-1322
1890-04-1320
1882-04-1315
1867-04-1314
1876-04-1314
1894-04-1314
1877-04-1313
1883-04-1313
1885-04-1313
1881-04-1312
right
Values619
Missing190 (0.2349)
Distinct90
Zeroes0
Datetime range
Min1838-04-13
Max1911-12-15
Most frequent values
1891-04-1330
1882-04-1325
1888-04-1325
1894-04-1325
1884-04-1324
1887-04-1323
1890-04-1323
1886-04-1319
1893-04-1318
1876-04-1317
Histogram

age_duration

Compare column

numeric
left
Values500
Missing0 (0.0000)
Distinct63
Zeroes83
Stats
Min0.0000
Max6912.0
Mean2132.4
Median2116.8
Std1492.0
Variance2226003
IQR2073.6
Outlier rate0.0040
Quantiles
0.25001036.8
0.50002116.8
0.75003110.4
Most frequent values
0.000083
2073.622
1900.820
2592.015
3110.415
1555.214
3888.014
2332.813
2505.613
3024.013
Smallest values
0.000083
86.4007
172.804
259.205
345.607
Largest values
6912.01
6566.41
5529.62
5443.22
5356.82
right
Values809
Missing0 (0.0000)
Distinct69
Zeroes192
Stats
Min0.0000
Max6393.6
Mean2017.5
Median2073.6
Std1534.6
Variance2354944
IQR2851.2
Outlier rate0.0000
Quantiles
0.2500172.80
0.50002073.6
0.75003024.0
Most frequent values
0.0000192
1814.430
1555.228
2419.227
2592.027
2073.626
1900.824
2160.023
2246.420
1641.618
Smallest values
0.0000192
86.4003
172.808
259.202
345.603
Largest values
6393.61
6134.42
6048.03
5788.81
5702.41
Histogram

name_binary

Compare column

numeric
left
Values500
Missing0 (0.0000)
Distinct500
Zeroes0
Stats
Min12
Max82
Mean31.392
Median29
Std11.130
Variance123.88
IQR16
Outlier rate0.0060
Quantiles
0.250023
0.500029
0.750039
Most frequent values
2527
2425
2825
2724
2923
2622
2019
3019
3119
1918
Smallest values
121
132
141
153
168
Largest values
821
671
651
632
621
right
Values809
Missing0 (0.0000)
Distinct808
Zeroes0
Stats
Min12
Max57
Mean24.497
Median24
Std7.1688
Variance51.392
IQR9
Outlier rate0.0358
Quantiles
0.250019
0.500024
0.750028
Most frequent values
1964
1861
2556
1751
2651
2047
2746
2444
2140
2240
Smallest values
121
132
143
1520
1631
Largest values
571
561
551
541
531
Histogram

sex_enum

Compare column

categorical
left
Values500
Missing0 (0.0000)
Distinct2
Zeroes0
Top categories
f339
m161
right
Values809
Missing0 (0.0000)
Distinct2
Zeroes0
Top categories
m682
f127
Histogram

embarked_category

Compare column

categorical
left
Values498
Missing2 (0.0040)
Distinct4
Zeroes0
Top categories
S304
C150
Q44
right
Values809
Missing0 (0.0000)
Distinct3
Zeroes0
Top categories
S610
C120
Q79
Histogram

family_struct

Compare column

struct
left
Values500
Missing0 (0.0000)
Distinct267
Zeroes0
Sample values
  • {"siblings_spouses": 0, "parents_children": 0, "fare": 211.3375}
  • {"siblings_spouses": 1, "parents_children": 2, "fare": 151.55}
  • {"siblings_spouses": 0, "parents_children": 0, "fare": 26.55}
  • {"siblings_spouses": 1, "parents_children": 0, "fare": 77.9583}
  • {"siblings_spouses": 2, "parents_children": 0, "fare": 51.4792}
right
Values809
Missing0 (0.0000)
Distinct282
Zeroes0
Sample values
  • {"siblings_spouses": 1, "parents_children": 2, "fare": 151.55}
  • {"siblings_spouses": 1, "parents_children": 2, "fare": 151.55}
  • {"siblings_spouses": 1, "parents_children": 2, "fare": 151.55}
  • {"siblings_spouses": 0, "parents_children": 0, "fare": 0.0}
  • {"siblings_spouses": 0, "parents_children": 0, "fare": 49.5042}

voyage_notes

Compare column

list
left
Values500
Missing0 (0.0000)
Distinct18
Zeroes0
List length
Min3
Max3
Mean3.0000
Median3.0000
Sample values
  • [0, 0, 1]
  • [1, 2, 1]
  • [0, 0, 1]
  • [1, 0, 1]
  • [2, 0, 1]
Length histogram
right
Values809
Missing0 (0.0000)
Distinct23
Zeroes0
List length
Min3
Max3
Mean3.0000
Median3.0000
Sample values
  • [1, 2, 0]
  • [1, 2, 0]
  • [1, 2, 0]
  • [0, 0, 0]
  • [0, 0, 0]
Length histogram

voyage_array

Compare column

list
left
Values500
Missing0 (0.0000)
Distinct18
Zeroes0
List length
Min3
Max3
Mean3.0000
Median3.0000
Sample values
  • [0, 0, 1]
  • [1, 2, 1]
  • [0, 0, 1]
  • [1, 0, 1]
  • [2, 0, 1]
Length histogram
right
Values809
Missing0 (0.0000)
Distinct23
Zeroes0
List length
Min3
Max3
Mean3.0000
Median3.0000
Sample values
  • [1, 2, 0]
  • [1, 2, 0]
  • [1, 2, 0]
  • [0, 0, 0]
  • [0, 0, 0]
Length histogram