Overview

Dataset Statistics

Number of Variables 21
Number of Rows 502
Missing Cells 5847
Missing Cells (%) 55.5%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 215.1 KB
Average Row Size in Memory 438.7 B
Variable Types
  • Numerical: 14
  • Categorical: 7

Dataset Insights

Aetna NC Preferred and Cigna have similar distributions Similar Distribution
Aetna NC Preferred and Coventry - First Health have similar distributions Similar Distribution
Aetna NC Preferred and Multiplan have similar distributions Similar Distribution
Cigna HMO and UHC NHRMC Employee Plan have similar distributions Similar Distribution
Cigna and Coventry - First Health have similar distributions Similar Distribution
Cigna and Multiplan have similar distributions Similar Distribution
Cigna and Medcost have similar distributions Similar Distribution
Coventry - First Health and Multiplan have similar distributions Similar Distribution
Multiplan and Medcost have similar distributions Similar Distribution
Aetna has 357 (71.12%) missing values Missing
Aetna NC Preferred has 488 (97.21%) missing values Missing
Atlantic Packaging has 495 (98.61%) missing values Missing
BCBS has 27 (5.38%) missing values Missing
BCBS Blue Value has 286 (56.97%) missing values Missing
Cigna HMO has 314 (62.55%) missing values Missing
Cigna has 450 (89.64%) missing values Missing
Coventry - First Health has 490 (97.61%) missing values Missing
Humana has 494 (98.41%) missing values Missing
Multiplan has 457 (91.04%) missing values Missing
Medcost has 440 (87.65%) missing values Missing
Medcost Southeast has 498 (99.2%) missing values Missing
Medcost Columbus has 494 (98.41%) missing values Missing
UHC has 217 (43.23%) missing values Missing
UHC NHRMC Employee Plan has 340 (67.73%) missing values Missing
Aetna is skewed Skewed
BCBS is skewed Skewed
BCBS Blue Value is skewed Skewed
Cigna HMO is skewed Skewed
Cigna is skewed Skewed
Coventry - First Health is skewed Skewed
Multiplan is skewed Skewed
Medcost is skewed Skewed
UHC is skewed Skewed
UHC NHRMC Employee Plan is skewed Skewed
De-Identified Minimum Negotiated Charge is skewed Skewed
De-Identified Maximum Negotiated Charge is skewed Skewed
Description has a high cardinality: 499 distinct values High Cardinality
Filename has constant value "340141-new-hanover-regional-medical-center-standard-charges.csv" Constant
Patient Type has constant value "Inpatient" Constant
Filename has constant length 63 Constant Length
Patient Type has constant length 9 Constant Length
Atlantic Packaging has all distinct values Unique
Humana has all distinct values Unique
Medcost Southeast has all distinct values Unique
Medcost Columbus has all distinct values Unique
  • 1
  • 2
  • 3
  • 4
  • 5

Variables

MS-DRG/APC

numerical

Approximate Distinct Count 502
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.8 KB
Mean 470.9582
Minimum 1
Maximum 999
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • MS-DRG/APC is skewed right (γ1 = 0.1428)

Quantile Statistics

Minimum 1
5-th Percentile 55.05
Q1 228.25
Median 453.5
Q3 720.75
95-th Percentile 920.95
Maximum 999
Range 998
IQR 492.5

Descriptive Statistics

Mean 470.9582
Standard Deviation 282.525
Variance 79820.3875
Sum 236421
Skewness 0.1428
Kurtosis -1.1654
Coefficient of Variation 0.5999

Description

categorical

Approximate Distinct Count 499
Approximate Unique (%) 99.4%
Missing 0
Missing (%) 0.0%
Memory Size 57.1 KB

Length

Mean 51.4382
Standard Deviation 19.8819
Median 49
Minimum 9
Maximum 100

Sample

1st row Heart Transplant O...
2nd row Ecmo Or Tracheosto...
3rd row Tracheostomy With ...
4th row Tracheostomy For F...
5th row Tracheostomy For F...

Letter

Count 22334
Lowercase Letter 18543
Space Separator 3071
Uppercase Letter 3791
Dash Punctuation 29
Decimal Number 22
  • The largest value (with) is over 1.52 times larger than the second largest value (without)

Aetna

numerical

Approximate Distinct Count 145
Approximate Unique (%) 100.0%
Missing 357
Missing (%) 71.1%
Infinite 0
Infinite (%) 0.0%
Memory Size 2.3 KB
Mean 44264.5379
Minimum 1922
Maximum 567266
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Aetna is skewed right (γ1 = 4.9838)

Quantile Statistics

Minimum 1922
5-th Percentile 7125.2
Q1 12904
Median 25243
Q3 49372
95-th Percentile 135148.8
Maximum 567266
Range 565344
IQR 36468

Descriptive Statistics

Mean 44264.5379
Standard Deviation 63687.898
Variance 4.0561e+09
Sum 6.4184e+06
Skewness 4.9838
Kurtosis 33.2216
Coefficient of Variation 1.4388
  • Aetna is not normally distributed (p-value 4.991616434189804e-14)
  • Aetna has 13 outliers

Aetna NC Preferred

numerical

Approximate Distinct Count 14
Approximate Unique (%) 100.0%
Missing 488
Missing (%) 97.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 224.0 B
Mean 14084.2857
Minimum 2409
Maximum 29139
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Aetna NC Preferred is skewed right (γ1 = 0.4573)

Quantile Statistics

Minimum 2409
5-th Percentile 2485.05
Q1 8849.5
Median 11906.5
Q3 22394.25
95-th Percentile 28521.5
Maximum 29139
Range 26730
IQR 13544.75

Descriptive Statistics

Mean 14084.2857
Standard Deviation 9471.249
Variance 8.9705e+07
Sum 197180
Skewness 0.4573
Kurtosis -1.1337
Coefficient of Variation 0.6725
  • Aetna NC Preferred is not normally distributed (p-value 0.00014492142556125136)

Atlantic Packaging

categorical

Approximate Distinct Count 7
Approximate Unique (%) 100.0%
Missing 495
Missing (%) 98.6%
Memory Size 502.0 B

Length

Mean 6.7143
Standard Deviation 0.7559
Median 7
Minimum 6
Maximum 8

Sample

1st row 252828.0
2nd row 38219.0
3rd row 34062.0
4th row 34858.0
5th row 1303.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 40

BCBS

numerical

Approximate Distinct Count 461
Approximate Unique (%) 97.0%
Missing 27
Missing (%) 5.4%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.4 KB
Mean 25532.9242
Minimum 1627
Maximum 193683
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • BCBS is skewed right (γ1 = 3.0567)

Quantile Statistics

Minimum 1627
5-th Percentile 7004.2
Q1 11203
Median 18017
Q3 28074.5
95-th Percentile 71483.9
Maximum 193683
Range 192056
IQR 16871.5

Descriptive Statistics

Mean 25532.9242
Standard Deviation 25183.8301
Variance 6.3423e+08
Sum 1.2128e+07
Skewness 3.0567
Kurtosis 11.6791
Coefficient of Variation 0.9863
  • BCBS is not normally distributed (p-value 3.2406337398628453e-10)
  • BCBS has 49 outliers

BCBS Blue Value

numerical

Approximate Distinct Count 210
Approximate Unique (%) 97.2%
Missing 286
Missing (%) 57.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 3.4 KB
Mean 28340.7222
Minimum 1470
Maximum 432352
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • BCBS Blue Value is skewed right (γ1 = 5.4867)

Quantile Statistics

Minimum 1470
5-th Percentile 5772.75
Q1 9719.25
Median 17379.5
Q3 26768.75
95-th Percentile 86049.5
Maximum 432352
Range 430882
IQR 17049.5

Descriptive Statistics

Mean 28340.7222
Standard Deviation 42534.994
Variance 1.8092e+09
Sum 6.1216e+06
Skewness 5.4867
Kurtosis 41.0723
Coefficient of Variation 1.5008
  • BCBS Blue Value is not normally distributed (p-value 8.973751711608093e-15)
  • BCBS Blue Value has 24 outliers

Cigna HMO

numerical

Approximate Distinct Count 182
Approximate Unique (%) 96.8%
Missing 314
Missing (%) 62.5%
Infinite 0
Infinite (%) 0.0%
Memory Size 2.9 KB
Mean 29440.3138
Minimum 1476
Maximum 365997
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Cigna HMO is skewed right (γ1 = 5.4444)

Quantile Statistics

Minimum 1476
5-th Percentile 7627.65
Q1 11747.25
Median 19701.5
Q3 30048.25
95-th Percentile 84009.65
Maximum 365997
Range 364521
IQR 18301

Descriptive Statistics

Mean 29440.3138
Standard Deviation 35409.3392
Variance 1.2538e+09
Sum 5.5348e+06
Skewness 5.4444
Kurtosis 43.5267
Coefficient of Variation 1.2028
  • Cigna HMO is not normally distributed (p-value 6.092997614308463e-16)
  • Cigna HMO has 18 outliers

Cigna

numerical

Approximate Distinct Count 52
Approximate Unique (%) 100.0%
Missing 450
Missing (%) 89.6%
Infinite 0
Infinite (%) 0.0%
Memory Size 832.0 B
Mean 35539.4423
Minimum 1527
Maximum 147672
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Cigna is skewed right (γ1 = 1.5697)

Quantile Statistics

Minimum 1527
5-th Percentile 4008.15
Q1 11332.75
Median 21101.5
Q3 38155.75
95-th Percentile 116447.3
Maximum 147672
Range 146145
IQR 26823

Descriptive Statistics

Mean 35539.4423
Standard Deviation 35865.4308
Variance 1.2863e+09
Sum 1.8481e+06
Skewness 1.5697
Kurtosis 1.641
Coefficient of Variation 1.0092
  • Cigna is not normally distributed (p-value 6.321906065542095e-07)
  • Cigna has 7 outliers

Coventry - First Health

numerical

Approximate Distinct Count 12
Approximate Unique (%) 100.0%
Missing 490
Missing (%) 97.6%
Infinite 0
Infinite (%) 0.0%
Memory Size 192.0 B
Mean 55628.25
Minimum 7477
Maximum 229696
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Coventry - First Health is skewed right (γ1 = 2.2078)

Quantile Statistics

Minimum 7477
5-th Percentile 9677.55
Q1 22273.5
Median 45121
Q3 55852.5
95-th Percentile 155428.4
Maximum 229696
Range 222219
IQR 33579

Descriptive Statistics

Mean 55628.25
Standard Deviation 60030.3318
Variance 3.6036e+09
Sum 667539
Skewness 2.2078
Kurtosis 4.1251
Coefficient of Variation 1.0791
  • Coventry - First Health is not normally distributed (p-value 1.0721131284013256e-11)
  • Coventry - First Health has 1 outliers

Humana

categorical

Approximate Distinct Count 8
Approximate Unique (%) 100.0%
Missing 494
Missing (%) 98.4%
Memory Size 575.0 B

Length

Mean 6.875
Standard Deviation 0.3536
Median 7
Minimum 6
Maximum 7

Sample

1st row 90695.0
2nd row 83161.0
3rd row 12658.0
4th row 38461.0
5th row 40571.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 47

Multiplan

numerical

Approximate Distinct Count 45
Approximate Unique (%) 100.0%
Missing 457
Missing (%) 91.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 720.0 B
Mean 40919.5111
Minimum 1616
Maximum 218076
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Multiplan is skewed right (γ1 = 2.4126)

Quantile Statistics

Minimum 1616
5-th Percentile 4048
Q1 10067
Median 28372
Q3 40868
95-th Percentile 158252
Maximum 218076
Range 216460
IQR 30801

Descriptive Statistics

Mean 40919.5111
Standard Deviation 48193.909
Variance 2.3227e+09
Sum 1.8414e+06
Skewness 2.4126
Kurtosis 5.4933
Coefficient of Variation 1.1778
  • Multiplan is not normally distributed (p-value 2.118106981623385e-11)
  • Multiplan has 5 outliers

Medcost

numerical

Approximate Distinct Count 62
Approximate Unique (%) 100.0%
Missing 440
Missing (%) 87.6%
Infinite 0
Infinite (%) 0.0%
Memory Size 992.0 B
Mean 32109.9355
Minimum 1214
Maximum 240757
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Medcost is skewed right (γ1 = 3.2363)

Quantile Statistics

Minimum 1214
5-th Percentile 2551.25
Q1 10027
Median 16781.5
Q3 32906.25
95-th Percentile 80564.25
Maximum 240757
Range 239543
IQR 22879.25

Descriptive Statistics

Mean 32109.9355
Standard Deviation 44217.47
Variance 1.9552e+09
Sum 1.9908e+06
Skewness 3.2363
Kurtosis 11.2157
Coefficient of Variation 1.3771
  • Medcost is not normally distributed (p-value 3.203771443933949e-11)
  • Medcost has 8 outliers

Medcost Southeast

categorical

Approximate Distinct Count 4
Approximate Unique (%) 100.0%
Missing 498
Missing (%) 99.2%
Memory Size 285.0 B

Length

Mean 6.25
Standard Deviation 0.5
Median 6
Minimum 6
Maximum 7

Sample

1st row 3681.0
2nd row 6663.0
3rd row 8278.0
4th row 75112.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 21

Medcost Columbus

categorical

Approximate Distinct Count 8
Approximate Unique (%) 100.0%
Missing 494
Missing (%) 98.4%
Memory Size 575.0 B

Length

Mean 6.875
Standard Deviation 0.3536
Median 7
Minimum 6
Maximum 7

Sample

1st row 10237.0
2nd row 13339.0
3rd row 11120.0
4th row 39778.0
5th row 17706.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 47

UHC

numerical

Approximate Distinct Count 278
Approximate Unique (%) 97.5%
Missing 217
Missing (%) 43.2%
Infinite 0
Infinite (%) 0.0%
Memory Size 4.5 KB
Mean 35236.7158
Minimum 1054
Maximum 371175
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • UHC is skewed right (γ1 = 4.0658)

Quantile Statistics

Minimum 1054
5-th Percentile 6032.8
Q1 14238
Median 23850
Q3 39838
95-th Percentile 103219.8
Maximum 371175
Range 370121
IQR 25600

Descriptive Statistics

Mean 35236.7158
Standard Deviation 38271.39
Variance 1.4647e+09
Sum 1.0042e+07
Skewness 4.0658
Kurtosis 25.4463
Coefficient of Variation 1.0861
  • UHC is not normally distributed (p-value 1.9847530931604145e-11)
  • UHC has 25 outliers

UHC NHRMC Employee Plan

numerical

Approximate Distinct Count 162
Approximate Unique (%) 100.0%
Missing 340
Missing (%) 67.7%
Infinite 0
Infinite (%) 0.0%
Memory Size 2.5 KB
Mean 19984.7716
Minimum 1063
Maximum 118803
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • UHC NHRMC Employee Plan is skewed right (γ1 = 2.4816)

Quantile Statistics

Minimum 1063
5-th Percentile 2892.55
Q1 6812.75
Median 12555.5
Q3 23589.75
95-th Percentile 63401
Maximum 118803
Range 117740
IQR 16777

Descriptive Statistics

Mean 19984.7716
Standard Deviation 21143.8621
Variance 4.4706e+08
Sum 3.2375e+06
Skewness 2.4816
Kurtosis 6.8761
Coefficient of Variation 1.058
  • UHC NHRMC Employee Plan is not normally distributed (p-value 1.2952816303532048e-08)
  • UHC NHRMC Employee Plan has 15 outliers

De-Identified Minimum Negotiated Charge

numerical

Approximate Distinct Count 491
Approximate Unique (%) 97.8%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.8 KB
Mean 23686.7351
Minimum 1054
Maximum 432352
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • De-Identified Minimum Negotiated Charge is skewed right (γ1 = 6.4945)

Quantile Statistics

Minimum 1054
5-th Percentile 3998.2
Q1 9419.5
Median 15911.5
Q3 26022.5
95-th Percentile 69913.45
Maximum 432352
Range 431298
IQR 16603

Descriptive Statistics

Mean 23686.7351
Standard Deviation 29825.5482
Variance 8.8956e+08
Sum 1.1891e+07
Skewness 6.4945
Kurtosis 72.4432
Coefficient of Variation 1.2592
  • De-Identified Minimum Negotiated Charge is not normally distributed (p-value 4.7002009304540675e-15)
  • De-Identified Minimum Negotiated Charge has 50 outliers

De-Identified Maximum Negotiated Charge

numerical

Approximate Distinct Count 498
Approximate Unique (%) 99.2%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.8 KB
Mean 42407.5139
Minimum 2409
Maximum 567266
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • De-Identified Maximum Negotiated Charge is skewed right (γ1 = 4.593)

Quantile Statistics

Minimum 2409
5-th Percentile 9427
Q1 16356.25
Median 26959.5
Q3 45030.75
95-th Percentile 127702.2
Maximum 567266
Range 564857
IQR 28674.5

Descriptive Statistics

Mean 42407.5139
Standard Deviation 50873.5718
Variance 2.5881e+09
Sum 2.1289e+07
Skewness 4.593
Kurtosis 32.5132
Coefficient of Variation 1.1996
  • De-Identified Maximum Negotiated Charge is not normally distributed (p-value 3.4756899582720415e-15)
  • De-Identified Maximum Negotiated Charge has 55 outliers

Filename

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 62.8 KB

Length

Mean 63
Standard Deviation 0
Median 63
Minimum 63
Maximum 63

Sample

1st row 340141-new-hanover...
2nd row 340141-new-hanover...
3rd row 340141-new-hanover...
4th row 340141-new-hanover...
5th row 340141-new-hanover...

Letter

Count 24598
Lowercase Letter 24598
Space Separator 0
Uppercase Letter 0
Dash Punctuation 3514
Decimal Number 3012
  • Filename has words of constant length

Patient Type

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 36.3 KB

Length

Mean 9
Standard Deviation 0
Median 9
Minimum 9
Maximum 9

Sample

1st row Inpatient
2nd row Inpatient
3rd row Inpatient
4th row Inpatient
5th row Inpatient

Letter

Count 4518
Lowercase Letter 4016
Space Separator 0
Uppercase Letter 502
Dash Punctuation 0
Decimal Number 0
  • Patient Type has words of constant length

Interactions

Correlations

Missing Values