Overview

Dataset Statistics

Number of Variables 12
Number of Rows 1606
Missing Cells 10008
Missing Cells (%) 51.9%
Duplicate Rows 970
Duplicate Rows (%) 60.4%
Total Size in Memory 988.4 KB
Average Row Size in Memory 630.2 B
Variable Types
  • Categorical: 12

Dataset Insights

CPT/MS-DRG has 972 (60.52%) missing values Missing
Procedure Description has 972 (60.52%) missing values Missing
Self Pay has 1008 (62.76%) missing values Missing
Medicare has 1008 (62.76%) missing values Missing
Medicaid has 1008 (62.76%) missing values Missing
Aetna has 1008 (62.76%) missing values Missing
BCBS has 1008 (62.76%) missing values Missing
Cigna has 1008 (62.76%) missing values Missing
Medcost has 1008 (62.76%) missing values Missing
UHC has 1008 (62.76%) missing values Missing
Dataset has 970 (60.4%) duplicate rows Duplicates
CPT/MS-DRG has a high cardinality: 429 distinct values High Cardinality
Procedure Description has a high cardinality: 412 distinct values High Cardinality
Self Pay has a high cardinality: 366 distinct values High Cardinality
Medicare has a high cardinality: 336 distinct values High Cardinality
Medicaid has a high cardinality: 513 distinct values High Cardinality
Aetna has a high cardinality: 488 distinct values High Cardinality
BCBS has a high cardinality: 474 distinct values High Cardinality
Cigna has a high cardinality: 372 distinct values High Cardinality
Medcost has a high cardinality: 477 distinct values High Cardinality
UHC has a high cardinality: 366 distinct values High Cardinality
system has constant value "APP" Constant
system has constant length 3 Constant Length
  • 1
  • 2
  • 3

Variables

CPT/MS-DRG

categorical

Approximate Distinct Count 429
Approximate Unique (%) 67.7%
Missing 972
Missing (%) 60.5%
Memory Size 43.4 KB

Length

Mean 5.0221
Standard Deviation 1.0084
Median 5
Minimum 3
Maximum 11

Sample

1st row 166
2nd row 190
3rd row 243
4th row 247
5th row 286

Letter

Count 25
Lowercase Letter 0
Space Separator 0
Uppercase Letter 25
Dash Punctuation 0
Decimal Number 3145

Procedure Description

categorical

Approximate Distinct Count 412
Approximate Unique (%) 65.0%
Missing 972
Missing (%) 60.5%
Memory Size 61.4 KB

Length

Mean 34.2287
Standard Deviation 19.5949
Median 28
Minimum 4
Maximum 139

Sample

1st row Other resp system...
2nd row Chronic obstructi...
3rd row Permanent cardiac...
4th row Perc cardiovasc p...
5th row Circulatory disor...

Letter

Count 17824
Lowercase Letter 14633
Space Separator 2786
Uppercase Letter 3191
Dash Punctuation 218
Decimal Number 298

Self Pay

categorical

Approximate Distinct Count 366
Approximate Unique (%) 61.2%
Missing 1008
Missing (%) 62.8%
Memory Size 42.8 KB

Length

Mean 8.2943
Standard Deviation 1.3251
Median 8
Minimum 6
Maximum 11

Sample

1st row 32,838.53
2nd row 11,543.50
3rd row 21,950.21
4th row 35,016.26
5th row 22,272.58

Letter

Count 0
Lowercase Letter 0
Space Separator 1196
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3002

Medicare

categorical

Approximate Distinct Count 336
Approximate Unique (%) 56.2%
Missing 1008
Missing (%) 62.8%
Memory Size 42.2 KB

Length

Mean 7.3144
Standard Deviation 2.1937
Median 8
Minimum 4
Maximum 11

Sample

1st row 24,408.80
2nd row 7,238.85
3rd row 16,312.04
4th row 12,805.65
5th row 14,260.63

Letter

Count 0
Lowercase Letter 0
Space Separator 858
Uppercase Letter 0
Dash Punctuation 11
Decimal Number 2771

Medicaid

categorical

Approximate Distinct Count 513
Approximate Unique (%) 85.8%
Missing 1008
Missing (%) 62.8%
Memory Size 42.4 KB

Length

Mean 7.6438
Standard Deviation 1.8911
Median 8
Minimum 4
Maximum 11

Sample

1st row 30,736.86
2nd row 10,804.72
3rd row 20,545.40
4th row 32,775.22
5th row 20,847.13

Letter

Count 0
Lowercase Letter 0
Space Separator 996
Uppercase Letter 0
Dash Punctuation 1
Decimal Number 2823

Aetna

categorical

Approximate Distinct Count 488
Approximate Unique (%) 81.6%
Missing 1008
Missing (%) 62.8%
Memory Size 42.9 KB

Length

Mean 8.3763
Standard Deviation 1.5348
Median 8
Minimum 5
Maximum 11

Sample

1st row 55,825.49
2nd row 19,623.95
3rd row 37,315.36
4th row 59,527.65
5th row 37,863.38

Letter

Count 0
Lowercase Letter 0
Space Separator 1212
Uppercase Letter 0
Dash Punctuation 8
Decimal Number 3018

BCBS

categorical

Approximate Distinct Count 474
Approximate Unique (%) 79.3%
Missing 1008
Missing (%) 62.8%
Memory Size 42.9 KB

Length

Mean 8.3913
Standard Deviation 1.3453
Median 8
Minimum 5
Maximum 11

Sample

1st row 51,198.61
2nd row 16,529.20
3rd row 37,918.82
4th row 30,559.07
5th row 31,712.27

Letter

Count 0
Lowercase Letter 0
Space Separator 1202
Uppercase Letter 0
Dash Punctuation 3
Decimal Number 3043

Cigna

categorical

Approximate Distinct Count 372
Approximate Unique (%) 62.2%
Missing 1008
Missing (%) 62.8%
Memory Size 42.9 KB

Length

Mean 8.4933
Standard Deviation 1.4044
Median 8
Minimum 6
Maximum 11

Sample

1st row 55,825.49
2nd row 19,623.95
3rd row 37,315.36
4th row 59,527.65
5th row 37,863.38

Letter

Count 0
Lowercase Letter 0
Space Separator 1196
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3098

Medcost

categorical

Approximate Distinct Count 477
Approximate Unique (%) 79.8%
Missing 1008
Missing (%) 62.8%
Memory Size 42.9 KB

Length

Mean 8.4799
Standard Deviation 1.3582
Median 8
Minimum 6
Maximum 11

Sample

1st row 48,601.02
2nd row 17,084.38
3rd row 32,486.32
4th row 51,824.07
5th row 32,963.41

Letter

Count 0
Lowercase Letter 0
Space Separator 1196
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3091

UHC

categorical

Approximate Distinct Count 366
Approximate Unique (%) 61.2%
Missing 1008
Missing (%) 62.8%
Memory Size 42.9 KB

Length

Mean 8.4749
Standard Deviation 1.3667
Median 8
Minimum 6
Maximum 11

Sample

1st row 49,914.56
2nd row 17,546.12
3rd row 33,364.33
4th row 53,224.72
5th row 33,854.32

Letter

Count 0
Lowercase Letter 0
Space Separator 1196
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3088

Filename

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 194.5 KB
  • The largest value (56-0510824_Watauga_Medical_Center_Shoppable_Services.csv) is over 3.96 times larger than the second largest value (56-0529974_Charles_A_Cannon_Jr_Memorial_Hospital_Shoppable_Services.csv)

Length

Mean 59.0262
Standard Deviation 6.0214
Median 56
Minimum 56
Maximum 71

Sample

1st row 56-0510824_Watauga...
2nd row 56-0510824_Watauga...
3rd row 56-0510824_Watauga...
4th row 56-0510824_Watauga...
5th row 56-0510824_Watauga...

Letter

Count 68128
Lowercase Letter 59126
Space Separator 0
Uppercase Letter 9002
Dash Punctuation 1606
Decimal Number 14454
  • The top 2 categories (56-0510824_Watauga_Medical_Center_Shoppable_Services.csv, 56-0529974_Charles_A_Cannon_Jr_Memorial_Hospital_Shoppable_Services.csv) take over 50.0%
  • The largest value (560510824_watauga_medical_center_shoppable_servicescsv) is over 3.96 times larger than the second largest value (560529974_charles_a_cannon_jr_memorial_hospital_shoppable_servicescsv)

system

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.1%
Missing 0
Missing (%) 0.0%
Memory Size 106.6 KB

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row APP
2nd row APP
3rd row APP
4th row APP
5th row APP

Letter

Count 4818
Lowercase Letter 0
Space Separator 0
Uppercase Letter 4818
Dash Punctuation 0
Decimal Number 0
  • system has words of constant length

Missing Values