Overview

Dataset Statistics

Number of Variables 10
Number of Rows 7456
Missing Cells 3899
Missing Cells (%) 5.2%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 5.1 MB
Average Row Size in Memory 717.1 B
Variable Types
  • Categorical: 10
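
The missing-cell percentage follows from the counts above: 3899 missing cells out of 7456 × 10 cells. Since every other variable in this report shows Missing 0, the same 3899 cells also account for the HCPCS missing rate. A minimal arithmetic check, using only figures quoted in this report (the variable names are illustrative):

```python
# Figures copied from the Dataset Statistics block.
n_rows = 7456
n_variables = 10
missing_cells = 3899

total_cells = n_rows * n_variables
missing_pct = 100 * missing_cells / total_cells

# All missing cells sit in the HCPCS column, so its rate is per row.
hcpcs_missing_pct = 100 * missing_cells / n_rows

print(f"{total_cells} cells, {missing_pct:.1f}% missing overall")
print(f"HCPCS: {hcpcs_missing_pct:.2f}% missing")
```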

Dataset Insights

  • HCPCS has 3899 (52.29%) missing values [Missing]
  • Code has a high cardinality: 7456 distinct values [High Cardinality]
  • HCPCS has a high cardinality: 1995 distinct values [High Cardinality]
  • Billing Description has a high cardinality: 7183 distinct values [High Cardinality]
  • RevCode has a high cardinality: 101 distinct values [High Cardinality]
  • Gross Chg has a high cardinality: 3654 distinct values [High Cardinality]
  • Self-Pay Discount has a high cardinality: 3473 distinct values [High Cardinality]
  • Min Allowable has a high cardinality: 3246 distinct values [High Cardinality]
  • Average Allowable has a high cardinality: 4125 distinct values [High Cardinality]
  • Max Allowable has a high cardinality: 3625 distinct values [High Cardinality]
  • Filename has constant value "56-0529974_Charles_A_Cannon_Jr_Memorial_Hospital_CDM_with_Standard_Charges_FY2021.csv" [Constant]
  • Code has constant length 9 [Constant Length]
  • RevCode has constant length 3 [Constant Length]
  • Filename has constant length 85 [Constant Length]
  • Code has all distinct values [Unique]

Variables

Code

categorical

Approximate Distinct Count 7456
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Memory Size 538.8 KB

Length

Mean 9
Standard Deviation 0
Median 9
Minimum 9
Maximum 9

Sample

1st row 611020001
2nd row 611020003
3rd row 611020004
4th row 611020005
5th row 611020006

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 67104
  • Code contains many words: 7456 words
  • Code has words of constant length

HCPCS

categorical

Approximate Distinct Count 1995
Approximate Unique (%) 56.1%
Missing 3899
Missing (%) 52.3%
Memory Size 243.2 KB

Length

Mean 4.9994
Standard Deviation 0.03353
Median 5
Minimum 3
Maximum 5

Sample

1st row G0378
2nd row Q3014
3rd row G0378
4th row G0379
5th row 96365

Letter

Count 945
Lowercase Letter 0
Space Separator 0
Uppercase Letter 945
Dash Punctuation 0
Decimal Number 16838
  • HCPCS contains many words: 1995 words
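
The sample rows and character counts suggest two shapes of HCPCS value: five digits (96365) and one uppercase letter followed by four digits (G0378). The sketch below tallies values by shape. The two patterns and the helper name `hcpcs_shape` are assumptions made for illustration; the report itself does not define valid formats, and the minimum length of 3 shows that some values fit neither pattern.

```python
import re
from collections import Counter

# Assumed shapes, not taken from the report.
FIVE_DIGITS = re.compile(r"\d{5}")
LETTER_PLUS_DIGITS = re.compile(r"[A-Z]\d{4}")

def hcpcs_shape(code):
    """Return a coarse shape label for one HCPCS value (hypothetical helper)."""
    if code is None or code == "":
        return "missing"
    if FIVE_DIGITS.fullmatch(code):
        return "five digits"
    if LETTER_PLUS_DIGITS.fullmatch(code):
        return "letter + four digits"
    return "other"

# The five sample rows shown above.
sample = ["G0378", "Q3014", "G0378", "G0379", "96365"]
shape_counts = Counter(hcpcs_shape(c) for c in sample)
print(shape_counts)
```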

Billing Description

categorical

Approximate Distinct Count 7183
Approximate Unique (%) 96.3%
Missing 0
Missing (%) 0.0%
Memory Size 648.8 KB
  • The most frequent value (DILATOR ESOPHAGEAL BOUGIE) occurs over 1.67 times as often as the second most frequent value (OPTH SP ACRYSOF TORIC IQ)

Length

Mean 24.1007
Standard Deviation 5.3399
Median 25
Minimum 3
Maximum 30

Sample

1st row MED/SURG A PRIVATE...
2nd row MED/SURG A HOSPICE...
3rd row MED/SURG OBSERVATI...
4th row MEDSURG ORIGINATIN...
5th row IMCU BED CHARGE

Letter

Count 140256
Lowercase Letter 1251
Space Separator 21308
Uppercase Letter 139005
Dash Punctuation 1071
Decimal Number 11958
  • Billing Description contains many words: 7597 words

RevCode

categorical

Approximate Distinct Count 101
Approximate Unique (%) 1.4%
Missing 0
Missing (%) 0.0%
Memory Size 495.1 KB

Length

Mean 3
Standard Deviation 0
Median 3
Minimum 3
Maximum 3

Sample

1st row 110
2nd row 650
3rd row 762
4th row 780
5th row 206

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 22368
  • RevCode has words of constant length

Gross Chg

categorical

Approximate Distinct Count 3654
Approximate Unique (%) 49.0%
Missing 0
Missing (%) 0.0%
Memory Size 528.4 KB

Length

Mean 7.5709
Standard Deviation 1.2299
Median 8
Minimum 5
Maximum 11

Sample

1st row 987.00
2nd row 846.00
3rd row 41.50
4th row 91.00
5th row 1,214.00

Letter

Count 0
Lowercase Letter 0
Space Separator 15692
Uppercase Letter 0
Dash Punctuation 390
Decimal Number 32427
  • Gross Chg contains many words: 3653 words
  • The most frequent value (500) occurs over 6.46 times as often as the second most frequent value (6926)
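
Gross Chg and the other charge columns are typed as categorical because the amounts are stored as formatted text: the sample includes a thousands separator (1,214.00), and the character counts show space separators and dash punctuation. A minimal sketch of converting such strings to numbers follows. The helper name `parse_charge` is hypothetical, and treating a bare dash (such as the " - " value reported under Min Allowable) as "no amount" is an assumption, not something the report states.

```python
from decimal import Decimal, InvalidOperation

def parse_charge(text):
    """Convert a formatted charge string to Decimal, or None if it has no amount.

    Hypothetical helper: mapping a bare dash to None is an assumption.
    """
    cleaned = text.strip().replace(",", "")
    if cleaned in ("", "-"):
        return None
    try:
        return Decimal(cleaned)
    except InvalidOperation:
        return None  # any other non-numeric text

print(parse_charge("1,214.00"))
print(parse_charge(" 41.50 "))
print(parse_charge(" - "))
```

Decimal is used instead of float so that currency amounts keep their exact two-decimal representation.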

Self-Pay Discount

categorical

Approximate Distinct Count 3473
Approximate Unique (%) 46.6%
Missing 0
Missing (%) 0.0%
Memory Size 526.0 KB

Length

Mean 7.2448
Standard Deviation 1.0886
Median 7
Minimum 5
Maximum 11

Sample

1st row 493.50
2nd row 423.00
3rd row 20.75
4th row 45.50
5th row 607.00

Letter

Count 0
Lowercase Letter 0
Space Separator 15692
Uppercase Letter 0
Dash Punctuation 390
Decimal Number 30454
  • Self-Pay Discount contains many words: 3472 words
  • The most frequent value (250) occurs over 6.46 times as often as the second most frequent value (3463)

Min Allowable

categorical

Approximate Distinct Count 3246
Approximate Unique (%) 43.5%
Missing 0
Missing (%) 0.0%
Memory Size 523.9 KB
  • The most frequent value ( - ) occurs over 1.91 times as often as the second most frequent value ( 2.10 )

Length

Mean 6.9565
Standard Deviation 1.0735
Median 7
Minimum 5
Maximum 18

Sample

1st row 414.54
2nd row 355.32
3rd row 17.43
4th row 38.22
5th row 509.88

Letter

Count 60
Lowercase Letter 60
Space Separator 16124
Uppercase Letter 0
Dash Punctuation 604
Decimal Number 28031
  • Min Allowable contains many words: 3246 words
  • The most frequent value (210) occurs over 5.98 times as often as the second most frequent value (2909)

Average Allowable

categorical

Approximate Distinct Count 4125
Approximate Unique (%) 55.3%
Missing 0
Missing (%) 0.0%
Memory Size 526.8 KB

Length

Mean 7.3451
Standard Deviation 1.1294
Median 7
Minimum 5
Maximum 11

Sample

1st row 651.42
2nd row 558.36
3rd row 27.39
4th row 60.06
5th row 801.24

Letter

Count 20
Lowercase Letter 0
Space Separator 15682
Uppercase Letter 20
Dash Punctuation 389
Decimal Number 31053
  • Average Allowable contains many words: 4124 words
  • The most frequent value (330) occurs over 6.24 times as often as the second most frequent value (4571)

Max Allowable

categorical

Approximate Distinct Count 3625
Approximate Unique (%) 48.6%
Missing 0
Missing (%) 0.0%
Memory Size 528.1 KB

Length

Mean 7.5245
Standard Deviation 1.2146
Median 8
Minimum 5
Maximum 11

Sample

1st row 888.30
2nd row 761.40
3rd row 37.35
4th row 81.90
5th row 1,092.60

Letter

Count 0
Lowercase Letter 0
Space Separator 15692
Uppercase Letter 0
Dash Punctuation 390
Decimal Number 32143
  • Max Allowable contains many words: 3624 words
  • The most frequent value (450) occurs over 6.46 times as often as the second most frequent value (6233)
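
In the five sample rows, each derived charge column is a fixed fraction of Gross Chg: 0.50 for Self-Pay Discount, 0.42 for Min Allowable, 0.66 for Average Allowable, and 0.90 for Max Allowable. The sketch below recomputes those ratios from the sample values printed in this report. It covers only these five rows and says nothing about the remaining rows, where placeholder values such as " - " also occur.

```python
from decimal import Decimal

# First five sample rows of each charge column, copied from this report.
gross = ["987.00", "846.00", "41.50", "91.00", "1,214.00"]
derived = {
    "Self-Pay Discount": ["493.50", "423.00", "20.75", "45.50", "607.00"],
    "Min Allowable": ["414.54", "355.32", "17.43", "38.22", "509.88"],
    "Average Allowable": ["651.42", "558.36", "27.39", "60.06", "801.24"],
    "Max Allowable": ["888.30", "761.40", "37.35", "81.90", "1,092.60"],
}

def to_decimal(text):
    """Drop the thousands separator and parse exactly."""
    return Decimal(text.replace(",", ""))

ratios = {}
for column, values in derived.items():
    # Collect the distinct value / gross ratios seen in the sample.
    ratios[column] = {to_decimal(v) / to_decimal(g) for v, g in zip(values, gross)}

for column, seen in ratios.items():
    print(column, sorted(seen))
```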

Filename

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.1 MB

Length

Mean 85
Standard Deviation 0
Median 85
Minimum 85
Maximum 85

Sample

1st row 56-0529974_Charles...
2nd row 56-0529974_Charles...
3rd row 56-0529974_Charles...
4th row 56-0529974_Charles...
5th row 56-0529974_Charles...

Letter

Count 439904
Lowercase Letter 342976
Space Separator 0
Uppercase Letter 96928
Dash Punctuation 7456
Decimal Number 96928
  • Filename has words of constant length
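
Because Filename holds one constant value, its character counts can be reproduced by counting character classes in that single string and multiplying by the 7456 rows. The sketch below does this with Unicode general categories; that the report's labels (Lowercase Letter, Uppercase Letter, Decimal Number, Dash Punctuation) correspond to the categories Ll, Lu, Nd, and Pd is an assumption based on the matching names.

```python
import unicodedata
from collections import Counter

filename = "56-0529974_Charles_A_Cannon_Jr_Memorial_Hospital_CDM_with_Standard_Charges_FY2021.csv"
n_rows = 7456  # every row carries the same filename

# Unicode general categories: Lu uppercase, Ll lowercase, Nd decimal digit, Pd dash.
per_value = Counter(unicodedata.category(ch) for ch in filename)
totals = {cat: count * n_rows for cat, count in per_value.items()}

print("length:", len(filename))
print("letters:", totals["Ll"] + totals["Lu"])
print("lowercase:", totals["Ll"], "uppercase:", totals["Lu"])
print("digits:", totals["Nd"], "dashes:", totals["Pd"])
```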

Missing Values