Overview

Dataset Statistics

Number of Variables 11
Number of Rows 20333
Missing Cells 30794
Missing Cells (%) 13.8%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 14.5 MB
Average Row Size in Memory 745.3 B
Variable Types
  • Categorical: 11

Dataset Insights

HCPCS has 10452 (51.4%) missing values Missing
Unnamed: 9 has 20333 (100.0%) missing values Missing
Code has a high cardinality: 20333 distinct values High Cardinality
HCPCS has a high cardinality: 2656 distinct values High Cardinality
Billing Description has a high cardinality: 18587 distinct values High Cardinality
RevCode has a high cardinality: 132 distinct values High Cardinality
Gross Chg has a high cardinality: 6744 distinct values High Cardinality
Self-Pay has a high cardinality: 6470 distinct values High Cardinality
Min Allowable has a high cardinality: 5946 distinct values High Cardinality
Average Allowable has a high cardinality: 7116 distinct values High Cardinality
Max Allowable has a high cardinality: 6673 distinct values High Cardinality
Filename has constant value "56-0510824_Watauga_Medical_Center_CDM_with_Standard_Charges_FY2021_CSV.csv" Constant
Code has constant length 9 Constant Length
RevCode has constant length 5 Constant Length
Filename has constant length 74 Constant Length
Code has all distinct values Unique
Unnamed: 9 has all distinct values Unique
  • 1
  • 2

Variables

Code

categorical

Approximate Distinct Count 20333
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.4 MB

Length

Mean 9
Standard Deviation 0
Median 9
Minimum 9
Maximum 9

Sample

1st row 611020001
2nd row 611020002
3rd row 611020003
4th row 611020004
5th row 611020005

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 182997
  • Code contains many words: 20333 words
  • Code has words of constant length

HCPCS

categorical

Approximate Distinct Count 2656
Approximate Unique (%) 26.9%
Missing 10452
Missing (%) 51.4%
Memory Size 675.5 KB
  • The largest value (C1713) is over 2.65 times larger than the second largest value (C1776)

Length

Mean 4.9997
Standard Deviation 0.02249
Median 5
Minimum 3
Maximum 5

Sample

1st row G0378
2nd row Q3014
3rd row G0379
4th row 96365
5th row 96366

Letter

Count 5213
Lowercase Letter 0
Space Separator 0
Uppercase Letter 5213
Dash Punctuation 0
Decimal Number 44189
  • HCPCS contains many words: 2656 words
  • The largest value (c1713) is over 2.65 times larger than the second largest value (c1776)

Billing Description

categorical

Approximate Distinct Count 18587
Approximate Unique (%) 91.4%
Missing 0
Missing (%) 0.0%
Memory Size 1.7 MB

Length

Mean 25.1506
Standard Deviation 4.7839
Median 26
Minimum 3
Maximum 30

Sample

1st row MED/SURG A PRIVATE...
2nd row MED/SURG A ROOM CH...
3rd row MEDSURG A HOSPICE ...
4th row MED/SURG A OBSERVA...
5th row MEDSURG A ORIGINAT...

Letter

Count 383358
Lowercase Letter 1222
Space Separator 66197
Uppercase Letter 382136
Dash Punctuation 2814
Decimal Number 45012
  • Billing Description contains many words: 14359 words
  • The largest value (screw) is over 1.62 times larger than the second largest value (x)

RevCode

categorical

Approximate Distinct Count 132
Approximate Unique (%) 0.6%
Missing 9
Missing (%) 0.0%
Memory Size 1.4 MB
  • The largest value (278.0) is over 1.73 times larger than the second largest value (270.0)

Length

Mean 5
Standard Deviation 0
Median 5
Minimum 5
Maximum 5

Sample

1st row 110.0
2nd row 120.0
3rd row 650.0
4th row 762.0
5th row 780.0

Letter

Count 0
Lowercase Letter 0
Space Separator 0
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 81296
  • The top 2 categories (278.0, 270.0) take over 50.0%
  • The largest value (2780) is over 1.73 times larger than the second largest value (2700)
  • RevCode has words of constant length

Gross Chg

categorical

Approximate Distinct Count 6744
Approximate Unique (%) 33.2%
Missing 0
Missing (%) 0.0%
Memory Size 1.4 MB

Length

Mean 8.4693
Standard Deviation 1.3844
Median 8
Minimum 5
Maximum 12

Sample

1st row 987.00
2nd row 987.00
3rd row 846.00
4th row 42.00
5th row 91.00

Letter

Count 0
Lowercase Letter 0
Space Separator 41440
Uppercase Letter 0
Dash Punctuation 387
Decimal Number 102897
  • Gross Chg contains many words: 6743 words

Self-Pay

categorical

Approximate Distinct Count 6470
Approximate Unique (%) 31.8%
Missing 0
Missing (%) 0.0%
Memory Size 1.4 MB

Length

Mean 8.051
Standard Deviation 1.2937
Median 8
Minimum 5
Maximum 11

Sample

1st row 493.50
2nd row 493.50
3rd row 423.00
4th row 21.00
5th row 45.50

Letter

Count 0
Lowercase Letter 0
Space Separator 41440
Uppercase Letter 0
Dash Punctuation 387
Decimal Number 97109
  • Self-Pay contains many words: 6469 words

Min Allowable

categorical

Approximate Distinct Count 5946
Approximate Unique (%) 29.2%
Missing 0
Missing (%) 0.0%
Memory Size 1.4 MB

Length

Mean 7.6008
Standard Deviation 1.2021
Median 8
Minimum 5
Maximum 18

Sample

1st row 256.62
2nd row 256.62
3rd row 219.96
4th row 10.92
5th row 23.66

Letter

Count 45
Lowercase Letter 45
Space Separator 41911
Uppercase Letter 0
Dash Punctuation 621
Decimal Number 89797
  • Min Allowable contains many words: 5946 words

Average Allowable

categorical

Approximate Distinct Count 7116
Approximate Unique (%) 35.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.4 MB

Length

Mean 8.0864
Standard Deviation 1.2962
Median 8
Minimum 5
Maximum 11

Sample

1st row 547.79
2nd row 547.79
3rd row 469.53
4th row 23.31
5th row 50.51

Letter

Count 15
Lowercase Letter 0
Space Separator 41428
Uppercase Letter 15
Dash Punctuation 384
Decimal Number 97641
  • Average Allowable contains many words: 7115 words

Max Allowable

categorical

Approximate Distinct Count 6673
Approximate Unique (%) 32.8%
Missing 0
Missing (%) 0.0%
Memory Size 1.4 MB

Length

Mean 8.3636
Standard Deviation 1.3623
Median 8
Minimum 5
Maximum 11

Sample

1st row 838.95
2nd row 838.95
3rd row 719.10
4th row 35.70
5th row 77.35

Letter

Count 0
Lowercase Letter 0
Space Separator 41440
Uppercase Letter 0
Dash Punctuation 387
Decimal Number 101465
  • Max Allowable contains many words: 6672 words

Unnamed: 9

categorical

Approximate Distinct Count 2
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.3 MB
  • The largest value (nan) is over 202.33 times larger than the second largest value (<NA>)

Length

Mean 3.0049
Standard Deviation 0.06996
Median 3
Minimum 3
Maximum 4

Sample

1st row
2nd row
3rd row
4th row
5th row

Letter

Count 60899
Lowercase Letter 60699
Space Separator 0
Uppercase Letter 200
Dash Punctuation 0
Decimal Number 0
  • The top 2 categories (nan, <NA>) take over 50.0%
  • The largest value (nan) is over 202.33 times larger than the second largest value (na)

Filename

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 2.7 MB

Length

Mean 74
Standard Deviation 0
Median 74
Minimum 74
Maximum 74

Sample

1st row 56-0510824_Watauga...
2nd row 56-0510824_Watauga...
3rd row 56-0510824_Watauga...
4th row 56-0510824_Watauga...
5th row 56-0510824_Watauga...

Letter

Count 1016650
Lowercase Letter 752321
Space Separator 0
Uppercase Letter 264329
Dash Punctuation 20333
Decimal Number 264329
  • Filename has words of constant length

Missing Values