Overview

Dataset Statistics

Number of Variables 6
Number of Rows 497
Missing Cells 61
Missing Cells (%) 2.0%
Duplicate Rows 1
Duplicate Rows (%) 0.2%
Total Size in Memory 187.6 KB
Average Row Size in Memory 386.6 B
Variable Types
  • Categorical: 5
  • Numerical: 1

Dataset Insights

HCPCS/CPT Code has 21 (4.23%) missing values Missing
Gross Charge has 14 (2.82%) missing values Missing
Discounted Cash Price (Gross Charges) has 24 (4.83%) missing values Missing
Discounted Cash Price (Gross Charges) is skewed Skewed
Procedure ID has a high cardinality: 486 distinct values High Cardinality
HCPCS/CPT Code has a high cardinality: 356 distinct values High Cardinality
Description has a high cardinality: 472 distinct values High Cardinality
Gross Charge has a high cardinality: 370 distinct values High Cardinality
Filename has constant value "30-1114775_carepartners-op_standardcharges.csv" Constant
Filename has constant length 46 Constant Length

Variables

Procedure ID

categorical

Approximate Distinct Count 486
Approximate Unique (%) 97.8%
Missing 0
Missing (%) 0.0%
Memory Size 34.6 KB

Length

Mean 6.3662
Standard Deviation 2.101
Median 6
Minimum 2
Maximum 24

Sample

1st row 111994
2nd row 112649
3rd row 112650
4th row 112651
5th row 112653

Letter

Count 304
Lowercase Letter 257
Space Separator 22
Uppercase Letter 47
Dash Punctuation 0
Decimal Number 2838
  • The largest value (other) is over 1.8 times larger than the second largest value (outpatient)

HCPCS/CPT Code

categorical

Approximate Distinct Count 356
Approximate Unique (%) 74.8%
Missing 21
Missing (%) 4.2%
Memory Size 36.7 KB

Length

Mean 13.9454
Standard Deviation 0.6907
Median 14
Minimum 4
Maximum 14

Sample

1st row 0L7009
2nd row 0L5969
3rd row 0L8410
4th row 0L3320
5th row 0L3913

Letter

Count 452
Lowercase Letter 13
Space Separator 4006
Uppercase Letter 439
Dash Punctuation 0
Decimal Number 2180
  • The largest value (0v2632) is over 2.89 times larger than the second largest value (0c1713)

Description

categorical

Approximate Distinct Count 472
Approximate Unique (%) 95.3%
Missing 2
Missing (%) 0.4%
Memory Size 42.7 KB
  • The largest value (103.5% of MCR) is over 2.25 times larger than the second largest value (100% of MCD)

Length

Mean 23.3939
Standard Deviation 2.8499
Median 24
Minimum 3
Maximum 24

Sample

1st row ELECTRC HOOK MYOEL...
2nd row ADD ENDOSK ANKLE F...
3rd row PROS SHEATH ABOVE ...
4th row CORK ELEV HEEL/SOL...
5th row HFO W/O JOINTS CF ...

Letter

Count 7891
Lowercase Letter 38
Space Separator 3061
Uppercase Letter 7853
Dash Punctuation 12
Decimal Number 466

Gross Charge

categorical

Approximate Distinct Count 370
Approximate Unique (%) 76.6%
Missing 14
Missing (%) 2.8%
Memory Size 33.6 KB
  • The largest value (411.69) is over 11.0 times larger than the second largest value (31.15)

Length

Mean 6.3126
Standard Deviation 1.0444
Median 6
Minimum 3
Maximum 13

Sample

1st row 3600.00
2nd row 38000.00
3rd row 33.00
4th row 110.00
5th row 364.00

Letter

Count 38
Lowercase Letter 16
Space Separator 14
Uppercase Letter 22
Dash Punctuation 0
Decimal Number 2507
  • The largest value (41169) is over 11.0 times larger than the second largest value (mcr)

Discounted Cash Price (Gross Charges)

numerical

Approximate Distinct Count 364
Approximate Unique (%) 77.0%
Missing 24
Missing (%) 4.8%
Infinite 0
Infinite (%) 0.0%
Memory Size 7.4 KB
Mean 1871.5075
Minimum 11
Maximum 47687
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • Discounted Cash Price (Gross Charges) is skewed right (γ1 = 6.112)

Quantile Statistics

Minimum 11
5-th Percentile 43.522
Q1 187
Median 467.25
Q3 1514
95-th Percentile 7037.408
Maximum 47687
Range 47676
IQR 1327

Descriptive Statistics

Mean 1871.5075
Standard Deviation 4609.015
Variance 2.1243e+07
Sum 885223.05
Skewness 6.112
Kurtosis 45.3677
Coefficient of Variation 2.4627
  • Discounted Cash Price (Gross Charges) is not normally distributed (p-value 2.6561216427297412e-24)
  • Discounted Cash Price (Gross Charges) has 59 outliers

Filename

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 53.9 KB

Length

Mean 46
Standard Deviation 0
Median 46
Minimum 46
Maximum 46

Sample

1st row 30-1114775_carepar...
2nd row 30-1114775_carepar...
3rd row 30-1114775_carepar...
4th row 30-1114775_carepar...
5th row 30-1114775_carepar...

Letter

Count 15904
Lowercase Letter 15904
Space Separator 0
Uppercase Letter 0
Dash Punctuation 994
Decimal Number 4473
  • Filename has words of constant length

Interactions

Correlations

Missing Values