Overview

Dataset Statistics

Number of Variables 4
Number of Rows 509
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 154.8 KB
Average Row Size in Memory 311.5 B
Variable Types
  • Categorical: 4
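
The average row size above appears to be the total in-memory size divided by the number of rows. The sketch below reproduces that arithmetic. It assumes that 1 KB means 1024 bytes, which the report does not state; the function name is hypothetical.

```python
def average_row_size_bytes(total_kb: float, n_rows: int) -> float:
    """Average bytes per row, assuming 1 KB = 1024 bytes (an assumption)."""
    return total_kb * 1024 / n_rows


# 154.8 KB over 509 rows; the result agrees with the reported 311.5 B
# to within the rounding of the 154.8 KB figure.
size = average_row_size_bytes(154.8, 509)
```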

Dataset Insights

DRG has a high cardinality: 503 distinct values (High Cardinality)
DRG Desc has a high cardinality: 509 distinct values (High Cardinality)
Average of TOTAL CHARGES has a high cardinality: 509 distinct values (High Cardinality)
Filename has constant value "ms-drg_internet-2020.csv" (Constant)
DRG has constant length 5 (Constant Length)
Filename has constant length 24 (Constant Length)
DRG Desc has all distinct values (Unique)
Average of TOTAL CHARGES has all distinct values (Unique)
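
The insight labels above can be approximated with a few simple rules per column. The following is a minimal sketch, not the profiling tool's actual logic: the function name and the high-cardinality cutoff of 50 are assumptions made for illustration.

```python
HIGH_CARDINALITY_THRESHOLD = 50  # hypothetical cutoff, not taken from the report


def column_insights(values):
    """Return the insight labels that apply to one column of string values."""
    labels = []
    distinct = len(set(values))
    if distinct == 1:
        labels.append("Constant")
    if len(values) > 1 and distinct == len(values):
        labels.append("Unique")
    if distinct > HIGH_CARDINALITY_THRESHOLD:
        labels.append("High Cardinality")
    # Constant Length: every value has the same number of characters.
    if len({len(v) for v in values}) == 1:
        labels.append("Constant Length")
    return labels
```

Applied to 509 copies of "ms-drg_internet-2020.csv", these rules yield Constant and Constant Length, which matches the two labels reported for Filename.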

Variables

DRG

categorical

Approximate Distinct Count 503
Approximate Unique (%) 98.8%
Missing 0
Missing (%) 0.0%
Memory Size 34.8 KB
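
The distinct count and unique percentage above are related by a simple ratio: 503 distinct values over 509 rows is about 98.8%. The sketch below computes both exactly. The report labels its figures "Approximate", so the tool may use an estimator instead; the function name here is hypothetical.

```python
def distinct_summary(values):
    """Return (distinct count, unique percentage rounded to one decimal)."""
    distinct = len(set(values))
    unique_pct = round(100 * distinct / len(values), 1)
    return distinct, unique_pct
```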

Length

Mean 5
Standard Deviation 0
Median 5
Minimum 5
Maximum 5

Sample

1st row * 003
2nd row * 004
3rd row * 037
4th row * 038
5th row * 039

Letter

Count 0
Lowercase Letter 0
Space Separator 509
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 1527
  • DRG has words of constant length

DRG Desc

categorical

Approximate Distinct Count 509
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Memory Size 53.8 KB

Length

Mean 43.2279
Standard Deviation 15.6636
Median 43
Minimum 9
Maximum 116
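
The Length statistics describe the number of characters in each value. A minimal sketch of how such a summary could be computed is shown below. It uses the sample standard deviation; whether the report uses the sample or population form is not stated, so that choice is an assumption, as is the function name.

```python
import statistics


def length_summary(values):
    """Summarize the character lengths of a column of strings."""
    lengths = [len(v) for v in values]
    return {
        "mean": statistics.mean(lengths),
        # Sample standard deviation (assumption); needs at least two values.
        "std": statistics.stdev(lengths) if len(lengths) > 1 else 0.0,
        "median": statistics.median(lengths),
        "min": min(lengths),
        "max": max(lengths),
    }
```

For a constant-length column such as Filename, this returns a mean, median, minimum, and maximum of 24 with a standard deviation of 0.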

Sample

1st row ECMO OR TRACH W MV...
2nd row TRACH W MV >96 HRS...
3rd row Extracranial proce...
4th row Extracranial proce...
5th row Extracranial proce...

Letter

Count 18115
Lowercase Letter 13192
Space Separator 3005
Uppercase Letter 4923
Dash Punctuation 31
Decimal Number 24
  • The most frequent word (w) occurs over 2.58 times as often as the second most frequent word (mcc)

Average of TOTAL CHARGES

categorical

Approximate Distinct Count 509
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Memory Size 37.8 KB

Length

Mean 11.0275
Standard Deviation 0.3125
Median 11
Minimum 10
Maximum 12

Sample

1st row $486,707.33
2nd row $314,344.96
3rd row $104,681.30
4th row $50,246.94
5th row $41,424.26

Letter

Count 0
Lowercase Letter 0
Space Separator 509
Uppercase Letter 0
Dash Punctuation 0
Decimal Number 3577
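
The sample rows show values such as "$486,707.33" stored as text, which is why this column is profiled as categorical rather than numeric. The Space Separator count of 509 also suggests one whitespace character per value. A minimal sketch of converting such strings to numbers follows; the function name is hypothetical, and the format is assumed to be a dollar sign, comma thousands separators, and a decimal point.

```python
from decimal import Decimal


def parse_charge(text: str) -> Decimal:
    """Convert a string like '$486,707.33' to a Decimal amount."""
    # Strip surrounding whitespace, then drop the currency symbol and commas.
    cleaned = text.strip().replace("$", "").replace(",", "")
    return Decimal(cleaned)
```

Decimal is used rather than float so that the cent values are preserved exactly.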

Filename

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.2%
Missing 0
Missing (%) 0.0%
Memory Size 44.2 KB

Length

Mean 24
Standard Deviation 0
Median 24
Minimum 24
Maximum 24

Sample

1st row ms-drg_internet-20...
2nd row ms-drg_internet-20...
3rd row ms-drg_internet-20...
4th row ms-drg_internet-20...
5th row ms-drg_internet-20...

Letter

Count 8144
Lowercase Letter 8144
Space Separator 0
Uppercase Letter 0
Dash Punctuation 1018
Decimal Number 2036
  • Filename has words of constant length
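
The Letter tables appear to count characters by Unicode general category. The sketch below reproduces the Filename figures under two assumptions: that the labels map to the categories Ll, Lu, Zs, Pd, and Nd, and that "Count" is the sum of lowercase and uppercase letters. The function name and mapping are illustrative, not taken from the profiling tool.

```python
import unicodedata
from collections import Counter

# Assumed mapping from Unicode general category to the report's labels.
CATEGORY_LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Nd": "Decimal Number",
}


def character_counts(values):
    """Count characters per labeled Unicode category across all values."""
    counts = Counter({label: 0 for label in CATEGORY_LABELS.values()})
    for value in values:
        for ch in value:
            label = CATEGORY_LABELS.get(unicodedata.category(ch))
            if label is not None:
                counts[label] += 1
    # Assumption: "Count" is the total of lowercase and uppercase letters.
    counts["Count"] = counts["Lowercase Letter"] + counts["Uppercase Letter"]
    return dict(counts)
```

Each copy of "ms-drg_internet-2020.csv" contains 16 lowercase letters, 2 dashes, and 4 digits, so 509 rows give 8144, 1018, and 2036, matching the table above. The underscore and period fall into other categories and are not counted.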

Missing Values