Overview

Dataset Statistics

Number of Variables 4
Number of Rows 20114
Missing Cells 0
Missing Cells (%) 0.0%
Duplicate Rows 0
Duplicate Rows (%) 0.0%
Total Size in Memory 4.8 MB
Average Row Size in Memory 249.3 B
Variable Types
  • Numerical: 1
  • Categorical: 3

Dataset Insights

295350 is skewed Skewed
88304 AP Bill Surgical Pathology Level III Complexity has a high cardinality: 19823 distinct values High Cardinality
$189.15 has a high cardinality: 5646 distinct values High Cardinality
Filename has constant value "cdm_for_internet_10-1-2020.csv" Constant
Filename has constant length 30 Constant Length

Variables

295350

numerical

Approximate Distinct Count 20114
Approximate Unique (%) 100.0%
Missing 0
Missing (%) 0.0%
Infinite 0
Infinite (%) 0.0%
Memory Size 314.3 KB
Mean 1.6495e+07
Minimum 295352
Maximum 5.0408e+07
Zeros 0
Zeros (%) 0.0%
Negatives 0
Negatives (%) 0.0%
  • 295350 is skewed right (γ1 = 0.966)

Quantile Statistics

Minimum 295352
5-th Percentile 1.3402e+06
Q1 2.595e+06
Median 8.829e+06
Q3 3.0556e+07
95-th Percentile 4.9967e+07
Maximum 5.0408e+07
Range 5.0113e+07
IQR 2.7961e+07

Descriptive Statistics

Mean 1.6495e+07
Standard Deviation 1.8739e+07
Variance 3.5116e+14
Sum 3.3177e+11
Skewness 0.966
Kurtosis -0.8215
Coefficient of Variation 1.1361
  • 295350 is not normally distributed (p-value 1.418243163789096e-10)

88304 AP Bill Surgical Pathology Level III Complexity

categorical

Approximate Distinct Count 19823
Approximate Unique (%) 98.6%
Missing 0
Missing (%) 0.0%
Memory Size 1.8 MB
  • The largest value (Yes) is over 1.75 times larger than the second largest value (Phonophoresis Charge)

Length

Mean 30.4235
Standard Deviation 10.9017
Median 29
Minimum 3
Maximum 100

Sample

1st row 88305 AP Bill Surg...
2nd row 88307 AP Bill Surg...
3rd row 88309 AP Bill Surg...
4th row 88311 AP Bill Deca...
5th row 88312 AP Bill Spec...

Letter

Count 430114
Lowercase Letter 59324
Space Separator 85167
Uppercase Letter 370790
Dash Punctuation 3612
Decimal Number 60956
  • 88304 AP Bill Surgical Pathology Level III Complexity contains many words: 12725 words
  • The largest value (x) is over 1.66 times larger than the second largest value (screw)

$189.15

categorical

Approximate Distinct Count 5646
Approximate Unique (%) 28.1%
Missing 0
Missing (%) 0.0%
Memory Size 1.4 MB

Length

Mean 9.7259
Standard Deviation 1.5615
Median 9
Minimum 6
Maximum 13

Sample

1st row $275.00
2nd row $383.45
3rd row $528.00
4th row $20.80
5th row $167.90

Letter

Count 0
Lowercase Letter 0
Space Separator 41784
Uppercase Letter 0
Dash Punctuation 778
Decimal Number 103612
  • $189.15 contains many words: 5645 words
  • The largest value (001) is over 3.35 times larger than the second largest value (170000)

Filename

categorical

Approximate Distinct Count 1
Approximate Unique (%) 0.0%
Missing 0
Missing (%) 0.0%
Memory Size 1.8 MB

Length

Mean 30
Standard Deviation 0
Median 30
Minimum 30
Maximum 30

Sample

1st row cdm_for_internet_1...
2nd row cdm_for_internet_1...
3rd row cdm_for_internet_1...
4th row cdm_for_internet_1...
5th row cdm_for_internet_1...

Letter

Count 341938
Lowercase Letter 341938
Space Separator 0
Uppercase Letter 0
Dash Punctuation 40228
Decimal Number 140798
  • Filename has words of constant length

Interactions

Correlations

Missing Values