Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 8399 |
| Missing cells | 63 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.3 MiB |
| Average record size in memory | 168.0 B |
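The percentages and sizes in this table follow from simple arithmetic on the reported counts. The sketch below recomputes them. The 8-bytes-per-value figure is an assumption: it matches the reported 168.0 B record size for 21 columns, but the report does not state it.

```python
# Recompute the dataset-level figures from the counts reported above.
n_rows = 8399
n_cols = 21
missing_cells = 63

total_cells = n_rows * n_cols                     # 176379 cells in total
missing_pct = 100 * missing_cells / total_cells   # about 0.036, displayed as "< 0.1%"

# Assumption: every value occupies 8 bytes (a 64-bit number or an object pointer).
BYTES_PER_VALUE = 8
record_bytes = n_cols * BYTES_PER_VALUE           # bytes per row
column_kib = n_rows * BYTES_PER_VALUE / 1024      # size of one column in KiB
total_mib = n_rows * record_bytes / 1024 ** 2     # size of the whole table in MiB

print(f"missing cells: {missing_pct:.3f}%")
print(f"record: {record_bytes} B, column: {column_kib:.1f} KiB, total: {total_mib:.1f} MiB")
```

Under that assumption the per-column figure also agrees with the 65.6 KiB memory size listed for each variable.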
Variable types
| CAT | 12 |
|---|---|
| NUM | 9 |
Reproduction
| Analysis started | 2020-09-04 08:59:34.891417 |
|---|---|
| Analysis finished | 2020-09-04 09:00:02.160756 |
| Duration | 27.27 seconds |
| Version | pandas-profiling v2.8.0 |
| Command line | pandas_profiling --config_file config.yaml [YOUR_FILE.csv] |
| Download configuration | config.yaml |
Warnings
| Order Date has a high cardinality: 1418 distinct values | High cardinality |
|---|---|
| Customer Name has a high cardinality: 795 distinct values | High cardinality |
| Product Name has a high cardinality: 1263 distinct values | High cardinality |
| Ship Date has a high cardinality: 1450 distinct values | High cardinality |
| Order ID is highly correlated with Row ID | High correlation |
| Row ID is highly correlated with Order ID | High correlation |
| Region is highly correlated with Province | High correlation |
| Province is highly correlated with Region | High correlation |
| Product Sub-Category is highly correlated with Product Category | High correlation |
| Product Category is highly correlated with Product Sub-Category | High correlation |
| Row ID has unique values | Unique |
| Discount has 756 (9.0%) zeros | Zeros |
Row ID
Real number (ℝ≥0)
| Distinct count | 8399 |
|---|---|
| Unique (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4200.0 |
|---|---|
| Minimum | 1 |
| Maximum | 8399 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 420.9 |
| Q1 | 2100.5 |
| median | 4200 |
| Q3 | 6299.5 |
| 95-th percentile | 7979.1 |
| Maximum | 8399 |
| Range | 8398 |
| Interquartile range (IQR) | 4199 |
Descriptive statistics
| Standard deviation | 2424.726789 |
|---|---|
| Coefficient of variation (CV) | 0.5773159021 |
| Kurtosis | -1.2 |
| Mean | 4200 |
| Median Absolute Deviation (MAD) | 2100 |
| Skewness | 0 |
| Sum | 35275800 |
| Variance | 5879300 |
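These figures are consistent with a column that holds each integer from 1 to 8399 exactly once. A minimal sketch using only the Python standard library reproduces them. The `quantile` helper is written here for illustration and assumes linear interpolation between order statistics; that the report computes percentiles this way is an assumption.

```python
import math
import statistics

values = list(range(1, 8400))  # the integers 1..8399, one per row

def quantile(sorted_values, q):
    """Quantile by linear interpolation between neighbouring order statistics."""
    pos = (len(sorted_values) - 1) * q
    lower = math.floor(pos)
    upper = math.ceil(pos)
    frac = pos - lower
    return sorted_values[lower] + (sorted_values[upper] - sorted_values[lower]) * frac

mean = statistics.mean(values)
total = sum(values)
variance = statistics.variance(values)      # sample variance (n - 1 denominator)
std = math.sqrt(variance)
median = statistics.median(values)
# Median absolute deviation: median distance of each value from the median.
mad = statistics.median(abs(v - median) for v in values)

print(mean, total, variance, round(std, 6), mad)
print(round(quantile(values, 0.05), 1), quantile(values, 0.25), quantile(values, 0.75))
```

For the integers 1..n the sample variance equals n(n + 1)/12, which gives 8399 × 8400 / 12 = 5879300, the value in the table.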
| Value | Count | Frequency (%) | |
| 2047 | 1 | < 0.1% | |
| 3371 | 1 | < 0.1% | |
| 7473 | 1 | < 0.1% | |
| 1330 | 1 | < 0.1% | |
| 3379 | 1 | < 0.1% | |
| 5432 | 1 | < 0.1% | |
| 7481 | 1 | < 0.1% | |
| 1338 | 1 | < 0.1% | |
| 3387 | 1 | < 0.1% | |
| 5440 | 1 | < 0.1% | |
| Other values (8389) | 8389 | 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 8399 | 1 | < 0.1% | |
| 8398 | 1 | < 0.1% | |
| 8397 | 1 | < 0.1% | |
| 8396 | 1 | < 0.1% | |
| 8395 | 1 | < 0.1% |
Order ID
Real number (ℝ≥0)
| Distinct count | 5496 |
|---|---|
| Unique (%) | 65.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 29965.179783307536 |
|---|---|
| Minimum | 3 |
| Maximum | 59973 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.6 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 2818 |
| Q1 | 15011.5 |
| median | 29857 |
| Q3 | 44596 |
| 95-th percentile | 57061 |
| Maximum | 59973 |
| Range | 59970 |
| Interquartile range (IQR) | 29584.5 |
Descriptive statistics
| Standard deviation | 17260.88345 |
|---|---|
| Coefficient of variation (CV) | 0.5760313661 |
| Kurtosis | -1.178316663 |
| Mean | 29965.17978 |
| Median Absolute Deviation (MAD) | 14778 |
| Skewness | 0.00381089223 |
| Sum | 251677545 |
| Variance | 297938097.4 |
| Value | Count | Frequency (%) | |
| 24132 | 6 | 0.1% | |
| 43745 | 6 | 0.1% | |
| 48452 | 5 | 0.1% | |
| 13540 | 5 | 0.1% | |
| 8995 | 5 | 0.1% | |
| 1444 | 5 | 0.1% | |
| 58784 | 5 | 0.1% | |
| 33797 | 5 | 0.1% | |
| 12067 | 5 | 0.1% | |
| 15109 | 5 | 0.1% | |
| Other values (5486) | 8347 | 99.4% |
| Value | Count | Frequency (%) | |
| 3 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% | |
| 32 | 4 | < 0.1% | |
| 35 | 2 | < 0.1% | |
| 36 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 59973 | 2 | < 0.1% | |
| 59971 | 3 | < 0.1% | |
| 59969 | 2 | < 0.1% | |
| 59943 | 1 | < 0.1% | |
| 59942 | 1 | < 0.1% |
Order Date
Categorical
| Distinct count | 1418 |
|---|---|
| Unique (%) | 16.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| 9/15/2011 | 20 | 0.2% | |
| 3/28/2012 | 20 | 0.2% | |
| 12/12/2010 | 18 | 0.2% | |
| 8/4/2010 | 17 | 0.2% | |
| 11/19/2011 | 17 | 0.2% | |
| 2/27/2010 | 17 | 0.2% | |
| 4/20/2010 | 17 | 0.2% | |
| 2/4/2009 | 16 | 0.2% | |
| 10/30/2012 | 16 | 0.2% | |
| 10/9/2010 | 15 | 0.2% | |
| Other values (1408) | 8226 | 97.9% |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.948803429 |
| Min length | 8 |
Order Priority
Categorical
| Distinct count | 5 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| High | 1768 | 21.1% | |
| Low | 1720 | 20.5% | |
| Not Specified | 1672 | 19.9% | |
| Medium | 1631 | 19.4% | |
| Critical | 1608 | 19.1% |
Length
| Max length | 13 |
|---|---|
| Median length | 6 |
| Mean length | 6.7410406 |
| Min length | 3 |
Order Quantity
Real number (ℝ≥0)
| Distinct count | 50 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 25.571734730325037 |
|---|---|
| Minimum | 1 |
| Maximum | 50 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.6 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 3 |
| Q1 | 13 |
| median | 26 |
| Q3 | 38 |
| 95-th percentile | 48 |
| Maximum | 50 |
| Range | 49 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 14.48107111 |
|---|---|
| Coefficient of variation (CV) | 0.5662920903 |
| Kurtosis | -1.208020269 |
| Mean | 25.57173473 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.01731778213 |
| Sum | 214777 |
| Variance | 209.7014206 |
| Value | Count | Frequency (%) | |
| 31 | 202 | 2.4% | |
| 4 | 196 | 2.3% | |
| 39 | 195 | 2.3% | |
| 46 | 193 | 2.3% | |
| 23 | 192 | 2.3% | |
| 24 | 192 | 2.3% | |
| 3 | 189 | 2.3% | |
| 42 | 189 | 2.3% | |
| 43 | 184 | 2.2% | |
| 41 | 183 | 2.2% | |
| Other values (40) | 6484 | 77.2% |
| Value | Count | Frequency (%) | |
| 1 | 165 | 2.0% | |
| 2 | 152 | 1.8% | |
| 3 | 189 | 2.3% | |
| 4 | 196 | 2.3% | |
| 5 | 166 | 2.0% |
| Value | Count | Frequency (%) | |
| 50 | 182 | 2.2% | |
| 49 | 136 | 1.6% | |
| 48 | 172 | 2.0% | |
| 47 | 166 | 2.0% | |
| 46 | 193 | 2.3% |
Sales
Real number (ℝ≥0)
| Distinct count | 8153 |
|---|---|
| Unique (%) | 97.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1775.8781788308133 |
|---|---|
| Minimum | 2.24 |
| Maximum | 89061.05 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.6 KiB |
Quantile statistics
| Minimum | 2.24 |
|---|---|
| 5-th percentile | 34.178 |
| Q1 | 143.195 |
| median | 449.42 |
| Q3 | 1709.32 |
| 95-th percentile | 7844.335 |
| Maximum | 89061.05 |
| Range | 89058.81 |
| Interquartile range (IQR) | 1566.125 |
Descriptive statistics
| Standard deviation | 3585.050525 |
|---|---|
| Coefficient of variation (CV) | 2.018748002 |
| Kurtosis | 60.92837614 |
| Mean | 1775.878179 |
| Median Absolute Deviation (MAD) | 381.95 |
| Skewness | 5.386982374 |
| Sum | 14915600.82 |
| Variance | 12852587.27 |
| Value | Count | Frequency (%) | |
| 46.94 | 3 | < 0.1% | |
| 75.19 | 3 | < 0.1% | |
| 10.48 | 3 | < 0.1% | |
| 224.58 | 3 | < 0.1% | |
| 43.29 | 3 | < 0.1% | |
| 127.56 | 3 | < 0.1% | |
| 151.19 | 3 | < 0.1% | |
| 74.02 | 3 | < 0.1% | |
| 20.19 | 3 | < 0.1% | |
| 19.36 | 3 | < 0.1% | |
| Other values (8143) | 8369 | 99.6% |
| Value | Count | Frequency (%) | |
| 2.24 | 1 | < 0.1% | |
| 3.2 | 1 | < 0.1% | |
| 3.23 | 1 | < 0.1% | |
| 3.41 | 1 | < 0.1% | |
| 3.42 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 89061.05 | 1 | < 0.1% | |
| 45923.76 | 1 | < 0.1% | |
| 41343.21 | 1 | < 0.1% | |
| 33367.85 | 1 | < 0.1% | |
| 29884.6 | 1 | < 0.1% |
Discount
Real number (ℝ≥0)
| Distinct count | 16 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.04967138945112514 |
|---|---|
| Minimum | 0.0 |
| Maximum | 0.25 |
| Zeros | 756 |
| Zeros (%) | 9.0% |
| Memory size | 65.6 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.02 |
| median | 0.05 |
| Q3 | 0.08 |
| 95-th percentile | 0.1 |
| Maximum | 0.25 |
| Range | 0.25 |
| Interquartile range (IQR) | 0.06 |
Descriptive statistics
| Standard deviation | 0.0318230196 |
|---|---|
| Coefficient of variation (CV) | 0.6406710172 |
| Kurtosis | -0.9594110633 |
| Mean | 0.04967138945 |
| Median Absolute Deviation (MAD) | 0.03 |
| Skewness | 0.07391696254 |
| Sum | 417.19 |
| Variance | 0.001012704577 |
| Value | Count | Frequency (%) | |
| 0.01 | 806 | 9.6% | |
| 0.05 | 786 | 9.4% | |
| 0.03 | 779 | 9.3% | |
| 0.09 | 778 | 9.3% | |
| 0.04 | 770 | 9.2% | |
| 0.08 | 765 | 9.1% | |
| 0.02 | 765 | 9.1% | |
| 0 | 756 | 9.0% | |
| 0.1 | 745 | 8.9% | |
| 0.06 | 734 | 8.7% | |
| Other values (6) | 715 | 8.5% |
| Value | Count | Frequency (%) | |
| 0 | 756 | 9.0% | |
| 0.01 | 806 | 9.6% | |
| 0.02 | 765 | 9.1% | |
| 0.03 | 779 | 9.3% | |
| 0.04 | 770 | 9.2% |
| Value | Count | Frequency (%) | |
| 0.25 | 1 | < 0.1% | |
| 0.21 | 1 | < 0.1% | |
| 0.17 | 1 | < 0.1% | |
| 0.16 | 1 | < 0.1% | |
| 0.11 | 1 | < 0.1% |
Ship Mode
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| Regular Air | 6270 | 74.7% | |
| Delivery Truck | 1146 | 13.6% | |
| Express Air | 983 | 11.7% |
Length
| Max length | 14 |
|---|---|
| Median length | 11 |
| Mean length | 11.40933444 |
| Min length | 11 |
Profit
Real number (ℝ)
| Distinct count | 7807 |
|---|---|
| Unique (%) | 93.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 181.1844243362305 |
|---|---|
| Minimum | -14140.7 |
| Maximum | 27220.69 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.6 KiB |
Quantile statistics
| Minimum | -14140.7 |
|---|---|
| 5-th percentile | -592.439 |
| Q1 | -83.315 |
| median | -1.5 |
| Q3 | 162.75 |
| 95-th percentile | 1542.309 |
| Maximum | 27220.69 |
| Range | 41361.39 |
| Interquartile range (IQR) | 246.065 |
Descriptive statistics
| Standard deviation | 1196.653371 |
|---|---|
| Coefficient of variation (CV) | 6.604615026 |
| Kurtosis | 67.34970524 |
| Mean | 181.1844243 |
| Median Absolute Deviation (MAD) | 104.33 |
| Skewness | 3.647238938 |
| Sum | 1521767.98 |
| Variance | 1431979.291 |
| Value | Count | Frequency (%) | |
| -969.05 | 8 | 0.1% | |
| 11.65 | 6 | 0.1% | |
| -433.29 | 6 | 0.1% | |
| -528.65 | 5 | 0.1% | |
| -1331.55 | 5 | 0.1% | |
| -715.78 | 5 | 0.1% | |
| -505.98 | 5 | 0.1% | |
| -513.79 | 4 | < 0.1% | |
| -66.87 | 4 | < 0.1% | |
| 0.35 | 4 | < 0.1% | |
| Other values (7797) | 8347 | 99.4% |
| Value | Count | Frequency (%) | |
| -14140.7 | 1 | < 0.1% | |
| -12558 | 1 | < 0.1% | |
| -11984.4 | 1 | < 0.1% | |
| -11861.46 | 1 | < 0.1% | |
| -11769.17 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 27220.69 | 1 | < 0.1% | |
| 14440.39 | 1 | < 0.1% | |
| 13340.26 | 1 | < 0.1% | |
| 12748.86 | 1 | < 0.1% | |
| 12606.81 | 1 | < 0.1% |
Unit Price
Real number (ℝ≥0)
| Distinct count | 751 |
|---|---|
| Unique (%) | 8.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 89.34625907846171 |
|---|---|
| Minimum | 0.99 |
| Maximum | 6783.02 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.6 KiB |
Quantile statistics
| Minimum | 0.99 |
|---|---|
| 5-th percentile | 2.88 |
| Q1 | 6.48 |
| median | 20.99 |
| Q3 | 85.99 |
| 95-th percentile | 320.64 |
| Maximum | 6783.02 |
| Range | 6782.03 |
| Interquartile range (IQR) | 79.51 |
Descriptive statistics
| Standard deviation | 290.354383 |
|---|---|
| Coefficient of variation (CV) | 3.249765418 |
| Kurtosis | 271.1687334 |
| Mean | 89.34625908 |
| Median Absolute Deviation (MAD) | 17.01 |
| Skewness | 14.12779334 |
| Sum | 750419.23 |
| Variance | 84305.66773 |
| Value | Count | Frequency (%) | |
| 6.48 | 264 | 3.1% | |
| 65.99 | 192 | 2.3% | |
| 4.98 | 136 | 1.6% | |
| 125.99 | 115 | 1.4% | |
| 5.98 | 102 | 1.2% | |
| 2.88 | 81 | 1.0% | |
| 30.98 | 73 | 0.9% | |
| 20.99 | 73 | 0.9% | |
| 35.99 | 70 | 0.8% | |
| 205.99 | 66 | 0.8% | |
| Other values (741) | 7227 | 86.0% |
| Value | Count | Frequency (%) | |
| 0.99 | 2 | < 0.1% | |
| 1.14 | 10 | 0.1% | |
| 1.26 | 13 | 0.2% | |
| 1.48 | 12 | 0.1% | |
| 1.6 | 5 | 0.1% |
| Value | Count | Frequency (%) | |
| 6783.02 | 7 | 0.1% | |
| 3502.14 | 6 | 0.1% | |
| 3499.99 | 7 | 0.1% | |
| 2550.14 | 7 | 0.1% | |
| 2036.48 | 6 | 0.1% |
Shipping Cost
Real number (ℝ≥0)
| Distinct count | 652 |
|---|---|
| Unique (%) | 7.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.838556971067984 |
|---|---|
| Minimum | 0.49 |
| Maximum | 164.73 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.6 KiB |
Quantile statistics
| Minimum | 0.49 |
|---|---|
| 5-th percentile | 0.8 |
| Q1 | 3.3 |
| median | 6.07 |
| Q3 | 13.99 |
| 95-th percentile | 55.351 |
| Maximum | 164.73 |
| Range | 164.24 |
| Interquartile range (IQR) | 10.69 |
Descriptive statistics
| Standard deviation | 17.26405197 |
|---|---|
| Coefficient of variation (CV) | 1.344703459 |
| Kurtosis | 7.751587174 |
| Mean | 12.83855697 |
| Median Absolute Deviation (MAD) | 3.61 |
| Skewness | 2.553800841 |
| Sum | 107831.04 |
| Variance | 298.0474903 |
| Value | Count | Frequency (%) | |
| 19.99 | 352 | 4.2% | |
| 8.99 | 321 | 3.8% | |
| 1.99 | 247 | 2.9% | |
| 0.5 | 190 | 2.3% | |
| 0.99 | 144 | 1.7% | |
| 4 | 143 | 1.7% | |
| 1.49 | 138 | 1.6% | |
| 0.7 | 138 | 1.6% | |
| 24.49 | 132 | 1.6% | |
| 2.99 | 124 | 1.5% | |
| Other values (642) | 6470 | 77.0% |
| Value | Count | Frequency (%) | |
| 0.49 | 34 | 0.4% | |
| 0.5 | 190 | 2.3% | |
| 0.7 | 138 | 1.6% | |
| 0.71 | 22 | 0.3% | |
| 0.73 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 164.73 | 1 | < 0.1% | |
| 154.12 | 1 | < 0.1% | |
| 147.12 | 2 | < 0.1% | |
| 143.71 | 1 | < 0.1% | |
| 130 | 1 | < 0.1% |
Customer Name
Categorical
| Distinct count | 795 |
|---|---|
| Unique (%) | 9.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| Darren Budd | 41 | 0.5% | |
| Ed Braxton | 38 | 0.5% | |
| Brad Thomas | 35 | 0.4% | |
| Carlos Soltero | 33 | 0.4% | |
| Patrick Jones | 30 | 0.4% | |
| Tony Sayre | 29 | 0.3% | |
| Lena Creighton | 28 | 0.3% | |
| Joy Smith | 28 | 0.3% | |
| Jack O'Briant | 28 | 0.3% | |
| Giulietta Dortch | 28 | 0.3% | |
| Other values (785) | 8081 | 96.2% |
Length
| Max length | 22 |
|---|---|
| Median length | 13 |
| Mean length | 12.86712704 |
| Min length | 7 |
Province
Categorical
| Distinct count | 13 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| Ontario | 1826 | 21.7% | |
| British Columbia | 1126 | 13.4% | |
| Saskachewan | 913 | 10.9% | |
| Alberta | 865 | 10.3% | |
| Manitoba | 793 | 9.4% | |
| Quebec | 781 | 9.3% | |
| Yukon | 542 | 6.5% | |
| Nova Scotia | 464 | 5.5% | |
| Northwest Territories | 394 | 4.7% | |
| New Brunswick | 323 | 3.8% | |
| Other values (3) | 372 | 4.4% |
Length
| Max length | 21 |
|---|---|
| Median length | 8 |
| Mean length | 9.997618764 |
| Min length | 5 |
Region
Categorical
| Distinct count | 8 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| West | 1991 | 23.7% | |
| Ontario | 1826 | 21.7% | |
| Prarie | 1706 | 20.3% | |
| Atlantic | 1080 | 12.9% | |
| Quebec | 781 | 9.3% | |
| Yukon | 542 | 6.5% | |
| Northwest Territories | 394 | 4.7% | |
| Nunavut | 79 | 0.9% |
Length
| Max length | 21 |
|---|---|
| Median length | 6 |
| Mean length | 6.649005834 |
| Min length | 4 |
Customer Segment
Categorical
| Distinct count | 4 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| Corporate | 3076 | 36.6% | |
| Home Office | 2032 | 24.2% | |
| Consumer | 1649 | 19.6% | |
| Small Business | 1642 | 19.5% |
Length
| Max length | 14 |
|---|---|
| Median length | 9 |
| Mean length | 10.26503155 |
| Min length | 8 |
Product Category
Categorical
| Distinct count | 3 |
|---|---|
| Unique (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| Office Supplies | 4610 | 54.9% | |
| Technology | 2065 | 24.6% | |
| Furniture | 1724 | 20.5% |
Length
| Max length | 15 |
|---|---|
| Median length | 15 |
| Mean length | 12.5391118 |
| Min length | 9 |
Product Sub-Category
Categorical
| Distinct count | 17 |
|---|---|
| Unique (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| Paper | 1225 | 14.6% | |
| Binders and Binder Accessories | 915 | 10.9% | |
| Telephones and Communication | 883 | 10.5% | |
| Office Furnishings | 788 | 9.4% | |
| Computer Peripherals | 758 | 9.0% | |
| Pens & Art Supplies | 633 | 7.5% | |
| Storage & Organization | 546 | 6.5% | |
| Appliances | 434 | 5.2% | |
| Chairs & Chairmats | 386 | 4.6% | |
| Tables | 361 | 4.3% | |
| Other values (7) | 1470 | 17.5% |
Length
| Max length | 30 |
|---|---|
| Median length | 18 |
| Mean length | 17.08096202 |
| Min length | 5 |
Product Name
Categorical
| Distinct count | 1263 |
|---|---|
| Unique (%) | 15.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| Global High-Back Leather Tilter, Burgundy | 24 | 0.3% | |
| Master Giant Foot® Doorstop, Safety Yellow | 22 | 0.3% | |
| Bevis 36 x 72 Conference Tables | 22 | 0.3% | |
| Fiskars® Softgrip Scissors | 22 | 0.3% | |
| BoxOffice By Design Rectangular and Half-Moon Meeting Room Tables | 22 | 0.3% | |
| Wilson Jones Hanging View Binder, White, 1" | 21 | 0.3% | |
| StarTAC 7760 | 20 | 0.2% | |
| 80 Minute CD-R Spindle, 100/Pack - Staples | 20 | 0.2% | |
| Peel & Seel® Recycled Catalog Envelopes, Brown | 19 | 0.2% | |
| Office Star - Mid Back Dual function Ergonomic High Back Chair with 2-Way Adjustable Arms | 19 | 0.2% | |
| Other values (1253) | 8188 | 97.5% |
Length
| Max length | 98 |
|---|---|
| Median length | 34 |
| Mean length | 34.35170854 |
| Min length | 3 |
Product Container
Categorical
| Distinct count | 7 |
|---|---|
| Unique (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| Small Box | 4347 | 51.8% | |
| Wrap Bag | 1168 | 13.9% | |
| Small Pack | 956 | 11.4% | |
| Jumbo Drum | 624 | 7.4% | |
| Jumbo Box | 532 | 6.3% | |
| Large Box | 406 | 4.8% | |
| Medium Box | 366 | 4.4% |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.092630075 |
| Min length | 8 |
Product Base Margin
Real number (ℝ≥0)
| Distinct count | 51 |
|---|---|
| Unique (%) | 0.6% |
| Missing | 63 |
| Missing (%) | 0.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5125131957773512 |
|---|---|
| Minimum | 0.35 |
| Maximum | 0.85 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 65.6 KiB |
Quantile statistics
| Minimum | 0.35 |
|---|---|
| 5-th percentile | 0.36 |
| Q1 | 0.38 |
| median | 0.52 |
| Q3 | 0.59 |
| 95-th percentile | 0.78 |
| Maximum | 0.85 |
| Range | 0.5 |
| Interquartile range (IQR) | 0.21 |
Descriptive statistics
| Standard deviation | 0.1355889411 |
|---|---|
| Coefficient of variation (CV) | 0.2645569758 |
| Kurtosis | -0.6608702254 |
| Mean | 0.5125131958 |
| Median Absolute Deviation (MAD) | 0.12 |
| Skewness | 0.5593995872 |
| Sum | 4272.31 |
| Variance | 0.01838436095 |
| Value | Count | Frequency (%) | |
| 0.37 | 761 | 9.1% | |
| 0.38 | 678 | 8.1% | |
| 0.36 | 628 | 7.5% | |
| 0.59 | 497 | 5.9% | |
| 0.39 | 482 | 5.7% | |
| 0.56 | 459 | 5.5% | |
| 0.57 | 459 | 5.5% | |
| 0.4 | 408 | 4.9% | |
| 0.58 | 387 | 4.6% | |
| 0.55 | 314 | 3.7% | |
| Other values (41) | 3263 | 38.8% |
| Value | Count | Frequency (%) | |
| 0.35 | 262 | 3.1% | |
| 0.36 | 628 | 7.5% | |
| 0.37 | 761 | 9.1% | |
| 0.38 | 678 | 8.1% | |
| 0.39 | 482 | 5.7% |
| Value | Count | Frequency (%) | |
| 0.85 | 36 | 0.4% | |
| 0.84 | 25 | 0.3% | |
| 0.83 | 83 | 1.0% | |
| 0.82 | 32 | 0.4% | |
| 0.81 | 73 | 0.9% |
Ship Date
Categorical
| Distinct count | 1450 |
|---|---|
| Unique (%) | 17.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 65.6 KiB |
| Value | Count | Frequency (%) | |
| 5/21/2011 | 19 | 0.2% | |
| 4/11/2009 | 16 | 0.2% | |
| 3/30/2012 | 16 | 0.2% | |
| 10/9/2009 | 16 | 0.2% | |
| 5/9/2012 | 15 | 0.2% | |
| 10/4/2012 | 15 | 0.2% | |
| 3/28/2009 | 15 | 0.2% | |
| 4/15/2012 | 14 | 0.2% | |
| 8/16/2009 | 14 | 0.2% | |
| 5/27/2012 | 14 | 0.2% | |
| Other values (1440) | 8245 | 98.2% |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.9495178 |
| Min length | 8 |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
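The calculation described above can be sketched in a few lines of plain Python. The function name `pearson_r` and the toy data are illustrative assumptions; they are not taken from the report or from the profiled dataset.

```python
import math

def pearson_r(xs, ys):
    """Covariance of X and Y divided by the product of their standard deviations."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    # The 1/n (or 1/(n - 1)) factors cancel between numerator and denominator.
    return cov / math.sqrt(var_x * var_y)

# Toy data (hypothetical): ys is an exact linear function of xs.
xs = [1.0, 2.0, 3.0, 4.0, 5.0]
ys = [2.0, 4.0, 6.0, 8.0, 10.0]

print(pearson_r(xs, ys))                        # perfectly linear, increasing
print(pearson_r(xs, [3 * y + 7 for y in ys]))   # shifted and rescaled copy of ys
print(pearson_r(xs, [-y for y in ys]))          # perfectly linear, decreasing
```

The second call illustrates the invariance property: shifting and rescaling one variable by a positive factor leaves r unchanged.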
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
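As a rough illustration, ρ can be computed by ranking both variables and applying the covariance-over-standard-deviations formula to the ranks. The helper names and the toy data below are assumptions made for this sketch; tied values share the average of their rank positions, which is a common convention but not one the report spells out.

```python
import math

def average_ranks(values):
    """Assign 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared = (i + j) / 2 + 1  # average of the 1-based positions i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = shared
        i = j + 1
    return ranks

def pearson_r(xs, ys):
    """Covariance divided by the product of standard deviations."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)

def spearman_rho(xs, ys):
    """Pearson correlation applied to the rank variables of X and Y."""
    return pearson_r(average_ranks(xs), average_ranks(ys))

# Toy data (hypothetical): ys grows monotonically but not linearly with xs.
xs = [1, 2, 3, 4, 5]
ys = [x ** 3 for x in xs]

print(spearman_rho(xs, ys))  # monotonic relationship is captured fully
print(pearson_r(xs, ys))     # the linear measure falls short of 1
```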
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
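The pair-counting definition above translates directly into code. The sketch below implements that plain definition, in which tied pairs count as neither concordant nor discordant. Statistical libraries often apply a tie correction instead, so their results can differ when ties are present. The function name and the toy data are illustrative assumptions.

```python
from itertools import combinations

def kendall_tau(xs, ys):
    """(concordant - discordant) / total number of pairs, without tie correction."""
    concordant = 0
    discordant = 0
    for (x1, y1), (x2, y2) in combinations(zip(xs, ys), 2):
        product = (x1 - x2) * (y1 - y2)
        if product > 0:
            concordant += 1   # both variables order this pair the same way
        elif product < 0:
            discordant += 1   # the variables order this pair in opposite ways
    n = len(xs)
    total_pairs = n * (n - 1) // 2
    return (concordant - discordant) / total_pairs

# Toy data (hypothetical): one swapped pair out of six.
xs = [1, 2, 3, 4]
ys = [1, 3, 2, 4]

print(kendall_tau(xs, ys))  # 5 concordant, 1 discordant, 6 pairs
```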
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | Row ID | Order ID | Order Date | Order Priority | Order Quantity | Sales | Discount | Ship Mode | Profit | Unit Price | Shipping Cost | Customer Name | Province | Region | Customer Segment | Product Category | Product Sub-Category | Product Name | Product Container | Product Base Margin | Ship Date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 3 | 10/13/2010 | Low | 6 | 261.5400 | 0.04 | Regular Air | -213.25 | 38.94 | 35.00 | Muhammed MacIntyre | Nunavut | Nunavut | Small Business | Office Supplies | Storage & Organization | Eldon Base for stackable storage shelf, platinum | Large Box | 0.80 | 10/20/2010 |
| 1 | 49 | 293 | 10/1/2012 | High | 49 | 10123.0200 | 0.07 | Delivery Truck | 457.81 | 208.16 | 68.02 | Barry French | Nunavut | Nunavut | Consumer | Office Supplies | Appliances | 1.7 Cubic Foot Compact "Cube" Office Refrigerators | Jumbo Drum | 0.58 | 10/2/2012 |
| 2 | 50 | 293 | 10/1/2012 | High | 27 | 244.5700 | 0.01 | Regular Air | 46.71 | 8.69 | 2.99 | Barry French | Nunavut | Nunavut | Consumer | Office Supplies | Binders and Binder Accessories | Cardinal Slant-D® Ring Binder, Heavy Gauge Vinyl | Small Box | 0.39 | 10/3/2012 |
| 3 | 80 | 483 | 7/10/2011 | High | 30 | 4965.7595 | 0.08 | Regular Air | 1198.97 | 195.99 | 3.99 | Clay Rozendal | Nunavut | Nunavut | Corporate | Technology | Telephones and Communication | R380 | Small Box | 0.58 | 7/12/2011 |
| 4 | 85 | 515 | 8/28/2010 | Not Specified | 19 | 394.2700 | 0.08 | Regular Air | 30.94 | 21.78 | 5.94 | Carlos Soltero | Nunavut | Nunavut | Consumer | Office Supplies | Appliances | Holmes HEPA Air Purifier | Medium Box | 0.50 | 8/30/2010 |
| 5 | 86 | 515 | 8/28/2010 | Not Specified | 21 | 146.6900 | 0.05 | Regular Air | 4.43 | 6.64 | 4.95 | Carlos Soltero | Nunavut | Nunavut | Consumer | Furniture | Office Furnishings | G.E. Longer-Life Indoor Recessed Floodlight Bulbs | Small Pack | 0.37 | 8/30/2010 |
| 6 | 97 | 613 | 6/17/2011 | High | 12 | 93.5400 | 0.03 | Regular Air | -54.04 | 7.30 | 7.72 | Carl Jackson | Nunavut | Nunavut | Corporate | Office Supplies | Binders and Binder Accessories | Angle-D Binders with Locking Rings, Label Holders | Small Box | 0.38 | 6/17/2011 |
| 7 | 98 | 613 | 6/17/2011 | High | 22 | 905.0800 | 0.09 | Regular Air | 127.70 | 42.76 | 6.22 | Carl Jackson | Nunavut | Nunavut | Corporate | Office Supplies | Storage & Organization | SAFCO Mobile Desk Side File, Wire Frame | Small Box | NaN | 6/18/2011 |
| 8 | 103 | 643 | 3/24/2011 | High | 21 | 2781.8200 | 0.07 | Express Air | -695.26 | 138.14 | 35.00 | Monica Federle | Nunavut | Nunavut | Corporate | Office Supplies | Storage & Organization | SAFCO Commercial Wire Shelving, Black | Large Box | NaN | 3/25/2011 |
| 9 | 107 | 678 | 2/26/2010 | Low | 44 | 228.4100 | 0.07 | Regular Air | -226.36 | 4.98 | 8.33 | Dorothy Badders | Nunavut | Nunavut | Home Office | Office Supplies | Paper | Xerox 198 | Small Box | 0.38 | 2/26/2010 |
Last rows
| | Row ID | Order ID | Order Date | Order Priority | Order Quantity | Sales | Discount | Ship Mode | Profit | Unit Price | Shipping Cost | Customer Name | Province | Region | Customer Segment | Product Category | Product Sub-Category | Product Name | Product Container | Product Base Margin | Ship Date |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 8389 | 6492 | 46212 | 9/12/2012 | Not Specified | 43 | 322.4700 | 0.09 | Express Air | 72.28 | 7.78 | 2.50 | Grant Donatelli | Alberta | West | Consumer | Office Supplies | Envelopes | Staples #10 Colored Envelopes | Small Box | 0.38 | 9/14/2012 |
| 8390 | 6526 | 46437 | 9/15/2009 | Medium | 49 | 1488.6600 | 0.00 | Regular Air | 385.37 | 29.34 | 7.87 | Mick Brown | Alberta | West | Consumer | Furniture | Office Furnishings | Seth Thomas 14" Putty-Colored Wall Clock | Small Box | 0.54 | 9/17/2009 |
| 8391 | 6657 | 47360 | 10/8/2010 | Not Specified | 25 | 2200.6400 | 0.05 | Delivery Truck | -514.18 | 89.99 | 42.00 | Frank Hawley | Alberta | West | Home Office | Furniture | Chairs & Chairmats | Global Leather Task Chair, Black | Jumbo Drum | 0.66 | 10/10/2010 |
| 8392 | 7396 | 52706 | 7/9/2012 | Low | 34 | 1041.6600 | 0.02 | Express Air | 480.53 | 28.53 | 1.49 | Harry Greene | Alberta | West | Corporate | Office Supplies | Binders and Binder Accessories | Lock-Up Easel 'Spel-Binder' | Small Box | 0.38 | 7/16/2012 |
| 8393 | 7586 | 54279 | 7/30/2011 | High | 41 | 10071.0900 | 0.10 | Delivery Truck | 1977.69 | 264.98 | 17.86 | Harry Greene | Alberta | West | Corporate | Technology | Office Machines | Panasonic KX-P1131 Dot Matrix Printer | Jumbo Drum | 0.58 | 7/31/2011 |
| 8394 | 7765 | 55558 | 8/9/2010 | Medium | 8 | 1294.0400 | 0.05 | Delivery Truck | -323.18 | 150.98 | 66.27 | Mick Brown | Alberta | West | Consumer | Furniture | Bookcases | Bush Mission Pointe Library | Jumbo Box | 0.65 | 8/9/2010 |
| 8395 | 7766 | 55558 | 8/9/2010 | Medium | 23 | 392.5700 | 0.04 | Regular Air | 22.25 | 17.07 | 8.13 | Mick Brown | Alberta | West | Consumer | Office Supplies | Envelopes | Recycled Interoffice Envelopes with Re-Use-A-Seal® Closure, 10 x 13 | Small Box | 0.38 | 8/11/2010 |
| 8396 | 7906 | 56550 | 4/8/2011 | Not Specified | 37 | 823.7800 | 0.03 | Express Air | 343.05 | 22.23 | 5.08 | Frank Hawley | Alberta | West | Home Office | Furniture | Office Furnishings | Executive Impressions 14" | Small Pack | 0.41 | 4/10/2011 |
| 8397 | 7907 | 56550 | 4/8/2011 | Not Specified | 8 | 469.8375 | 0.00 | Regular Air | -159.24 | 65.99 | 8.99 | Frank Hawley | Alberta | West | Home Office | Technology | Telephones and Communication | Talkabout T8367 | Small Box | 0.56 | 4/9/2011 |
| 8398 | 7914 | 56581 | 2/8/2009 | High | 20 | 2026.0100 | 0.10 | Express Air | 580.43 | 105.98 | 13.99 | Grant Donatelli | Alberta | West | Consumer | Furniture | Office Furnishings | Tenex 46" x 60" Computer Anti-Static Chairmat, Rectangular Shaped | Medium Box | 0.65 | 2/11/2009 |