Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory693.4 KiB
Average record size in memory71.0 B

Variable types

Numeric5
Categorical2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

측정항목 is highly overall correlated with 측정기 상태High correlation
평균값 is highly overall correlated with 측정기 상태 and 1 other fieldsHigh correlation
측정기 상태 is highly overall correlated with 측정항목 and 1 other fieldsHigh correlation
국가 기준초과 구분 is highly overall correlated with 평균값 and 1 other fieldsHigh correlation
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분High correlation
국가 기준초과 구분 is highly imbalanced (98.1%)Imbalance
지자체 기준초과 구분 is highly imbalanced (96.1%)Imbalance
평균값 has 305 (3.0%) zerosZeros
측정기 상태 has 6885 (68.8%) zerosZeros

Reproduction

Analysis started2024-07-13 17:51:57.756317
Analysis finished2024-07-13 17:52:03.209813
Duration5.45 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

측정일시
Real number (ℝ)

Distinct667
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0000115 × 109
Minimum2.0000101 × 109
Maximum2.0000128 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:52:03.327210image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0000101 × 109
5-th percentile2.0000102 × 109
Q12.0000108 × 109
median2.0000115 × 109
Q32.0000121 × 109
95-th percentile2.0000127 × 109
Maximum2.0000128 × 109
Range2718
Interquartile range (IQR)1318

Descriptive statistics

Standard deviation797.76559
Coefficient of variation (CV)3.9888051 × 10-7
Kurtosis-1.1863041
Mean2.0000115 × 109
Median Absolute Deviation (MAD)697
Skewness-0.01154799
Sum2.0000115 × 1013
Variance636429.94
MonotonicityNot monotonic
2024-07-14T02:52:03.536893image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2000011519 31
 
0.3%
2000012013 24
 
0.2%
2000012217 24
 
0.2%
2000011821 24
 
0.2%
2000010722 24
 
0.2%
2000010813 24
 
0.2%
2000010805 23
 
0.2%
2000012206 23
 
0.2%
2000011505 23
 
0.2%
2000010416 23
 
0.2%
Other values (657) 9757
97.6%
ValueCountFrequency (%)
2000010100 10
0.1%
2000010101 11
0.1%
2000010102 15
0.1%
2000010103 18
0.2%
2000010104 13
0.1%
2000010105 11
0.1%
2000010106 14
0.1%
2000010107 9
0.1%
2000010108 12
0.1%
2000010109 15
0.1%
ValueCountFrequency (%)
2000012818 10
0.1%
2000012817 14
0.1%
2000012816 12
0.1%
2000012815 19
0.2%
2000012814 12
0.1%
2000012813 13
0.1%
2000012812 17
0.2%
2000012811 13
0.1%
2000012810 11
0.1%
2000012809 11
0.1%

측정소 코드
Real number (ℝ)

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean113.0394
Minimum101
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:52:03.749703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile102
Q1107
median113
Q3119
95-th percentile124
Maximum125
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.2322112
Coefficient of variation (CV)0.063979561
Kurtosis-1.206127
Mean113.0394
Median Absolute Deviation (MAD)6
Skewness-0.0066327395
Sum1130394
Variance52.304878
MonotonicityNot monotonic
2024-07-14T02:52:03.910251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
110 430
 
4.3%
121 428
 
4.3%
119 421
 
4.2%
108 417
 
4.2%
125 417
 
4.2%
101 414
 
4.1%
124 414
 
4.1%
107 411
 
4.1%
117 410
 
4.1%
113 406
 
4.1%
Other values (15) 5832
58.3%
ValueCountFrequency (%)
101 414
4.1%
102 400
4.0%
103 405
4.0%
104 376
3.8%
105 375
3.8%
106 390
3.9%
107 411
4.1%
108 417
4.2%
109 403
4.0%
110 430
4.3%
ValueCountFrequency (%)
125 417
4.2%
124 414
4.1%
123 374
3.7%
122 398
4.0%
121 428
4.3%
120 398
4.0%
119 421
4.2%
118 390
3.9%
117 410
4.1%
116 378
3.8%

측정항목
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3625
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:52:04.039466image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.7521721
Coefficient of variation (CV)0.51322556
Kurtosis-1.2094311
Mean5.3625
Median Absolute Deviation (MAD)3
Skewness-0.20166693
Sum53625
Variance7.5744512
MonotonicityNot monotonic
2024-07-14T02:52:04.222922image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
9 1772
17.7%
3 1704
17.0%
5 1670
16.7%
6 1642
16.4%
1 1619
16.2%
8 1593
15.9%
ValueCountFrequency (%)
1 1619
16.2%
3 1704
17.0%
5 1670
16.7%
6 1642
16.4%
8 1593
15.9%
9 1772
17.7%
ValueCountFrequency (%)
9 1772
17.7%
8 1593
15.9%
6 1642
16.4%
5 1670
16.7%
3 1704
17.0%
1 1619
16.2%

평균값
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct278
Distinct (%)2.8%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-2123.2736
Minimum-9999
Maximum11188
Zeros305
Zeros (%)3.0%
Negative2919
Negative (%)29.2%
Memory size166.0 KiB
2024-07-14T02:52:04.492276image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-9999
5-th percentile-9999
Q1-9.999
median0.009
Q30.1
95-th percentile38
Maximum11188
Range21187
Interquartile range (IQR)10.099

Descriptive statistics

Standard deviation4083.9695
Coefficient of variation (CV)-1.9234306
Kurtosis0.00039266331
Mean-2123.2736
Median Absolute Deviation (MAD)0.191
Skewness-1.39981
Sum-21232736
Variance16678807
MonotonicityNot monotonic
2024-07-14T02:52:04.714756image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-9999.0 2114
 
21.1%
-9.999 630
 
6.3%
0.0 305
 
3.0%
0.004 267
 
2.7%
0.1 253
 
2.5%
0.006 235
 
2.4%
0.003 235
 
2.4%
0.2 219
 
2.2%
0.007 213
 
2.1%
0.005 212
 
2.1%
Other values (268) 5317
53.2%
ValueCountFrequency (%)
-9999.0 2114
21.1%
-3276.8 2
 
< 0.1%
-999.9 171
 
1.7%
-32.768 2
 
< 0.1%
-9.999 630
 
6.3%
0.0 305
 
3.0%
0.001 154
 
1.5%
0.002 199
 
2.0%
0.003 235
 
2.4%
0.004 267
 
2.7%
ValueCountFrequency (%)
11188.0 1
< 0.1%
6635.0 1
< 0.1%
6258.0 1
< 0.1%
5959.0 1
< 0.1%
4730.0 1
< 0.1%
3585.0 1
< 0.1%
3580.0 1
< 0.1%
3030.0 1
< 0.1%
225.0 1
< 0.1%
190.0 1
< 0.1%

측정기 상태
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.2203
Minimum0
Maximum9
Zeros6885
Zeros (%)68.8%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:52:04.878597image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q34
95-th percentile4
Maximum9
Range9
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.8417674
Coefficient of variation (CV)1.5092743
Kurtosis-1.0125014
Mean1.2203
Median Absolute Deviation (MAD)0
Skewness0.90162524
Sum12203
Variance3.3921071
MonotonicityNot monotonic
2024-07-14T02:52:05.024902image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 6885
68.8%
4 2955
29.5%
2 109
 
1.1%
1 35
 
0.4%
8 14
 
0.1%
9 2
 
< 0.1%
ValueCountFrequency (%)
0 6885
68.8%
1 35
 
0.4%
2 109
 
1.1%
4 2955
29.5%
8 14
 
0.1%
9 2
 
< 0.1%
ValueCountFrequency (%)
9 2
 
< 0.1%
8 14
 
0.1%
4 2955
29.5%
2 109
 
1.1%
1 35
 
0.4%
0 6885
68.8%

국가 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9982 
1
 
18

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9982
99.8%
1 18
 
0.2%

Length

2024-07-14T02:52:05.216478image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:52:05.363160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9982
99.8%
1 18
 
0.2%

지자체 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9958 
1
 
42

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9958
99.6%
1 42
 
0.4%

Length

2024-07-14T02:52:05.512796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:52:05.661533image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9958
99.6%
1 42
 
0.4%

Interactions

2024-07-14T02:52:01.636721image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:58.612453image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:59.500078image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:00.158298image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:00.880442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:01.967398image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:59.093787image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:59.604379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:00.281308image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:01.025715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:02.316134image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:59.192777image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:59.713972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/