Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory693.4 KiB
Average record size in memory71.0 B

Variable types

Numeric5
Categorical2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

측정항목 is highly overall correlated with 평균값 and 1 other fieldsHigh correlation
평균값 is highly overall correlated with 측정항목 and 1 other fieldsHigh correlation
측정기 상태 is highly overall correlated with 측정항목 and 1 other fieldsHigh correlation
국가 기준초과 구분 is highly imbalanced (94.4%)Imbalance
지자체 기준초과 구분 is highly imbalanced (99.0%)Imbalance
평균값 has 388 (3.9%) zerosZeros
측정기 상태 has 4956 (49.6%) zerosZeros

Reproduction

Analysis started2024-07-13 17:52:47.766336
Analysis finished2024-07-13 17:52:51.708417
Duration3.94 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

측정일시
Real number (ℝ)

Distinct1686
Distinct (%)16.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9920178 × 109
Minimum1.9920101 × 109
Maximum1.9920311 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:52:51.781937image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1.9920101 × 109
5-th percentile1.9920104 × 109
Q11.9920116 × 109
median1.9920201 × 109
Q31.992022 × 109
95-th percentile1.9920307 × 109
Maximum1.9920311 × 109
Range21006
Interquartile range (IQR)10408

Descriptive statistics

Standard deviation6765.5041
Coefficient of variation (CV)3.3963071 × 10-6
Kurtosis-0.92419063
Mean1.9920178 × 109
Median Absolute Deviation (MAD)7706
Skewness0.54219701
Sum1.9920178 × 1013
Variance45772046
MonotonicityNot monotonic
2024-07-14T02:52:51.917950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1992011902 14
 
0.1%
1992012805 14
 
0.1%
1992031011 13
 
0.1%
1992011113 13
 
0.1%
1992021120 13
 
0.1%
1992012313 13
 
0.1%
1992030903 13
 
0.1%
1992022815 13
 
0.1%
1992010915 13
 
0.1%
1992011705 13
 
0.1%
Other values (1676) 9868
98.7%
ValueCountFrequency (%)
1992010100 6
0.1%
1992010101 4
 
< 0.1%
1992010102 6
0.1%
1992010103 8
0.1%
1992010104 4
 
< 0.1%
1992010105 7
0.1%
1992010106 5
0.1%
1992010107 6
0.1%
1992010108 6
0.1%
1992010109 10
0.1%
ValueCountFrequency (%)
1992031106 5
0.1%
1992031105 2
 
< 0.1%
1992031104 7
0.1%
1992031103 6
0.1%
1992031102 4
< 0.1%
1992031101 2
 
< 0.1%
1992031100 6
0.1%
1992031023 6
0.1%
1992031022 7
0.1%
1992031021 7
0.1%

측정소 코드
Real number (ℝ)

Distinct11
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean112.3014
Minimum103
Maximum124
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:52:52.030256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum103
5-th percentile103
Q1106
median111
Q3117
95-th percentile124
Maximum124
Range21
Interquartile range (IQR)11

Descriptive statistics

Standard deviation6.8651199
Coefficient of variation (CV)0.061131205
Kurtosis-1.1961433
Mean112.3014
Median Absolute Deviation (MAD)5
Skewness0.32871404
Sum1123014
Variance47.129871
MonotonicityNot monotonic
2024-07-14T02:52:52.134661image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=11)
ValueCountFrequency (%)
108 1060
10.6%
113 1019
10.2%
116 1017
10.2%
103 1015
10.2%
117 995
10.0%
105 991
9.9%
122 985
9.8%
107 980
9.8%
124 979
9.8%
106 495
5.0%
ValueCountFrequency (%)
103 1015
10.2%
105 991
9.9%
106 495
5.0%
107 980
9.8%
108 1060
10.6%
111 464
4.6%
113 1019
10.2%
116 1017
10.2%
117 995
10.0%
122 985
9.8%
ValueCountFrequency (%)
124 979
9.8%
122 985
9.8%
117 995
10.0%
116 1017
10.2%
113 1019
10.2%
111 464
4.6%
108 1060
10.6%
107 980
9.8%
106 495
5.0%
105 991
9.9%

측정항목
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3441
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:52:52.239637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.7671033
Coefficient of variation (CV)0.51778659
Kurtosis-1.2224025
Mean5.3441
Median Absolute Deviation (MAD)3
Skewness-0.21179532
Sum53441
Variance7.6568609
MonotonicityNot monotonic
2024-07-14T02:52:52.370807image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
9 1707
17.1%
1 1695
17.0%
8 1678
16.8%
6 1643
16.4%
3 1642
16.4%
5 1635
16.4%
ValueCountFrequency (%)
1 1695
17.0%
3 1642
16.4%
5 1635
16.4%
6 1643
16.4%
8 1678
16.8%
9 1707
17.1%
ValueCountFrequency (%)
9 1707
17.1%
8 1678
16.8%
6 1643
16.4%
5 1635
16.4%
3 1642
16.4%
1 1695
17.0%

평균값
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct320
Distinct (%)3.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-3233.4003
Minimum-9999
Maximum184
Zeros388
Zeros (%)3.9%
Negative4659
Negative (%)46.6%
Memory size166.0 KiB
2024-07-14T02:52:52.552769image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-9999
5-th percentile-9999
Q1-9999
median0
Q30.042
95-th percentile2.5
Maximum184
Range10183
Interquartile range (IQR)9999.042

Descriptive statistics

Standard deviation4639.9477
Coefficient of variation (CV)-1.4350056
Kurtosis-1.4021549
Mean-3233.4003
Median Absolute Deviation (MAD)3.3
Skewness-0.76926309
Sum-32334003
Variance21529115
MonotonicityNot monotonic
2024-07-14T02:52:52.695801image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-9999.0 3195
31.9%
-9.999 1080
 
10.8%
0.0 388
 
3.9%
-999.9 382
 
3.8%
0.001 151
 
1.5%
0.002 101
 
1.0%
0.003 74
 
0.7%
0.028 71
 
0.7%
0.02 71
 
0.7%
0.009 69
 
0.7%
Other values (310) 4418
44.2%
ValueCountFrequency (%)
-9999.0 3195
31.9%
-999.9 382
 
3.8%
-10.015 1
 
< 0.1%
-9.999 1080
 
10.8%
-0.005 1
 
< 0.1%
0.0 388
 
3.9%
0.001 151
 
1.5%
0.002 101
 
1.0%
0.003 74
 
0.7%
0.004 66
 
0.7%
ValueCountFrequency (%)
184.0 1
< 0.1%
168.0 1
< 0.1%
161.0 1
< 0.1%
150.0 1
< 0.1%
136.0 1
< 0.1%
135.0 1
< 0.1%
125.0 1
< 0.1%
113.0 1
< 0.1%
86.0 1
< 0.1%
78.0 1
< 0.1%

측정기 상태
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean1.9541
Minimum0
Maximum9
Zeros4956
Zeros (%)49.6%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:52:52.809648image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median2
Q34
95-th percentile4
Maximum9
Range9
Interquartile range (IQR)4

Descriptive statistics

Standard deviation1.9940388
Coefficient of variation (CV)1.0204385
Kurtosis-1.5844115
Mean1.9541
Median Absolute Deviation (MAD)2
Skewness0.16018361
Sum19541
Variance3.9761908
MonotonicityNot monotonic
2024-07-14T02:52:52.904009image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 4956
49.6%
4 4633
46.3%
2 367
 
3.7%
9 28
 
0.3%
1 15
 
0.1%
8 1
 
< 0.1%
ValueCountFrequency (%)
0 4956
49.6%
1 15
 
0.1%
2 367
 
3.7%
4 4633
46.3%
8 1
 
< 0.1%
9 28
 
0.3%
ValueCountFrequency (%)
9 28
 
0.3%
8 1
 
< 0.1%
4 4633
46.3%
2 367
 
3.7%
1 15
 
0.1%
0 4956
49.6%

국가 기준초과 구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9936 
1
 
64

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9936
99.4%
1 64
 
0.6%

Length

2024-07-14T02:52:53.019385image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:52:53.132063image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9936
99.4%
1 64
 
0.6%

지자체 기준초과 구분
Categorical

IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9991 
1
 
9

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9991
99.9%
1 9
 
0.1%

Length

2024-07-14T02:52:53.262612image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:52:53.387335image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9991
99.9%
1 9
 
0.1%

Interactions

2024-07-14T02:52:50.996336image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:48.666637image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:49.205697image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:49.690184image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:50.463589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:51.103030image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:48.793360image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:49.297503image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:49.778938image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:50.568072image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:51.212255image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:48.894809image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:52:49.409379image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/