Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory693.4 KiB
Average record size in memory71.0 B

Variable types

Numeric5
Categorical2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

측정항목 is highly overall correlated with 평균값High correlation
평균값 is highly overall correlated with 측정항목High correlation
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분High correlation
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분High correlation
국가 기준초과 구분 is highly imbalanced (99.6%)Imbalance
지자체 기준초과 구분 is highly imbalanced (99.6%)Imbalance
측정기 상태 has 9817 (98.2%) zerosZeros

Reproduction

Analysis started2024-07-13 17:50:26.580717
Analysis finished2024-07-13 17:50:31.002524
Duration4.42 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

측정일시
Real number (ℝ)

Distinct667
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0090115 × 109
Minimum2.0090101 × 109
Maximum2.0090128 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:50:31.095883image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0090101 × 109
5-th percentile2.0090102 × 109
Q12.0090108 × 109
median2.0090115 × 109
Q32.0090121 × 109
95-th percentile2.0090127 × 109
Maximum2.0090128 × 109
Range2718
Interquartile range (IQR)1322

Descriptive statistics

Standard deviation802.32628
Coefficient of variation (CV)3.9936371 × 10-7
Kurtosis-1.2043121
Mean2.0090115 × 109
Median Absolute Deviation (MAD)698
Skewness-0.014393156
Sum2.0090115 × 1013
Variance643727.46
MonotonicityNot monotonic
2024-07-14T02:50:31.334596image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2009012016 30
 
0.3%
2009012313 29
 
0.3%
2009012223 26
 
0.3%
2009012409 26
 
0.3%
2009011516 25
 
0.2%
2009011204 25
 
0.2%
2009010312 25
 
0.2%
2009011211 24
 
0.2%
2009010511 24
 
0.2%
2009010108 24
 
0.2%
Other values (657) 9742
97.4%
ValueCountFrequency (%)
2009010100 14
0.1%
2009010101 18
0.2%
2009010102 19
0.2%
2009010103 17
0.2%
2009010104 15
0.1%
2009010105 11
0.1%
2009010106 11
0.1%
2009010107 12
0.1%
2009010108 24
0.2%
2009010109 16
0.2%
ValueCountFrequency (%)
2009012818 14
0.1%
2009012817 16
0.2%
2009012816 19
0.2%
2009012815 21
0.2%
2009012814 17
0.2%
2009012813 13
0.1%
2009012812 12
0.1%
2009012811 11
0.1%
2009012810 19
0.2%
2009012809 18
0.2%

측정소 코드
Real number (ℝ)

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean112.9395
Minimum101
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:50:31.516869image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile102
Q1107
median113
Q3119
95-th percentile124
Maximum125
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.217884
Coefficient of variation (CV)0.063909297
Kurtosis-1.2133301
Mean112.9395
Median Absolute Deviation (MAD)6
Skewness0.012823717
Sum1129395
Variance52.09785
MonotonicityNot monotonic
2024-07-14T02:50:31.680800image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
108 427
 
4.3%
118 426
 
4.3%
105 422
 
4.2%
112 420
 
4.2%
103 420
 
4.2%
119 412
 
4.1%
104 412
 
4.1%
109 410
 
4.1%
101 408
 
4.1%
120 401
 
4.0%
Other values (15) 5842
58.4%
ValueCountFrequency (%)
101 408
4.1%
102 377
3.8%
103 420
4.2%
104 412
4.1%
105 422
4.2%
106 389
3.9%
107 399
4.0%
108 427
4.3%
109 410
4.1%
110 394
3.9%
ValueCountFrequency (%)
125 392
3.9%
124 400
4.0%
123 390
3.9%
122 399
4.0%
121 392
3.9%
120 401
4.0%
119 412
4.1%
118 426
4.3%
117 400
4.0%
116 381
3.8%

측정항목
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3846
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:50:31.818040image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.7282645
Coefficient of variation (CV)0.50667914
Kurtosis-1.1845429
Mean5.3846
Median Absolute Deviation (MAD)2
Skewness-0.22145389
Sum53846
Variance7.4434272
MonotonicityNot monotonic
2024-07-14T02:50:31.979767image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
9 1699
17.0%
5 1685
16.9%
3 1683
16.8%
8 1679
16.8%
6 1679
16.8%
1 1575
15.8%
ValueCountFrequency (%)
1 1575
15.8%
3 1683
16.8%
5 1685
16.9%
6 1679
16.8%
8 1679
16.8%
9 1699
17.0%
ValueCountFrequency (%)
9 1699
17.0%
8 1679
16.8%
6 1679
16.8%
5 1685
16.9%
3 1683
16.8%
1 1575
15.8%

평균값
Real number (ℝ)

HIGH CORRELATION 

Distinct285
Distinct (%)2.9%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean7.3652652
Minimum-9999
Maximum18482
Zeros16
Zeros (%)0.2%
Negative26
Negative (%)0.3%
Memory size166.0 KiB
2024-07-14T02:50:32.191708image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-9999
5-th percentile0.003
Q10.013
median0.25
Q323
95-th percentile70
Maximum18482
Range28481
Interquartile range (IQR)22.987

Descriptive statistics

Standard deviation354.20509
Coefficient of variation (CV)48.091287
Kurtosis1311.2661
Mean7.3652652
Median Absolute Deviation (MAD)0.248
Skewness-6.1108291
Sum73652.652
Variance125461.24
MonotonicityNot monotonic
2024-07-14T02:50:32.370218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.005 296
 
3.0%
0.006 290
 
2.9%
0.004 260
 
2.6%
0.007 258
 
2.6%
0.003 249
 
2.5%
0.002 244
 
2.4%
0.4 226
 
2.3%
0.5 206
 
2.1%
0.008 184
 
1.8%
0.6 174
 
1.7%
Other values (275) 7613
76.1%
ValueCountFrequency (%)
-9999.0 9
 
0.1%
-999.9 4
 
< 0.1%
-9.999 13
 
0.1%
0.0 16
 
0.2%
0.001 105
 
1.1%
0.002 244
2.4%
0.003 249
2.5%
0.004 260
2.6%
0.005 296
3.0%
0.006 290
2.9%
ValueCountFrequency (%)
18482.0 1
< 0.1%
681.0 1
< 0.1%
354.0 1
< 0.1%
206.0 1
< 0.1%
185.0 1
< 0.1%
180.0 1
< 0.1%
169.0 2
< 0.1%
168.0 1
< 0.1%
167.0 1
< 0.1%
163.0 1
< 0.1%

측정기 상태
Real number (ℝ)

ZEROS 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.0798
Minimum0
Maximum9
Zeros9817
Zeros (%)98.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:50:32.533796image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile0
Maximum9
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation0.7541146
Coefficient of variation (CV)9.4500576
Kurtosis120.88903
Mean0.0798
Median Absolute Deviation (MAD)0
Skewness10.867409
Sum798
Variance0.56868883
MonotonicityNot monotonic
2024-07-14T02:50:32.641207image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 9817
98.2%
1 66
 
0.7%
9 52
 
0.5%
2 32
 
0.3%
8 17
 
0.2%
4 16
 
0.2%
ValueCountFrequency (%)
0 9817
98.2%
1 66
 
0.7%
2 32
 
0.3%
4 16
 
0.2%
8 17
 
0.2%
9 52
 
0.5%
ValueCountFrequency (%)
9 52
 
0.5%
8 17
 
0.2%
4 16
 
0.2%
2 32
 
0.3%
1 66
 
0.7%
0 9817
98.2%

국가 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9997 
1
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9997
> 99.9%
1 3
 
< 0.1%

Length

2024-07-14T02:50:32.765034image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:50:32.888964image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9997
> 99.9%
1 3
 
< 0.1%

지자체 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9997 
1
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9997
> 99.9%
1 3
 
< 0.1%

Length

2024-07-14T02:50:33.016426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:50:33.117247image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9997
> 99.9%
1 3
 
< 0.1%

Interactions

2024-07-14T02:50:30.261630image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:27.777842image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:28.483282image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:29.147252image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:29.722836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:30.370564image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:27.884990image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:28.616521image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:29.253780image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:29.834677image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:30.512018image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:28.016160image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:50:28.757065image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/