Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 693.4 KiB
Average record size in memory: 71.0 B
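
The average record size follows from the two figures above it: total memory divided by the number of observations. A minimal check, assuming 1 KiB = 1024 bytes (the variable names are illustrative, not taken from the report):

```python
# Figures copied from the "Dataset statistics" block.
total_kib = 693.4      # total size in memory, in KiB
n_rows = 10_000        # number of observations

# Convert KiB to bytes, then divide by the row count.
avg_record_bytes = total_kib * 1024 / n_rows
print(f"{avg_record_bytes:.1f} B")
```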

Variable types

Numeric: 5
Categorical: 2

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

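High correlation: 평균값 (mean value) is highly overall correlated with 측정기 상태 (instrument status)
High correlation: 측정기 상태 (instrument status) is highly overall correlated with 평균값 (mean value)
High correlation: 국가 기준초과 구분 (national standard exceedance flag) is highly overall correlated with 지자체 기준초과 구분 (local government standard exceedance flag)
High correlation: 지자체 기준초과 구분 (local government standard exceedance flag) is highly overall correlated with 국가 기준초과 구분 (national standard exceedance flag)
Imbalance: 국가 기준초과 구분 (national standard exceedance flag) is highly imbalanced (96.2%)
Imbalance: 지자체 기준초과 구분 (local government standard exceedance flag) is highly imbalanced (92.7%)
Zeros: 평균값 (mean value) has 140 (1.4%) zeros
Zeros: 측정기 상태 (instrument status) has 8048 (80.5%) zeros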
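
The imbalance percentages are not the share of the majority class (99.6% and 99.1% in the per-variable tables). One formula that reproduces both figures is one minus the normalized Shannon entropy of the class shares. That the profiler computes the score exactly this way is an assumption here, and `imbalance_score` is a hypothetical helper, not a library function. A minimal sketch using the category counts reported for the two flags:

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), with H the Shannon entropy (bits) of the class shares."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # a single class has no entropy to normalize
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Category counts from the two "기준초과 구분" variables (values 0 and 1).
national = imbalance_score([9960, 40])
local = imbalance_score([9912, 88])
print(f"{national:.1%}, {local:.1%}")  # prints 96.2%, 92.7%
```

A score of 0% would mean perfectly balanced classes and 100% a single class, so both flags are close to constant.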

Reproduction

Analysis started: 2024-07-13 17:51:04.715802
Analysis finished: 2024-07-13 17:51:09.234525
Duration: 4.52 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
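
The reported duration is the difference between the two timestamps. A small check with the standard library (the variable names are illustrative):

```python
from datetime import datetime

# Timestamps copied from the "Reproduction" block.
fmt = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2024-07-13 17:51:04.715802", fmt)
finished = datetime.strptime("2024-07-13 17:51:09.234525", fmt)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```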

Variables

측정일시 (measurement date and time)
Real number (ℝ)

Distinct: 667
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.0050115 × 10^9
Minimum: 2.0050101 × 10^9
Maximum: 2.0050128 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2.0050101 × 10^9
5-th percentile: 2.0050102 × 10^9
Q1: 2.0050107 × 10^9
Median: 2.0050114 × 10^9
Q3: 2.0050121 × 10^9
95-th percentile: 2.0050127 × 10^9
Maximum: 2.0050128 × 10^9
Range: 2718
Interquartile range (IQR): 1398

Descriptive statistics

Standard deviation: 804.63188
Coefficient of variation (CV): 4.0131036 × 10^-7
Kurtosis: -1.2010823
Mean: 2.0050115 × 10^9
Median Absolute Deviation (MAD): 699
Skewness: -0.0019011191
Sum: 2.0050115 × 10^13
Variance: 647432.46
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
2005010423           28      0.3%
2005010111           26      0.3%
2005011915           24      0.2%
2005012404           24      0.2%
2005012723           24      0.2%
2005012800           24      0.2%
2005011120           23      0.2%
2005011604           23      0.2%
2005011512           23      0.2%
2005011312           23      0.2%
Other values (657)   9758    97.6%

Minimum 10 values

Value        Count   Frequency (%)
2005010100   11      0.1%
2005010101   21      0.2%
2005010102   13      0.1%
2005010103   12      0.1%
2005010104   16      0.2%
2005010105   20      0.2%
2005010106   14      0.1%
2005010107   16      0.2%
2005010108   18      0.2%
2005010109   17      0.2%

Maximum 10 values

Value        Count   Frequency (%)
2005012818   10      0.1%
2005012817   15      0.1%
2005012816   20      0.2%
2005012815   16      0.2%
2005012814   17      0.2%
2005012813   16      0.2%
2005012812   11      0.1%
2005012811   15      0.1%
2005012810   14      0.1%
2005012809   16      0.2%
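
The values of 측정일시 look like hourly timestamps packed into an integer as YYYYMMDDHH, which is why the profiler treats the column as a real number and why statistics such as Range and Variance carry little meaning for it. The sketch below parses the reported minimum and maximum under that assumed layout; `parse_measurement_time` is a hypothetical helper, not part of the dataset or the profiler.

```python
from datetime import datetime, timedelta

def parse_measurement_time(value: int) -> datetime:
    """Parse an integer laid out as YYYYMMDDHH (assumed format) into a datetime."""
    return datetime.strptime(str(value), "%Y%m%d%H")

# Minimum and maximum taken from the extreme-value tables above.
first = parse_measurement_time(2005010100)
last = parse_measurement_time(2005012818)

# Number of hourly slots between the two, inclusive of both ends.
span_hours = (last - first) // timedelta(hours=1) + 1
print(first, last, span_hours)
```

Under this reading the data covers 1 January 2005 00:00 through 28 January 2005 18:00, and the inclusive hour count equals the 667 distinct values reported, which suggests every hour in that window appears at least once.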

측정소 코드 (monitoring station code)
Real number (ℝ)

Distinct: 25
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 112.9646
Minimum: 101
Maximum: 125
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 101
5-th percentile: 102
Q1: 107
Median: 113
Q3: 119
95-th percentile: 124
Maximum: 125
Range: 24
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 7.1697341
Coefficient of variation (CV): 0.063468858
Kurtosis: -1.1881138
Mean: 112.9646
Median Absolute Deviation (MAD): 6
Skewness: 0.015028674
Sum: 1129646
Variance: 51.405087
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=25)
Common values

Value               Count   Frequency (%)
109                 431     4.3%
106                 419     4.2%
114                 419     4.2%
113                 416     4.2%
118                 415     4.2%
110                 414     4.1%
116                 413     4.1%
121                 413     4.1%
108                 406     4.1%
107                 405     4.0%
Other values (15)   5849    58.5%

Minimum 10 values

Value   Count   Frequency (%)
101     398     4.0%
102     384     3.8%
103     391     3.9%
104     394     3.9%
105     389     3.9%
106     419     4.2%
107     405     4.0%
108     406     4.1%
109     431     4.3%
110     414     4.1%

Maximum 10 values

Value   Count   Frequency (%)
125     389     3.9%
124     402     4.0%
123     396     4.0%
122     396     4.0%
121     413     4.1%
120     363     3.6%
119     384     3.8%
118     415     4.2%
117     374     3.7%
116     413     4.1%
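
The "Other values" row in the common-values listing is simply what remains after the ten most frequent station codes are subtracted from the total. A quick check with the counts above (the variable names are illustrative):

```python
# Counts of the ten most frequent station codes, in the order listed.
top10_counts = [431, 419, 419, 416, 415, 414, 413, 413, 406, 405]
n_rows = 10_000       # number of observations
n_distinct = 25       # distinct station codes

other_count = n_rows - sum(top10_counts)
other_distinct = n_distinct - len(top10_counts)
other_share = other_count / n_rows
print(other_distinct, other_count, f"{other_share:.1%}")
```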

측정항목 (measurement item)
Real number (ℝ)

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.3369
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 6
Q3: 8
95-th percentile: 9
Maximum: 9
Range: 8
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.7478998
Coefficient of variation (CV): 0.51488689
Kurtosis: -1.2140786
Mean: 5.3369
Median Absolute Deviation (MAD): 3
Skewness: -0.20356343
Sum: 53369
Variance: 7.5509535
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Common values

Value   Count   Frequency (%)
3       1705    17.1%
8       1697    17.0%
9       1658    16.6%
6       1648    16.5%
1       1648    16.5%
5       1644    16.4%

Minimum values

Value   Count   Frequency (%)
1       1648    16.5%
3       1705    17.1%
5       1644    16.4%
6       1648    16.5%
8       1697    17.0%
9       1658    16.6%

Maximum values

Value   Count   Frequency (%)
9       1658    16.6%
8       1697    17.0%
6       1648    16.5%
5       1644    16.4%
3       1705    17.1%
1       1648    16.5%

평균값 (mean value)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 311
Distinct (%): 3.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -987.67267
Minimum: -9999
Maximum: 952
Zeros: 140
Zeros (%): 1.4%
Negative: 1752
Negative (%): 17.5%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -9999
5-th percentile: -9999
Q1: 0.003
Median: 0.024
Q3: 1.3
95-th percentile: 68
Maximum: 952
Range: 10951
Interquartile range (IQR): 1.297

Descriptive statistics

Standard deviation: 2975.2862
Coefficient of variation (CV): -3.0124213
Kurtosis: 5.2711068
Mean: -987.67267
Median Absolute Deviation (MAD): 0.476
Skewness: -2.6926301
Sum: -9876726.7
Variance: 8852328.1
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values

Value                Count   Frequency (%)
-9999.0              981     9.8%
-9.999               591     5.9%
0.004                284     2.8%
0.003                270     2.7%
0.002                251     2.5%
0.005                237     2.4%
0.001                226     2.3%
0.006                210     2.1%
0.008                196     2.0%
-999.9               179     1.8%
Other values (301)   6575    65.8%

Minimum 10 values

Value     Count   Frequency (%)
-9999.0   981     9.8%
-999.9    179     1.8%
-9.999    591     5.9%
-0.012    1       < 0.1%
0.0       140     1.4%
0.001     226     2.3%
0.002     251     2.5%
0.003     270     2.7%
0.004     284     2.8%
0.005     237     2.4%

Maximum 10 values

Value   Count   Frequency (%)
952.0   1       < 0.1%
822.0   1       < 0.1%
446.0   1       < 0.1%
301.0   1       < 0.1%
243.0   1       < 0.1%
239.0   1       < 0.1%
212.0   1       < 0.1%
192.0   1       < 0.1%
186.0   1       < 0.1%
183.0   1       < 0.1%
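
Three values in the tables above (-9999.0, -999.9 and -9.999) account for all but one of the 1752 negative entries and dominate the mean and standard deviation. They look like placeholder codes rather than real readings, but the report itself does not say so; treat that as an assumption. The sketch below uses only figures from this section to estimate the mean once those rows are set aside; `mask_sentinels` is a hypothetical helper for applying the same idea to raw values.

```python
# Assumption: these values are placeholder codes. Mapping is value -> reported count.
SENTINELS = {-9999.0: 981, -999.9: 179, -9.999: 591}

n_rows = 10_000
reported_sum = -9876726.7   # "Sum" from the descriptive statistics

sentinel_rows = sum(SENTINELS.values())
sentinel_sum = sum(value * count for value, count in SENTINELS.items())

# Mean of the remaining rows, reconstructed from the reported totals.
valid_rows = n_rows - sentinel_rows
mean_without_sentinels = (reported_sum - sentinel_sum) / valid_rows

def mask_sentinels(values):
    """Replace suspected placeholder codes with None (hypothetical helper)."""
    return [None if v in SENTINELS else v for v in values]

print(sentinel_rows, valid_rows, round(mean_without_sentinels, 1))
print(mask_sentinels([-9999.0, 0.004, -9.999, 1.3]))
```

Because the reported Sum is rounded, the reconstructed mean is approximate, but it lands in the low tens rather than near -988, which shows how strongly the three codes distort the summary statistics.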

측정기 상태 (instrument status)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.7793
Minimum: 0
Maximum: 9
Zeros: 8048
Zeros (%): 80.5%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 4
Maximum: 9
Range: 9
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 1.6575483
Coefficient of variation (CV): 2.1269707
Kurtosis: 3.1213882
Mean: 0.7793
Median Absolute Deviation (MAD): 0
Skewness: 1.962304
Sum: 7793
Variance: 2.7474663
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Common values

Value   Count   Frequency (%)
0       8048    80.5%
4       1701    17.0%
2       130     1.3%
8       56      0.6%
1       38      0.4%
9       27      0.3%

Minimum values

Value   Count   Frequency (%)
0       8048    80.5%
1       38      0.4%
2       130     1.3%
4       1701    17.0%
8       56      0.6%
9       27      0.3%

Maximum values

Value   Count   Frequency (%)
9       27      0.3%
8       56      0.6%
4       1701    17.0%
2       130     1.3%
1       38      0.4%
0       8048    80.5%

국가 기준초과 구분 (national standard exceedance flag)
Categorical

Alerts: High correlation, Imbalance

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Counts by category: 0 = 9960, 1 = 40

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value   Count   Frequency (%)
0       9960    99.6%
1       40      0.4%

Length

Histogram of lengths of the category


지자체 기준초과 구분 (local government standard exceedance flag)
Categorical

Alerts: High correlation, Imbalance

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Counts by category: 0 = 9912, 1 = 88

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value   Count   Frequency (%)
0       9912    99.1%
1       88      0.9%

Length

Histogram of lengths of the category


Interactions

(Interaction plots, rendered with Matplotlib v3.7.2, are not reproduced here.)