Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory693.4 KiB
Average record size in memory71.0 B

Variable types

Numeric5
Categorical2

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

평균값 is highly overall correlated with 측정기 상태High correlation
측정기 상태 is highly overall correlated with 평균값High correlation
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분High correlation
지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분High correlation
국가 기준초과 구분 is highly imbalanced (88.8%)Imbalance
지자체 기준초과 구분 is highly imbalanced (83.2%)Imbalance
평균값 has 143 (1.4%) zerosZeros
측정기 상태 has 8166 (81.7%) zerosZeros

Reproduction

Analysis started2024-07-13 17:51:21.935656
Analysis finished2024-07-13 17:51:26.556750
Duration4.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

측정일시
Real number (ℝ)

Distinct664
Distinct (%)6.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0030115 × 109
Minimum2.0030101 × 109
Maximum2.0030128 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:51:26.640534image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0030101 × 109
5-th percentile2.0030102 × 109
Q12.0030107 × 109
median2.0030115 × 109
Q32.0030121 × 109
95-th percentile2.0030127 × 109
Maximum2.0030128 × 109
Range2715
Interquartile range (IQR)1398

Descriptive statistics

Standard deviation800.05667
Coefficient of variation (CV)3.9942691 × 10-7
Kurtosis-1.2087289
Mean2.0030115 × 109
Median Absolute Deviation (MAD)698
Skewness-0.0063984639
Sum2.0030115 × 1013
Variance640090.67
MonotonicityNot monotonic
2024-07-14T02:51:26.844813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2003012705 27
 
0.3%
2003011622 26
 
0.3%
2003010519 26
 
0.3%
2003010503 25
 
0.2%
2003010904 25
 
0.2%
2003012312 25
 
0.2%
2003011303 25
 
0.2%
2003012713 24
 
0.2%
2003010214 24
 
0.2%
2003012412 24
 
0.2%
Other values (654) 9749
97.5%
ValueCountFrequency (%)
2003010100 11
0.1%
2003010101 15
0.1%
2003010102 14
0.1%
2003010103 11
0.1%
2003010104 14
0.1%
2003010105 14
0.1%
2003010106 14
0.1%
2003010107 17
0.2%
2003010108 11
0.1%
2003010109 10
0.1%
ValueCountFrequency (%)
2003012815 8
0.1%
2003012814 13
0.1%
2003012813 12
0.1%
2003012812 14
0.1%
2003012811 12
0.1%
2003012810 14
0.1%
2003012809 18
0.2%
2003012808 13
0.1%
2003012807 11
0.1%
2003012806 14
0.1%

측정소 코드
Real number (ℝ)

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean113.0022
Minimum101
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:51:27.015062image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile102
Q1107
median113
Q3119
95-th percentile124
Maximum125
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.2241549
Coefficient of variation (CV)0.06392933
Kurtosis-1.2085816
Mean113.0022
Median Absolute Deviation (MAD)6
Skewness-0.0086892374
Sum1130022
Variance52.188414
MonotonicityNot monotonic
2024-07-14T02:51:27.184620image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
124 434
 
4.3%
105 423
 
4.2%
123 419
 
4.2%
115 418
 
4.2%
117 415
 
4.2%
112 414
 
4.1%
101 414
 
4.1%
120 412
 
4.1%
109 408
 
4.1%
104 405
 
4.0%
Other values (15) 5838
58.4%
ValueCountFrequency (%)
101 414
4.1%
102 398
4.0%
103 401
4.0%
104 405
4.0%
105 423
4.2%
106 374
3.7%
107 396
4.0%
108 380
3.8%
109 408
4.1%
110 383
3.8%
ValueCountFrequency (%)
125 363
3.6%
124 434
4.3%
123 419
4.2%
122 402
4.0%
121 392
3.9%
120 412
4.1%
119 376
3.8%
118 388
3.9%
117 415
4.2%
116 402
4.0%

측정항목
Real number (ℝ)

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3864
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:51:27.335330image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.7563481
Coefficient of variation (CV)0.51172362
Kurtosis-1.2117676
Mean5.3864
Median Absolute Deviation (MAD)3
Skewness-0.22217301
Sum53864
Variance7.5974548
MonotonicityNot monotonic
2024-07-14T02:51:27.484784image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
9 1744
17.4%
8 1707
17.1%
5 1684
16.8%
3 1661
16.6%
1 1623
16.2%
6 1581
15.8%
ValueCountFrequency (%)
1 1623
16.2%
3 1661
16.6%
5 1684
16.8%
6 1581
15.8%
8 1707
17.1%
9 1744
17.4%
ValueCountFrequency (%)
9 1744
17.4%
8 1707
17.1%
6 1581
15.8%
5 1684
16.8%
3 1661
16.6%
1 1623
16.2%

평균값
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct369
Distinct (%)3.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean-1077.1439
Minimum-9999
Maximum1069
Zeros143
Zeros (%)1.4%
Negative1717
Negative (%)17.2%
Memory size166.0 KiB
2024-07-14T02:51:27.733300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-9999
5-th percentile-9999
Q10.003
median0.033
Q32
95-th percentile96
Maximum1069
Range11068
Interquartile range (IQR)1.997

Descriptive statistics

Standard deviation3105.7146
Coefficient of variation (CV)-2.8832866
Kurtosis4.3664667
Mean-1077.1439
Median Absolute Deviation (MAD)0.667
Skewness-2.5200107
Sum-10771439
Variance9645463
MonotonicityNot monotonic
2024-07-14T02:51:27.913004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
-9999.0 1079
 
10.8%
-9.999 488
 
4.9%
0.002 301
 
3.0%
0.003 247
 
2.5%
0.004 235
 
2.4%
0.001 211
 
2.1%
0.005 205
 
2.1%
0.007 189
 
1.9%
0.006 171
 
1.7%
0.008 163
 
1.6%
Other values (359) 6711
67.1%
ValueCountFrequency (%)
-9999.0 1079
10.8%
-999.9 150
 
1.5%
-9.999 488
4.9%
0.0 143
 
1.4%
0.001 211
 
2.1%
0.002 301
 
3.0%
0.003 247
 
2.5%
0.004 235
 
2.4%
0.005 205
 
2.1%
0.006 171
 
1.7%
ValueCountFrequency (%)
1069.0 1
< 0.1%
849.0 1
< 0.1%
304.0 1
< 0.1%
294.0 1
< 0.1%
284.0 1
< 0.1%
279.0 1
< 0.1%
273.0 1
< 0.1%
271.0 1
< 0.1%
258.0 2
< 0.1%
257.0 1
< 0.1%

측정기 상태
Real number (ℝ)

HIGH CORRELATION  ZEROS 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.6997
Minimum0
Maximum9
Zeros8166
Zeros (%)81.7%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:51:28.075004image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0
Q10
median0
Q30
95-th percentile4
Maximum9
Range9
Interquartile range (IQR)0

Descriptive statistics

Standard deviation1.5356939
Coefficient of variation (CV)2.194789
Kurtosis2.7985534
Mean0.6997
Median Absolute Deviation (MAD)0
Skewness1.9570127
Sum6997
Variance2.3583557
MonotonicityNot monotonic
2024-07-14T02:51:28.234314image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
0 8166
81.7%
4 1593
 
15.9%
2 150
 
1.5%
1 61
 
0.6%
9 24
 
0.2%
8 6
 
0.1%
ValueCountFrequency (%)
0 8166
81.7%
1 61
 
0.6%
2 150
 
1.5%
4 1593
 
15.9%
8 6
 
0.1%
9 24
 
0.2%
ValueCountFrequency (%)
9 24
 
0.2%
8 6
 
0.1%
4 1593
 
15.9%
2 150
 
1.5%
1 61
 
0.6%
0 8166
81.7%

국가 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9851 
1
 
149

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9851
98.5%
1 149
 
1.5%

Length

2024-07-14T02:51:28.436943image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:51:28.580523image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9851
98.5%
1 149
 
1.5%

지자체 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9752 
1
 
248

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9752
97.5%
1 248
 
2.5%

Length

2024-07-14T02:51:28.725251image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:51:28.871226image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9752
97.5%
1 248
 
2.5%

Interactions

2024-07-14T02:51:25.256917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:22.726954image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:23.224783image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:23.833621image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:24.560745image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:25.404300image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:22.830808image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:23.336166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:23.944426image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:24.690969image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:25.555057image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:22.932100image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:51:23.448668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/