Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 693.4 KiB
Average record size in memory: 71.0 B

Variable types

Numeric: 5
Categorical: 2

Dataset

Description: File download (파일 다운로드)
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

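High correlation: 평균값 (average value) is highly correlated overall with 측정기 상태 (instrument status)
High correlation: 측정기 상태 (instrument status) is highly correlated overall with 평균값 (average value)
High correlation: 국가 기준초과 구분 (national standard exceedance flag) is highly correlated overall with 지자체 기준초과 구분 (local government standard exceedance flag)
High correlation: 지자체 기준초과 구분 (local government standard exceedance flag) is highly correlated overall with 국가 기준초과 구분 (national standard exceedance flag)
Imbalance: 국가 기준초과 구분 (national standard exceedance flag) is highly imbalanced (92.7%)
Imbalance: 지자체 기준초과 구분 (local government standard exceedance flag) is highly imbalanced (86.5%)
Zeros: 평균값 (average value) has 167 (1.7%) zeros
Zeros: 측정기 상태 (instrument status) has 6467 (64.7%) zeros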
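
The imbalance percentages in the alerts above are not the share of the majority class (99.1% and 98.1% in the variable sections). They appear consistent with a score of one minus the normalized Shannon entropy of the class counts. The sketch below is a minimal illustration under that assumption; the function name `imbalance_score` is hypothetical and is not taken from the profiling library.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy (in bits)
    of the class shares and k is the number of classes.
    0 means perfectly balanced, values near 1 mean one class dominates."""
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)

# Class counts as printed in this report (value 0, value 1)
national = imbalance_score([9911, 89])   # 국가 기준초과 구분
local = imbalance_score([9812, 188])     # 지자체 기준초과 구분
print(f"{national:.1%} {local:.1%}")
```

With the counts from this report the two scores round to 92.7% and 86.5%, which agrees with the alert text.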

Reproduction

Analysis started: 2024-06-15 02:47:15.305330
Analysis finished: 2024-06-15 02:47:18.710637
Duration: 3.41 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
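
The reported duration can be checked against the start and finish timestamps above. This is a small arithmetic sketch using only the two values printed in this section.

```python
from datetime import datetime

# Timestamps copied from the Reproduction section
started = datetime.fromisoformat("2024-06-15 02:47:15.305330")
finished = datetime.fromisoformat("2024-06-15 02:47:18.710637")

duration = (finished - started).total_seconds()
print(f"{duration:.2f} seconds")
```

The difference is 3.405307 seconds, which rounds to the 3.41 seconds shown.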

Variables

측정일시 (measurement date/time)
Real number (ℝ)

Distinct: 695
Distinct (%): 7.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.0020115 × 10^9
Minimum: 2.0020101 × 10^9
Maximum: 2.0020129 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2.0020101 × 10^9
5-th percentile: 2.0020102 × 10^9
Q1: 2.0020108 × 10^9
Median: 2.0020115 × 10^9
Q3: 2.0020122 × 10^9
95-th percentile: 2.0020128 × 10^9
Maximum: 2.0020129 × 10^9
Range: 2822
Interquartile range (IQR): 1414.25

Descriptive statistics

Standard deviation: 839.44347
Coefficient of variation (CV): 4.1930002 × 10^-7
Kurtosis: -1.2097289
Mean: 2.0020115 × 10^9
Median Absolute Deviation (MAD): 708
Skewness: 0.0036702056
Sum: 2.0020115 × 10^13
Variance: 704665.34
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value               Count  Frequency (%)
2002011302          29     0.3%
2002012418          28     0.3%
2002010505          26     0.3%
2002010204          26     0.3%
2002011316          26     0.3%
2002011415          25     0.2%
2002011219          24     0.2%
2002011417          23     0.2%
2002011205          23     0.2%
2002012414          23     0.2%
Other values (685)  9747   97.5%

Minimum 10 values

Value               Count  Frequency (%)
2002010100          14     0.1%
2002010101          15     0.1%
2002010102          17     0.2%
2002010103          15     0.1%
2002010104          15     0.1%
2002010105          11     0.1%
2002010106          14     0.1%
2002010107          13     0.1%
2002010108          14     0.1%
2002010109          21     0.2%

Maximum 10 values

Value               Count  Frequency (%)
2002012922          4      < 0.1%
2002012921          11     0.1%
2002012920          18     0.2%
2002012919          15     0.1%
2002012918          15     0.1%
2002012917          14     0.1%
2002012916          17     0.2%
2002012915          13     0.1%
2002012914          13     0.1%
2002012913          13     0.1%
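
The values of 측정일시 look like timestamps packed into integers in YYYYMMDDHH form (for example 2002010100), which would explain why treating the column as a real number gives a mean, variance, and CV with little practical meaning. The sketch below decodes the smallest and largest values under that assumption. The encoding is inferred from the listed values, not documented in this report, and `parse_measurement_time` is a hypothetical helper name.

```python
from datetime import datetime

def parse_measurement_time(code: int) -> datetime:
    """Decode an integer such as 2002010100, assumed to be YYYYMMDDHH
    with the hour in the range 00-23."""
    return datetime.strptime(str(code), "%Y%m%d%H")

first = parse_measurement_time(2002010100)  # minimum value in the report
last = parse_measurement_time(2002012922)   # maximum value in the report

# Number of whole hours between the first and last measurement
span_hours = int((last - first).total_seconds() // 3600)
print(first.isoformat(), last.isoformat(), span_hours)
```

Under this reading the data runs from 2002-01-01 00:00 to 2002-01-29 22:00, a span of 694 hours, which gives 695 hourly slots and matches the 695 distinct values reported.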

측정소 코드 (monitoring station code)
Real number (ℝ)

Distinct: 24
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 113.581
Minimum: 102
Maximum: 125
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 102
5-th percentile: 103
Q1: 108
Median: 114
Q3: 120
95-th percentile: 124
Maximum: 125
Range: 23
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 6.968206
Coefficient of variation (CV): 0.061350102
Kurtosis: -1.2156902
Mean: 113.581
Median Absolute Deviation (MAD): 6
Skewness: -0.0075926518
Sum: 1135810
Variance: 48.555895
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=24)]

Common values

Value               Count  Frequency (%)
104                 472    4.7%
125                 450    4.5%
117                 441    4.4%
124                 439    4.4%
109                 437    4.4%
120                 431    4.3%
116                 429    4.3%
114                 426    4.3%
103                 423    4.2%
121                 423    4.2%
Other values (14)   5629   56.3%

Minimum 10 values

Value               Count  Frequency (%)
102                 393    3.9%
103                 423    4.2%
104                 472    4.7%
105                 390    3.9%
106                 404    4.0%
107                 416    4.2%
108                 377    3.8%
109                 437    4.4%
110                 410    4.1%
111                 412    4.1%

Maximum 10 values

Value               Count  Frequency (%)
125                 450    4.5%
124                 439    4.4%
123                 422    4.2%
122                 423    4.2%
121                 423    4.2%
120                 431    4.3%
119                 375    3.8%
118                 397    4.0%
117                 441    4.4%
116                 429    4.3%

측정항목 (measurement item)
Real number (ℝ)

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.3332
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 5
Q3: 8
95-th percentile: 9
Maximum: 9
Range: 8
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.7476777
Coefficient of variation (CV): 0.51520245
Kurtosis: -1.2111228
Mean: 5.3332
Median Absolute Deviation (MAD): 2
Skewness: -0.19919707
Sum: 53332
Variance: 7.5497327
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=6)]

Common values

Value               Count  Frequency (%)
3                   1694   16.9%
5                   1686   16.9%
9                   1673   16.7%
8                   1668   16.7%
1                   1651   16.5%
6                   1628   16.3%

Minimum values

Value               Count  Frequency (%)
1                   1651   16.5%
3                   1694   16.9%
5                   1686   16.9%
6                   1628   16.3%
8                   1668   16.7%
9                   1673   16.7%

Maximum values

Value               Count  Frequency (%)
9                   1673   16.7%
8                   1668   16.7%
6                   1628   16.3%
5                   1686   16.9%
3                   1694   16.9%
1                   1651   16.5%

평균값 (average value)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 353
Distinct (%): 3.5%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -1575.6169
Minimum: -9999
Maximum: 510
Zeros: 167
Zeros (%): 1.7%
Negative: 2941
Negative (%): 29.4%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -9999
5-th percentile: -9999
Q1: -9.999
Median: 0.01
Q3: 0.6
95-th percentile: 75
Maximum: 510
Range: 10509
Interquartile range (IQR): 10.599

Descriptive statistics

Standard deviation: 3612.8287
Coefficient of variation (CV): -2.2929613
Kurtosis: 1.61289
Mean: -1575.6169
Median Absolute Deviation (MAD): 1.19
Skewness: -1.8960921
Sum: -15756169
Variance: 13052531
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common values

Value               Count  Frequency (%)
-9999.0             1550   15.5%
-9.999              1030   10.3%
-999.9              358    3.6%
0.002               349    3.5%
0.003               317    3.2%
0.001               297    3.0%
0.004               234    2.3%
0.3                 176    1.8%
0.005               168    1.7%
0.0                 167    1.7%
Other values (343)  5354   53.5%

Minimum 10 values

Value               Count  Frequency (%)
-9999.0             1550   15.5%
-999.9              358    3.6%
-13.0               1      < 0.1%
-9.999              1030   10.3%
-3.0                1      < 0.1%
-0.6                1      < 0.1%
0.0                 167    1.7%
0.001               297    3.0%
0.002               349    3.5%
0.003               317    3.2%

Maximum 10 values

Value               Count  Frequency (%)
510.0               1      < 0.1%
490.0               1      < 0.1%
409.0               1      < 0.1%
316.0               1      < 0.1%
282.0               1      < 0.1%
257.0               1      < 0.1%
254.0               1      < 0.1%
253.0               1      < 0.1%
246.0               2      < 0.1%
242.0               1      < 0.1%
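
The three most frequent values of 평균값 (-9999.0, -9.999, and -999.9) look like placeholder codes for missing or invalid readings and do not look like real measurements. Together they cover 2938 of the 10000 rows and pull the mean down to -1575.6 while the median stays at 0.01. The report itself does not document what these codes mean, so the sketch below treats them as sentinels purely as an assumption. The sample list and the helper names are hypothetical.

```python
import math
from statistics import median

# Assumed placeholder codes; their meaning is not documented in the report.
SENTINELS = (-9999.0, -999.9, -9.999)

def mask_sentinels(values):
    """Replace assumed sentinel codes with NaN so they can be skipped later."""
    return [
        math.nan if any(math.isclose(v, s) for s in SENTINELS) else v
        for v in values
    ]

def valid_only(values):
    """Drop NaN entries, keeping the original order."""
    return [v for v in values if not math.isnan(v)]

# Hypothetical sample that mixes sentinel codes with plausible readings
sample = [-9999.0, 0.002, -9.999, 0.3, -999.9, 75.0, 0.0]
cleaned = valid_only(mask_sentinels(sample))
print(cleaned, median(cleaned))
```

Masking before summarizing keeps genuine zeros and the three isolated small negative readings (-13.0, -3.0, -0.6) in place, so those can be reviewed separately.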

측정기 상태 (instrument status)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.1324
Minimum: 0
Maximum: 9
Zeros: 6467
Zeros (%): 64.7%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 4
95-th percentile: 8
Maximum: 9
Range: 9
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 3.154943
Coefficient of variation (CV): 1.4795268
Kurtosis: -0.5776123
Mean: 2.1324
Median Absolute Deviation (MAD): 0
Skewness: 1.046635
Sum: 21324
Variance: 9.9536656
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=6)]

Common values

Value               Count  Frequency (%)
0                   6467   64.7%
8                   1845   18.4%
4                   1515   15.2%
2                   115    1.1%
1                   31     0.3%
9                   27     0.3%

Minimum values

Value               Count  Frequency (%)
0                   6467   64.7%
1                   31     0.3%
2                   115    1.1%
4                   1515   15.2%
8                   1845   18.4%
9                   27     0.3%

Maximum values

Value               Count  Frequency (%)
9                   27     0.3%
8                   1845   18.4%
4                   1515   15.2%
2                   115    1.1%
1                   31     0.3%
0                   6467   64.7%
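
Because 측정기 상태 takes only six values, its summary statistics can be recomputed directly from the frequency table above. The sketch below does that as a consistency check. It assumes the reported variance is the sample variance (dividing by n - 1), which is an inference from the numbers and is not stated in the report.

```python
# Frequency table of 측정기 상태 as printed in the report: value -> count
status_counts = {0: 6467, 1: 31, 2: 115, 4: 1515, 8: 1845, 9: 27}

n = sum(status_counts.values())
total = sum(value * count for value, count in status_counts.items())
mean = total / n

# Sample variance: squared deviations weighted by count, divided by n - 1
variance = sum(
    count * (value - mean) ** 2 for value, count in status_counts.items()
) / (n - 1)

# Share of records whose status code is anything other than 0
nonzero_share = 1 - status_counts[0] / n

print(n, total, round(mean, 4), round(variance, 7), f"{nonzero_share:.1%}")
```

The recomputed sum (21324), mean (2.1324), and variance (about 9.9536656) agree with the figures reported for this variable, and roughly 35.3% of records carry a non-zero status code.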

국가 기준초과 구분 (national standard exceedance flag)
Categorical

Alerts: High correlation, Imbalance

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Value counts: 0 (9911), 1 (89)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value               Count  Frequency (%)
0                   9911   99.1%
1                   89     0.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot; 0: 9911 (99.1%), 1: 89 (0.9%)]

지자체 기준초과 구분 (local government standard exceedance flag)
Categorical

Alerts: High correlation, Imbalance

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB
Value counts: 0 (9812), 1 (188)

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value               Count  Frequency (%)
0                   9812   98.1%
1                   188    1.9%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: Common values plot; 0: 9812 (98.1%), 1: 188 (1.9%)]

Interactions

[Figures: pairwise interaction plots of the variables]