Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 0
Missing cells (%): 0.0%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 693.4 KiB
Average record size in memory: 71.0 B

Variable types

Numeric: 5
Categorical: 2

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

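High correlation: 측정항목 (measurement item) is highly correlated overall with 평균값 (average value) and 1 other field
High correlation: 평균값 (average value) is highly correlated overall with 측정항목 (measurement item) and 1 other field
High correlation: 측정기 상태 (instrument status) is highly correlated overall with 측정항목 (measurement item) and 1 other field
Imbalance: 국가 기준초과 구분 (national standard exceedance flag) is highly imbalanced (83.4%)
Imbalance: 지자체 기준초과 구분 (local government standard exceedance flag) is highly imbalanced (96.6%)
Zeros: 평균값 (average value) has 661 (6.6%) zeros
Zeros: 측정기 상태 (instrument status) has 5975 (59.8%) zeros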
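
The imbalance percentages in the alerts are consistent with one minus the normalized Shannon entropy of the class counts. The sketch below assumes that definition; it is not taken from the report itself, and the function name `imbalance_score` is hypothetical. The counts are the 0/1 counts listed for the two categorical variables in this report.

```python
import math

def imbalance_score(counts):
    """Return 1 - H / H_max, where H is the Shannon entropy of the class counts.

    Assumption: this matches the imbalance figure shown in the alerts.
    The name and signature are illustrative only.
    """
    total = sum(counts)
    n_classes = len(counts)
    if n_classes < 2 or total == 0:
        return 0.0
    probs = [c / total for c in counts if c > 0]
    entropy = -sum(p * math.log2(p) for p in probs)
    return 1.0 - entropy / math.log2(n_classes)

national = imbalance_score([9755, 245])  # 국가 기준초과 구분
local = imbalance_score([9964, 36])      # 지자체 기준초과 구분
print(f"{national:.1%}  {local:.1%}")
```

Under this assumption the two scores come out close to the 83.4% and 96.6% reported in the alerts.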

Reproduction

Analysis started: 2024-07-13 17:53:02.003786
Analysis finished: 2024-07-13 17:53:05.237994
Duration: 3.23 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

측정일시 (measurement date and time)
Real number (ℝ)

Distinct: 2069
Distinct (%): 20.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.9900212 × 10⁹
Minimum: 1.9900101 × 10⁹
Maximum: 1.9900328 × 10⁹
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1.9900101 × 10⁹
5-th percentile: 1.9900105 × 10⁹
Q1: 1.9900122 × 10⁹
Median: 1.9900213 × 10⁹
Q3: 1.9900307 × 10⁹
95-th percentile: 1.9900324 × 10⁹
Maximum: 1.9900328 × 10⁹
Range: 22719
Interquartile range (IQR): 18494

Descriptive statistics

Standard deviation: 8195.4532
Coefficient of variation (CV): 4.1182744 × 10⁻⁶
Kurtosis: -1.4832269
Mean: 1.9900212 × 10⁹
Median Absolute Deviation (MAD): 9207.5
Skewness: 0.06734399
Sum: 1.9900212 × 10¹³
Variance: 67165453
Monotonicity: Not monotonic
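
Some of these descriptive statistics can be derived from the others: the coefficient of variation is the standard deviation divided by the mean, and the variance is the square of the standard deviation. As a rough consistency check, the sketch below recomputes both from the rounded figures shown for 측정일시. The variable names are illustrative.

```python
# Figures copied from the report for 측정일시 (rounded as displayed).
mean = 1.9900212e9
std = 8195.4532
reported_cv = 4.1182744e-6
reported_variance = 67165453

cv = std / mean        # coefficient of variation
variance = std ** 2    # variance as the square of the standard deviation

print(f"CV       = {cv:.7e} (reported {reported_cv:.7e})")
print(f"variance = {variance:.0f} (reported {reported_variance})")
```

Small differences in the last digits are expected because the inputs are rounded.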
Histogram with fixed size bins (bins=50)
Common values
Value                 Count   Frequency (%)
1990031014            12      0.1%
1990021215            11      0.1%
1990012418            11      0.1%
1990012812            11      0.1%
1990020207            11      0.1%
1990011617            11      0.1%
1990010221            11      0.1%
1990030808            11      0.1%
1990031000            11      0.1%
1990032723            11      0.1%
Other values (2059)   9889    98.9%

Smallest values
Value                 Count   Frequency (%)
1990010100            8       0.1%
1990010101            4       < 0.1%
1990010102            2       < 0.1%
1990010103            6       0.1%
1990010104            2       < 0.1%
1990010105            4       < 0.1%
1990010106            4       < 0.1%
1990010107            5       0.1%
1990010108            6       0.1%
1990010109            5       0.1%

Largest values
Value                 Count   Frequency (%)
1990032819            3       < 0.1%
1990032818            6       0.1%
1990032817            1       < 0.1%
1990032816            5       0.1%
1990032815            3       < 0.1%
1990032814            4       < 0.1%
1990032813            5       0.1%
1990032812            9       0.1%
1990032811            6       0.1%
1990032810            3       < 0.1%

측정소 코드 (station code)
Real number (ℝ)

Distinct: 8
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 112.462
Minimum: 103
Maximum: 124
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 103
5-th percentile: 103
Q1: 107
Median: 113
Q3: 122
95-th percentile: 124
Maximum: 124
Range: 21
Interquartile range (IQR): 15

Descriptive statistics

Standard deviation: 7.4306714
Coefficient of variation (CV): 0.06607273
Kurtosis: -1.3970382
Mean: 112.462
Median Absolute Deviation (MAD): 6
Skewness: 0.31615821
Sum: 1124620
Variance: 55.214877
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=8)
Common values
Value   Count   Frequency (%)
122     1278    12.8%
117     1271    12.7%
124     1264    12.6%
107     1250    12.5%
103     1246    12.5%
108     1246    12.5%
113     1235    12.3%
105     1210    12.1%

Smallest values
Value   Count   Frequency (%)
103     1246    12.5%
105     1210    12.1%
107     1250    12.5%
108     1246    12.5%
113     1235    12.3%
117     1271    12.7%
122     1278    12.8%
124     1264    12.6%

Largest values
Value   Count   Frequency (%)
124     1264    12.6%
122     1278    12.8%
117     1271    12.7%
113     1235    12.3%
108     1246    12.5%
107     1250    12.5%
105     1210    12.1%
103     1246    12.5%

측정항목 (measurement item)
Real number (ℝ)

Alerts: High correlation

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.3508
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 6
Q3: 8
95-th percentile: 9
Maximum: 9
Range: 8
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.7519623
Coefficient of variation (CV): 0.51430858
Kurtosis: -1.2092529
Mean: 5.3508
Median Absolute Deviation (MAD): 2
Skewness: -0.22595971
Sum: 53508
Variance: 7.5732967
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Common values
Value   Count   Frequency (%)
8       1763    17.6%
1       1678    16.8%
6       1677    16.8%
3       1629    16.3%
9       1628    16.3%
5       1625    16.2%

Smallest values
Value   Count   Frequency (%)
1       1678    16.8%
3       1629    16.3%
5       1625    16.2%
6       1677    16.8%
8       1763    17.6%
9       1628    16.3%

Largest values
Value   Count   Frequency (%)
9       1628    16.3%
8       1763    17.6%
6       1677    16.8%
5       1625    16.2%
3       1629    16.3%
1       1678    16.8%

평균값 (average value)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 520
Distinct (%): 5.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: -3191.2337
Minimum: -9999
Maximum: 264
Zeros: 661
Zeros (%): 6.6%
Negative: 3883
Negative (%): 38.8%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -9999
5-th percentile: -9999
Q1: -9999
Median: 0.007
Q3: 0.058
95-th percentile: 4.8
Maximum: 264
Range: 10263
Interquartile range (IQR): 9999.058

Descriptive statistics

Standard deviation: 4641.5627
Coefficient of variation (CV): -1.4544728
Kurtosis: -1.3831276
Mean: -3191.2337
Median Absolute Deviation (MAD): 1.993
Skewness: -0.78316133
Sum: -31912337
Variance: 21544105
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=50)
Common values
Value                Count   Frequency (%)
-9999.0              3171    31.7%
0.0                  661     6.6%
-9.999               489     4.9%
-999.9               221     2.2%
0.001                129     1.3%
0.002                80      0.8%
0.004                76      0.8%
0.003                71      0.7%
0.031                71      0.7%
0.016                70      0.7%
Other values (510)   4961    49.6%

Smallest values
Value                Count   Frequency (%)
-9999.0              3171    31.7%
-999.9               221     2.2%
-9.999               489     4.9%
-0.041               1       < 0.1%
-0.038               1       < 0.1%
0.0                  661     6.6%
0.001                129     1.3%
0.002                80      0.8%
0.003                71      0.7%
0.004                76      0.8%

Largest values
Value                Count   Frequency (%)
264.0                1       < 0.1%
240.0                1       < 0.1%
231.0                1       < 0.1%
200.0                1       < 0.1%
195.0                1       < 0.1%
188.0                1       < 0.1%
186.0                2       < 0.1%
185.0                1       < 0.1%
173.0                1       < 0.1%
169.0                1       < 0.1%
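
The values -9999.0, -999.9 and -9.999 together account for 3881 of the 3883 negative entries in 평균값, and they pull the mean down to -3191.2337 even though the median is 0.007. The sketch below treats those three values as missing-data sentinels before summarizing. That interpretation is an assumption, since the report does not say what they mean, and the sample series is a small illustration built from values listed above rather than the real column.

```python
import numpy as np
import pandas as pd

# Illustrative sample built from values that appear in the report.
values = pd.Series([-9999.0, -999.9, -9.999, 0.0, 0.007, 0.058, 264.0], name="평균값")

# Assumption: these three values mark missing readings, not measurements.
SENTINELS = [-9999.0, -999.9, -9.999]

cleaned = values.replace(SENTINELS, np.nan)

print(values.mean())         # dominated by the sentinel values
print(cleaned.mean())        # mean over the remaining readings only
print(cleaned.isna().sum())  # number of entries now treated as missing
```

Whether the two small negative readings (-0.041 and -0.038) are valid is a separate question that this sketch leaves untouched.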

측정기 상태 (instrument status)
Real number (ℝ)

Alerts: High correlation, Zeros

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.6924
Minimum: 0
Maximum: 9
Zeros: 5975
Zeros (%): 59.8%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 4
95-th percentile: 4
Maximum: 9
Range: 9
Interquartile range (IQR): 4

Descriptive statistics

Standard deviation: 2.2367572
Coefficient of variation (CV): 1.321648
Kurtosis: 0.29207083
Mean: 1.6924
Median Absolute Deviation (MAD): 0
Skewness: 1.0073086
Sum: 16924
Variance: 5.0030825
Monotonicity: Not monotonic
Histogram with fixed size bins (bins=6)
Common values
Value   Count   Frequency (%)
0       5975    59.8%
4       3394    33.9%
2       284     2.8%
8       207     2.1%
9       123     1.2%
1       17      0.2%

Smallest values
Value   Count   Frequency (%)
0       5975    59.8%
1       17      0.2%
2       284     2.8%
4       3394    33.9%
8       207     2.1%
9       123     1.2%

Largest values
Value   Count   Frequency (%)
9       123     1.2%
8       207     2.1%
4       3394    33.9%
2       284     2.8%
1       17      0.2%
0       5975    59.8%
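
측정기 상태 takes only six distinct values (0, 1, 2, 4, 8, 9), which suggests status codes rather than a quantity, so numeric summaries such as the mean of 1.6924 may not be meaningful. The sketch below rebuilds a column with the reported counts and stores it as a pandas categorical. Treating the codes as labels is an assumption; the report does not document their meaning.

```python
import pandas as pd

# Counts copied from the value table above (status code -> number of rows).
counts = pd.Series({0: 5975, 1: 17, 2: 284, 4: 3394, 8: 207, 9: 123})

# Rebuild a column with those frequencies and store it as a categorical.
status = pd.Series(counts.index.repeat(counts.values), name="측정기 상태").astype("category")

# Share of rows per status code, in code order.
shares = status.value_counts(normalize=True).sort_index()
print(status.dtype)
print((shares * 100).round(1))
```

With a categorical dtype, a profiling tool would typically report frequencies for this column instead of moments and quantiles.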

국가 기준초과 구분 (national standard exceedance flag)
Categorical

Alerts: Imbalance

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value   Count   Frequency (%)
0       9755    97.5%
1       245     2.5%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; counts as in the Common Values table (0: 9755, 1: 245)]

지자체 기준초과 구분 (local government standard exceedance flag)
Categorical

Alerts: Imbalance

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value   Count   Frequency (%)
0       9964    99.6%
1       36      0.4%

Length

[Figure: histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of common values; counts as in the Common Values table (0: 9964, 1: 36)]

Interactions

[Figures: 13 interaction plots between variable pairs; images not preserved in this text version]