Overview

Dataset statistics

Number of variables: 7
Number of observations: 10000
Missing cells: 3
Missing cells (%): < 0.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 693.4 KiB
Average record size in memory: 71.0 B

Variable types

Numeric: 5
Categorical: 2

Dataset

Description: File download
Author: Seoul Metropolitan Government (서울특별시)
URL: https://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

지자체 기준초과 구분 is highly overall correlated with 평균값 and 1 other field [High correlation]
국가 기준초과 구분 is highly overall correlated with 평균값 and 1 other field [High correlation]
측정항목 is highly overall correlated with 평균값 [High correlation]
평균값 is highly overall correlated with 측정항목 and 2 other fields [High correlation]
국가 기준초과 구분 is highly imbalanced (85.7%) [Imbalance]
지자체 기준초과 구분 is highly imbalanced (85.7%) [Imbalance]
측정기 상태 has 9848 (98.5%) zeros [Zeros]
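
The 85.7% imbalance figure can be reproduced from the class counts reported for the two categorical variables (9797 zeros, 203 ones). The sketch below assumes the score is one minus the Shannon entropy of the class frequencies, normalised by the maximum entropy for that number of classes. That formula is an assumption about how the tool derives the number, and `imbalance_score` is a hypothetical helper name, not part of any library.

```python
import math


def imbalance_score(counts):
    """Return 1 - H / log2(k) for a list of class counts.

    H is the Shannon entropy (in bits) of the class frequencies and k is the
    number of classes. 0 means perfectly balanced, values near 1 mean that
    one class dominates. Assumed formula, see the lead-in text.
    """
    total = sum(counts)
    k = len(counts)
    if k < 2:
        return 0.0  # assumption: fewer than two classes is treated as balanced
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1.0 - entropy / math.log2(k)


# Counts of the values 0 and 1 in 국가 기준초과 구분 and 지자체 기준초과 구분
score = imbalance_score([9797, 203])
print(f"{score:.1%}")  # prints 85.7%
```

Under this assumption the score matches the alert, which suggests the imbalance percentage describes how far the 98.0% / 2.0% split is from an even split, not the share of the majority class.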

Reproduction

Analysis started: 2024-07-13 17:53:26.424087
Analysis finished: 2024-07-13 17:53:30.023583
Duration: 3.6 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

측정일시 (measurement date and time)
Real number (ℝ)

Distinct: 667
Distinct (%): 6.7%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 2.0160115 × 10^9
Minimum: 2.0160101 × 10^9
Maximum: 2.0160128 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 2.0160101 × 10^9
5-th percentile: 2.0160102 × 10^9
Q1: 2.0160107 × 10^9
Median: 2.0160114 × 10^9
Q3: 2.0160122 × 10^9
95-th percentile: 2.0160127 × 10^9
Maximum: 2.0160128 × 10^9
Range: 2718
Interquartile range (IQR): 1477.25

Descriptive statistics

Standard deviation: 804.49491
Coefficient of variation (CV): 3.9905275 × 10^-7
Kurtosis: -1.2150712
Mean: 2.0160115 × 10^9
Median Absolute Deviation (MAD): 700
Skewness: 0.0022557742
Sum: 2.0160115 × 10^13
Variance: 647212.06
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value               Count   Frequency (%)
2016010714          26      0.3%
2016011618          25      0.2%
2016011110          24      0.2%
2016012809          23      0.2%
2016010815          23      0.2%
2016012413          23      0.2%
2016010606          23      0.2%
2016011601          23      0.2%
2016012517          23      0.2%
2016012707          23      0.2%
Other values (657)  9764    97.6%

Minimum values

Value               Count   Frequency (%)
2016010100          20      0.2%
2016010101          8       0.1%
2016010102          13      0.1%
2016010103          9       0.1%
2016010104          11      0.1%
2016010105          15      0.1%
2016010106          9       0.1%
2016010107          18      0.2%
2016010108          21      0.2%
2016010109          20      0.2%

Maximum values

Value               Count   Frequency (%)
2016012818          12      0.1%
2016012817          13      0.1%
2016012816          18      0.2%
2016012815          12      0.1%
2016012814          14      0.1%
2016012813          16      0.2%
2016012812          22      0.2%
2016012811          8       0.1%
2016012810          12      0.1%
2016012809          23      0.2%
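
The values of 측정일시 look like timestamps packed into integers as YYYYMMDDHH, which would explain why the report treats the column as a real number and why the numeric range (2718) is not a duration. The sketch below decodes the reported minimum and maximum under that assumed layout. `decode_measurement_time` is a hypothetical helper, and hours are assumed to run from 00 to 23, as in the values listed above.

```python
from datetime import datetime


def decode_measurement_time(value):
    """Decode an integer such as 2016010714 as YYYYMMDDHH (assumed layout)."""
    return datetime.strptime(str(int(value)), "%Y%m%d%H")


first = decode_measurement_time(2016010100)  # reported minimum
last = decode_measurement_time(2016012818)   # reported maximum

# Arithmetic on the raw integers versus real elapsed time
numeric_range = 2016012818 - 2016010100
elapsed_hours = (last - first).total_seconds() / 3600

print(first.isoformat(), last.isoformat())
print(numeric_range, elapsed_hours)  # prints 2718 666.0
```

If the layout assumption holds, the span from the first to the last observation is 666 hours, which gives 667 hourly slots and matches the 667 distinct values reported for this variable. Statistics such as the mean, variance, and skewness computed on the raw integers have no physical meaning for such a column.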

측정소 코드 (station code)
Real number (ℝ)

Distinct: 25
Distinct (%): 0.2%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 113.0163
Minimum: 101
Maximum: 125
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 101
5-th percentile: 102
Q1: 107
Median: 113
Q3: 119
95-th percentile: 124
Maximum: 125
Range: 24
Interquartile range (IQR): 12

Descriptive statistics

Standard deviation: 7.2057634
Coefficient of variation (CV): 0.063758621
Kurtosis: -1.2114217
Mean: 113.0163
Median Absolute Deviation (MAD): 6
Skewness: -0.0014936829
Sum: 1130163
Variance: 51.923027
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=25)]

Common Values

Value               Count   Frequency (%)
122                 443     4.4%
119                 431     4.3%
123                 426     4.3%
111                 423     4.2%
105                 419     4.2%
107                 415     4.2%
108                 412     4.1%
117                 410     4.1%
112                 407     4.1%
101                 400     4.0%
Other values (15)   5814    58.1%

Minimum values

Value               Count   Frequency (%)
101                 400     4.0%
102                 391     3.9%
103                 391     3.9%
104                 384     3.8%
105                 419     4.2%
106                 392     3.9%
107                 415     4.2%
108                 412     4.1%
109                 400     4.0%
110                 393     3.9%

Maximum values

Value               Count   Frequency (%)
125                 387     3.9%
124                 375     3.8%
123                 426     4.3%
122                 443     4.4%
121                 393     3.9%
120                 393     3.9%
119                 431     4.3%
118                 371     3.7%
117                 410     4.1%
116                 387     3.9%

측정항목 (measurement item)
Real number (ℝ)

HIGH CORRELATION

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 5.359
Minimum: 1
Maximum: 9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 1
5-th percentile: 1
Q1: 3
Median: 6
Q3: 8
95-th percentile: 9
Maximum: 9
Range: 8
Interquartile range (IQR): 5

Descriptive statistics

Standard deviation: 2.7323736
Coefficient of variation (CV): 0.50986632
Kurtosis: -1.1942088
Mean: 5.359
Median Absolute Deviation (MAD): 2
Skewness: -0.21364747
Sum: 53590
Variance: 7.4658656
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=6)]

Common Values

Value               Count   Frequency (%)
8                   1702    17.0%
3                   1691    16.9%
5                   1679    16.8%
6                   1666    16.7%
9                   1656    16.6%
1                   1606    16.1%

Minimum values

Value               Count   Frequency (%)
1                   1606    16.1%
3                   1691    16.9%
5                   1679    16.8%
6                   1666    16.7%
8                   1702    17.0%
9                   1656    16.6%

Maximum values

Value               Count   Frequency (%)
9                   1656    16.6%
8                   1702    17.0%
6                   1666    16.7%
5                   1679    16.8%
3                   1691    16.9%
1                   1606    16.1%

평균값 (average value)
Real number (ℝ)

HIGH CORRELATION

Distinct: 452
Distinct (%): 4.5%
Missing: 3
Missing (%): < 0.1%
Infinite: 0
Infinite (%): 0.0%
Mean: 13.279394
Minimum: -190
Maximum: 231
Zeros: 44
Zeros (%): 0.4%
Negative: 3
Negative (%): < 0.1%
Memory size: 166.0 KiB

Quantile statistics

Minimum: -190
5-th percentile: 0.004
Q1: 0.012
Median: 0.1
Q3: 23
95-th percentile: 62
Maximum: 231
Range: 421
Interquartile range (IQR): 22.988

Descriptive statistics

Standard deviation: 23.525597
Coefficient of variation (CV): 1.7715866
Kurtosis: 8.4828746
Mean: 13.279394
Median Absolute Deviation (MAD): 0.1
Skewness: 2.2703991
Sum: 132754.1
Variance: 553.4537
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=50)]

Common Values

Value               Count   Frequency (%)
0.005               476     4.8%
0.006               464     4.6%
0.007               368     3.7%
0.004               274     2.7%
0.008               194     1.9%
0.002               185     1.8%
0.003               146     1.5%
0.01                103     1.0%
0.009               96      1.0%
0.012               93      0.9%
Other values (442)  7598    76.0%

Minimum values

Value               Count   Frequency (%)
-190.0              1       < 0.1%
-85.0               1       < 0.1%
-1.0                1       < 0.1%
0.0                 44      0.4%
0.001               51      0.5%
0.002               185     1.8%
0.003               146     1.5%
0.004               274     2.7%
0.005               476     4.8%
0.006               464     4.6%

Maximum values

Value               Count   Frequency (%)
231.0               1       < 0.1%
223.0               1       < 0.1%
215.0               1       < 0.1%
197.0               1       < 0.1%
193.0               1       < 0.1%
190.0               1       < 0.1%
181.0               2       < 0.1%
179.0               1       < 0.1%
177.0               1       < 0.1%
173.0               1       < 0.1%
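
The summary figures for 평균값 are related by simple identities, and recomputing them is a quick way to confirm how the three missing values are handled. The sketch below uses only numbers copied from this section. The variable names and the tolerance are illustrative choices, and it assumes the mean is taken over non-missing rows only.

```python
import math

# Figures copied from the 평균값 statistics
n_rows, n_missing = 10000, 3
total, mean, std = 132754.1, 13.279394, 23.525597
minimum, maximum, q1, q3 = -190, 231, 0.012, 23

n_valid = n_rows - n_missing  # assumption: missing values are excluded

# name -> (recomputed value, value printed in the report)
checks = {
    "mean": (total / n_valid, mean),
    "variance": (std ** 2, 553.4537),
    "cv": (std / mean, 1.7715866),
    "range": (maximum - minimum, 421),
    "iqr": (q3 - q1, 22.988),
}

# The tolerance absorbs the rounding of the printed figures
results = {
    name: math.isclose(recomputed, reported, rel_tol=1e-5)
    for name, (recomputed, reported) in checks.items()
}
for name, ok in results.items():
    print(f"{name}: {'consistent' if ok else 'inconsistent'}")
```

All five identities hold with 9997 valid rows, so the reported mean is the sum divided by the non-missing count, not by the 10000 observations. The large gap between the median (0.1) and the mean (13.279394) fits the reported correlation with 측정항목: the column appears to mix measurements on very different scales.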

측정기 상태 (instrument status)
Real number (ℝ)

ZEROS

Distinct: 6
Distinct (%): 0.1%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 0.049
Minimum: 0
Maximum: 9
Zeros: 9848
Zeros (%): 98.5%
Negative: 0
Negative (%): 0.0%
Memory size: 166.0 KiB

Quantile statistics

Minimum: 0
5-th percentile: 0
Q1: 0
Median: 0
Q3: 0
95-th percentile: 0
Maximum: 9
Range: 9
Interquartile range (IQR): 0

Descriptive statistics

Standard deviation: 0.56588959
Coefficient of variation (CV): 11.548767
Kurtosis: 223.35788
Mean: 0.049
Median Absolute Deviation (MAD): 0
Skewness: 14.622422
Sum: 490
Variance: 0.32023102
Monotonicity: Not monotonic

[Figure: Histogram with fixed size bins (bins=6)]

Common Values

Value               Count   Frequency (%)
0                   9848    98.5%
1                   80      0.8%
9                   34      0.3%
2                   30      0.3%
4                   5       0.1%
8                   3       < 0.1%

Minimum values

Value               Count   Frequency (%)
0                   9848    98.5%
1                   80      0.8%
2                   30      0.3%
4                   5       0.1%
8                   3       < 0.1%
9                   34      0.3%

Maximum values

Value               Count   Frequency (%)
9                   34      0.3%
8                   3       < 0.1%
4                   5       0.1%
2                   30      0.3%
1                   80      0.8%
0                   9848    98.5%
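
Because 측정기 상태 takes only six values, its summary statistics can be rebuilt from the frequency table alone. The sketch below does this with the counts listed above. It assumes the variance uses the sample (n - 1) denominator, which is the pandas default and may or may not be what the report uses.

```python
import math

# Value -> count, copied from the 측정기 상태 frequency table
counts = {0: 9848, 1: 80, 2: 30, 4: 5, 8: 3, 9: 34}

n = sum(counts.values())
total = sum(value * count for value, count in counts.items())
mean = total / n

# Sample variance with an n - 1 denominator (assumed, see the lead-in)
variance = sum(count * (value - mean) ** 2 for value, count in counts.items()) / (n - 1)
std = math.sqrt(variance)
zero_share = counts[0] / n

print(n, total, mean)
print(round(variance, 8), round(std, 8), round(std / mean, 6))
print(f"{zero_share:.1%}")  # prints 98.5%
```

With the n - 1 denominator the recomputed variance and standard deviation agree with the reported 0.32023102 and 0.56588959. The values behave like status codes more than quantities, so the frequency table is more informative than the moments.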

국가 기준초과 구분 (national standard exceedance flag)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 1

Common Values

Value               Count   Frequency (%)
0                   9797    98.0%
1                   203     2.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value               Count   Frequency (%)
0                   9797    98.0%
1                   203     2.0%

지자체 기준초과 구분 (local government standard exceedance flag)
Categorical

HIGH CORRELATION, IMBALANCE

Distinct: 2
Distinct (%): < 0.1%
Missing: 0
Missing (%): 0.0%
Memory size: 156.2 KiB

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 1

Common Values

Value               Count   Frequency (%)
0                   9797    98.0%
1                   203     2.0%

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

Value               Count   Frequency (%)
0                   9797    98.0%
1                   203     2.0%

Interactions

[Figures: pairwise interaction plots]