Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory693.4 KiB
Average record size in memory71.0 B

Variable types

Numeric4
Categorical3

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분High correlation
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분High correlation
측정항목 is highly overall correlated with 평균값High correlation
평균값 is highly overall correlated with 측정항목High correlation
측정기 상태 is highly imbalanced (93.4%)Imbalance
국가 기준초과 구분 is highly imbalanced (68.1%)Imbalance
지자체 기준초과 구분 is highly imbalanced (68.1%)Imbalance
평균값 is highly skewed (γ1 = -29.76138363)Skewed

Reproduction

Analysis started2024-07-13 17:53:08.910455
Analysis finished2024-07-13 17:53:11.350323
Duration2.44 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

측정일시
Real number (ℝ)

Distinct667
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0220115 × 109
Minimum2.0220101 × 109
Maximum2.0220128 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:53:11.432408image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0220101 × 109
5-th percentile2.0220102 × 109
Q12.0220108 × 109
median2.0220115 × 109
Q32.0220121 × 109
95-th percentile2.0220127 × 109
Maximum2.0220128 × 109
Range2718
Interquartile range (IQR)1320

Descriptive statistics

Standard deviation800.0819
Coefficient of variation (CV)3.9568614 × 10-7
Kurtosis-1.1989923
Mean2.0220115 × 109
Median Absolute Deviation (MAD)698
Skewness-0.014465321
Sum2.0220115 × 1013
Variance640131.05
MonotonicityNot monotonic
2024-07-14T02:53:11.572143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2022011512 27
 
0.3%
2022010715 25
 
0.2%
2022010509 25
 
0.2%
2022012518 24
 
0.2%
2022012813 24
 
0.2%
2022012310 23
 
0.2%
2022011000 23
 
0.2%
2022012606 23
 
0.2%
2022010208 23
 
0.2%
2022011606 23
 
0.2%
Other values (657) 9760
97.6%
ValueCountFrequency (%)
2022010100 11
0.1%
2022010101 11
0.1%
2022010102 7
0.1%
2022010103 16
0.2%
2022010104 12
0.1%
2022010105 9
0.1%
2022010106 16
0.2%
2022010107 7
0.1%
2022010108 10
0.1%
2022010109 9
0.1%
ValueCountFrequency (%)
2022012818 4
 
< 0.1%
2022012817 12
0.1%
2022012816 15
0.1%
2022012815 10
0.1%
2022012814 15
0.1%
2022012813 24
0.2%
2022012812 16
0.2%
2022012811 21
0.2%
2022012810 16
0.2%
2022012809 15
0.1%

측정소 코드
Real number (ℝ)

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean113.0118
Minimum101
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:53:11.693028image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile102
Q1107
median113
Q3119
95-th percentile124
Maximum125
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.2099972
Coefficient of variation (CV)0.063798623
Kurtosis-1.1906106
Mean113.0118
Median Absolute Deviation (MAD)6
Skewness-0.011054007
Sum1130118
Variance51.984059
MonotonicityNot monotonic
2024-07-14T02:53:11.819007image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
118 441
 
4.4%
101 439
 
4.4%
113 433
 
4.3%
105 422
 
4.2%
125 414
 
4.1%
112 413
 
4.1%
122 413
 
4.1%
110 412
 
4.1%
115 409
 
4.1%
119 407
 
4.1%
Other values (15) 5797
58.0%
ValueCountFrequency (%)
101 439
4.4%
102 393
3.9%
103 364
3.6%
104 404
4.0%
105 422
4.2%
106 356
3.6%
107 399
4.0%
108 376
3.8%
109 405
4.0%
110 412
4.1%
ValueCountFrequency (%)
125 414
4.1%
124 386
3.9%
123 386
3.9%
122 413
4.1%
121 379
3.8%
120 400
4.0%
119 407
4.1%
118 441
4.4%
117 382
3.8%
116 387
3.9%

측정항목
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.4015
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:53:11.922130image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median6
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.7592135
Coefficient of variation (CV)0.51082357
Kurtosis-1.2115296
Mean5.4015
Median Absolute Deviation (MAD)3
Skewness-0.23593353
Sum54015
Variance7.6132591
MonotonicityNot monotonic
2024-07-14T02:53:12.025813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
8 1742
17.4%
9 1740
17.4%
3 1645
16.4%
1 1626
16.3%
5 1624
16.2%
6 1623
16.2%
ValueCountFrequency (%)
1 1626
16.3%
3 1645
16.4%
5 1624
16.2%
6 1623
16.2%
8 1742
17.4%
9 1740
17.4%
ValueCountFrequency (%)
9 1740
17.4%
8 1742
17.4%
6 1623
16.2%
5 1624
16.2%
3 1645
16.4%
1 1626
16.3%

평균값
Real number (ℝ)

HIGH CORRELATION  SKEWED 

Distinct241
Distinct (%)2.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.5992849
Minimum-9999
Maximum985
Zeros12
Zeros (%)0.1%
Negative15
Negative (%)0.1%
Memory size166.0 KiB
2024-07-14T02:53:12.153458image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum-9999
5-th percentile0.003
Q10.013
median0.3
Q323
95-th percentile58.05
Maximum985
Range10984
Interquartile range (IQR)22.987

Descriptive statistics

Standard deviation333.15582
Coefficient of variation (CV)128.17211
Kurtosis891.12036
Mean2.5992849
Median Absolute Deviation (MAD)0.297
Skewness-29.761384
Sum25992.849
Variance110992.8
MonotonicityNot monotonic
2024-07-14T02:53:12.291068image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.003 821
 
8.2%
0.004 674
 
6.7%
0.5 297
 
3.0%
0.002 271
 
2.7%
0.4 271
 
2.7%
0.6 265
 
2.6%
0.7 210
 
2.1%
0.005 179
 
1.8%
0.8 166
 
1.7%
0.9 108
 
1.1%
Other values (231) 6738
67.4%
ValueCountFrequency (%)
-9999.0 11
 
0.1%
-9.999 2
 
< 0.1%
-0.094 1
 
< 0.1%
-0.032 1
 
< 0.1%
0.0 12
 
0.1%
0.001 18
 
0.2%
0.002 271
 
2.7%
0.003 821
8.2%
0.004 674
6.7%
0.005 179
 
1.8%
ValueCountFrequency (%)
985.0 3
< 0.1%
547.0 1
 
< 0.1%
305.0 1
 
< 0.1%
165.0 1
 
< 0.1%
150.0 1
 
< 0.1%
148.0 1
 
< 0.1%
147.0 1
 
< 0.1%
145.0 2
< 0.1%
141.0 1
 
< 0.1%
140.0 3
< 0.1%

측정기 상태
Categorical

IMBALANCE 

Distinct5
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9817 
1
 
130
9
 
42
2
 
8
4
 
3

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9817
98.2%
1 130
 
1.3%
9 42
 
0.4%
2 8
 
0.1%
4 3
 
< 0.1%

Length

2024-07-14T02:53:12.414604image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:53:12.513274image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9817
98.2%
1 130
 
1.3%
9 42
 
0.4%
2 8
 
0.1%
4 3
 
< 0.1%

국가 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9420 
1
 
580

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9420
94.2%
1 580
 
5.8%

Length

2024-07-14T02:53:12.617738image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:53:12.701032image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9420
94.2%
1 580
 
5.8%

지자체 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9420 
1
 
580

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9420
94.2%
1 580
 
5.8%

Length

2024-07-14T02:53:12.780609image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:53:12.865262image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9420
94.2%
1 580
 
5.8%

Interactions

2024-07-14T02:53:10.781673image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:09.575972image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:09.925110image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:10.368323image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:10.870283image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:09.655003image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:10.011634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:10.485193image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:10.957676image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:09.745394image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:10.140933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:10.589846image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:53:11.051094image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/