Overview

Dataset statistics

Number of variables7
Number of observations10000
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory693.4 KiB
Average record size in memory71.0 B

Variable types

Numeric4
Categorical3

Dataset

Description파일 다운로드
Author서울특별시
URLhttps://data.seoul.go.kr/dataList/OA-15526/S/1/datasetView.do

Alerts

지자체 기준초과 구분 is highly overall correlated with 국가 기준초과 구분High correlation
국가 기준초과 구분 is highly overall correlated with 지자체 기준초과 구분High correlation
측정항목 is highly overall correlated with 평균값High correlation
평균값 is highly overall correlated with 측정항목High correlation
측정기 상태 is highly imbalanced (95.9%)Imbalance
국가 기준초과 구분 is highly imbalanced (57.8%)Imbalance
지자체 기준초과 구분 is highly imbalanced (57.8%)Imbalance

Reproduction

Analysis started2024-07-13 17:49:13.070533
Analysis finished2024-07-13 17:49:16.750231
Duration3.68 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

측정일시
Real number (ℝ)

Distinct667
Distinct (%)6.7%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2.0190115 × 109
Minimum2.0190101 × 109
Maximum2.0190128 × 109
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:49:16.846358image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2.0190101 × 109
5-th percentile2.0190102 × 109
Q12.0190107 × 109
median2.0190114 × 109
Q32.0190121 × 109
95-th percentile2.0190127 × 109
Maximum2.0190128 × 109
Range2718
Interquartile range (IQR)1397

Descriptive statistics

Standard deviation800.41033
Coefficient of variation (CV)3.9643675 × 10-7
Kurtosis-1.1967253
Mean2.0190115 × 109
Median Absolute Deviation (MAD)698
Skewness0.011934467
Sum2.0190115 × 1013
Variance640656.7
MonotonicityNot monotonic
2024-07-14T02:49:17.047646image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
2019010815 29
 
0.3%
2019010513 29
 
0.3%
2019011810 28
 
0.3%
2019012800 26
 
0.3%
2019010423 26
 
0.3%
2019010521 25
 
0.2%
2019012410 25
 
0.2%
2019011217 25
 
0.2%
2019010910 24
 
0.2%
2019011300 24
 
0.2%
Other values (657) 9739
97.4%
ValueCountFrequency (%)
2019010100 15
0.1%
2019010101 10
0.1%
2019010102 17
0.2%
2019010103 10
0.1%
2019010104 12
0.1%
2019010105 13
0.1%
2019010106 16
0.2%
2019010107 15
0.1%
2019010108 18
0.2%
2019010109 15
0.1%
ValueCountFrequency (%)
2019012818 9
0.1%
2019012817 19
0.2%
2019012816 16
0.2%
2019012815 20
0.2%
2019012814 13
0.1%
2019012813 9
0.1%
2019012812 12
0.1%
2019012811 10
0.1%
2019012810 12
0.1%
2019012809 11
0.1%

측정소 코드
Real number (ℝ)

Distinct25
Distinct (%)0.2%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean113.1727
Minimum101
Maximum125
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:49:17.184382image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum101
5-th percentile102
Q1107
median113
Q3119
95-th percentile124
Maximum125
Range24
Interquartile range (IQR)12

Descriptive statistics

Standard deviation7.1966418
Coefficient of variation (CV)0.06358991
Kurtosis-1.2004449
Mean113.1727
Median Absolute Deviation (MAD)6
Skewness-0.022120531
Sum1131727
Variance51.791654
MonotonicityNot monotonic
2024-07-14T02:49:17.358665image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=25)
ValueCountFrequency (%)
122 435
 
4.3%
125 425
 
4.2%
111 421
 
4.2%
120 417
 
4.2%
124 415
 
4.2%
110 413
 
4.1%
112 413
 
4.1%
113 412
 
4.1%
118 412
 
4.1%
116 409
 
4.1%
Other values (15) 5828
58.3%
ValueCountFrequency (%)
101 372
3.7%
102 360
3.6%
103 406
4.1%
104 407
4.1%
105 396
4.0%
106 376
3.8%
107 399
4.0%
108 386
3.9%
109 373
3.7%
110 413
4.1%
ValueCountFrequency (%)
125 425
4.2%
124 415
4.2%
123 385
3.9%
122 435
4.3%
121 408
4.1%
120 417
4.2%
119 383
3.8%
118 412
4.1%
117 406
4.1%
116 409
4.1%

측정항목
Real number (ℝ)

HIGH CORRELATION 

Distinct6
Distinct (%)0.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean5.3044
Minimum1
Maximum9
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:49:17.490838image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile1
Q13
median5
Q38
95-th percentile9
Maximum9
Range8
Interquartile range (IQR)5

Descriptive statistics

Standard deviation2.7467244
Coefficient of variation (CV)0.51782
Kurtosis-1.2047746
Mean5.3044
Median Absolute Deviation (MAD)2
Skewness-0.19434569
Sum53044
Variance7.5444951
MonotonicityNot monotonic
2024-07-14T02:49:17.639267image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=6)
ValueCountFrequency (%)
6 1725
17.2%
1 1695
17.0%
3 1663
16.6%
5 1658
16.6%
9 1648
16.5%
8 1611
16.1%
ValueCountFrequency (%)
1 1695
17.0%
3 1663
16.6%
5 1658
16.6%
6 1725
17.2%
8 1611
16.1%
9 1648
16.5%
ValueCountFrequency (%)
9 1648
16.5%
8 1611
16.1%
6 1725
17.2%
5 1658
16.6%
3 1663
16.6%
1 1695
17.0%

평균값
Real number (ℝ)

HIGH CORRELATION 

Distinct312
Distinct (%)3.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean18.077028
Minimum0
Maximum1985
Zeros19
Zeros (%)0.2%
Negative0
Negative (%)0.0%
Memory size166.0 KiB
2024-07-14T02:49:17.829436image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile0.003
Q10.009
median0.07
Q324
95-th percentile88
Maximum1985
Range1985
Interquartile range (IQR)23.991

Descriptive statistics

Standard deviation46.527609
Coefficient of variation (CV)2.5738528
Kurtosis468.057
Mean18.077028
Median Absolute Deviation (MAD)0.069
Skewness15.50808
Sum180770.28
Variance2164.8184
MonotonicityNot monotonic
2024-07-14T02:49:18.032575image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
0.004 530
 
5.3%
0.005 520
 
5.2%
0.006 321
 
3.2%
0.003 315
 
3.1%
0.002 293
 
2.9%
0.5 225
 
2.2%
0.007 223
 
2.2%
0.7 207
 
2.1%
0.4 205
 
2.1%
0.6 184
 
1.8%
Other values (302) 6977
69.8%
ValueCountFrequency (%)
0.0 19
 
0.2%
0.001 86
 
0.9%
0.002 293
2.9%
0.003 315
3.1%
0.004 530
5.3%
0.005 520
5.2%
0.006 321
3.2%
0.007 223
2.2%
0.008 172
 
1.7%
0.009 107
 
1.1%
ValueCountFrequency (%)
1985.0 1
 
< 0.1%
985.0 8
0.1%
262.0 1
 
< 0.1%
239.0 1
 
< 0.1%
235.0 1
 
< 0.1%
225.0 1
 
< 0.1%
209.0 1
 
< 0.1%
208.0 1
 
< 0.1%
206.0 1
 
< 0.1%
204.0 1
 
< 0.1%

측정기 상태
Categorical

IMBALANCE 

Distinct3
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9931 
1
 
53
9
 
16

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9931
99.3%
1 53
 
0.5%
9 16
 
0.2%

Length

2024-07-14T02:49:18.204734image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:49:18.306418image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9931
99.3%
1 53
 
0.5%
9 16
 
0.2%

국가 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9144 
1
 
856

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9144
91.4%
1 856
 
8.6%

Length

2024-07-14T02:49:18.414579image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:49:18.531493image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9144
91.4%
1 856
 
8.6%

지자체 기준초과 구분
Categorical

HIGH CORRELATION  IMBALANCE 

Distinct2
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size156.2 KiB
0
9144 
1
 
856

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row0
3rd row1
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 9144
91.4%
1 856
 
8.6%

Length

2024-07-14T02:49:18.669942image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-07-14T02:49:18.777142image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 9144
91.4%
1 856
 
8.6%

Interactions

2024-07-14T02:49:15.843771image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:13.790177image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:14.334085image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:15.243083image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:16.016703image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:13.917910image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:14.451668image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:15.365431image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:16.129337image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:14.045345image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:14.601166image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:15.485589image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-07-14T02:49:16.256049image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/