Overview

Dataset statistics

Number of variables6
Number of observations49
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory2.6 KiB
Average record size in memory54.7 B

Variable types

Numeric3
Text2
Categorical1

Dataset

Description샘플 데이터
Author지디에스컨설팅그룹
URLhttps://www.bigdata-environment.kr/user/data_market/detail.do?id=2a2d9710-2e00-11ea-9713-eb3e5186fb38

Alerts

전국 미세먼지 수치 has constant value ""Constant
화력발전소 미세먼지 수치 is highly overall correlated with 화력발전소 미세먼지 비율High correlation
화력발전소 미세먼지 비율 is highly overall correlated with 화력발전소 미세먼지 수치High correlation
화력발전소 고유번호 has unique valuesUnique
화력발전소 명 has unique valuesUnique

Reproduction

Analysis started2023-12-10 12:32:37.922468
Analysis finished2023-12-10 12:32:40.092415
Duration2.17 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

화력발전소 고유번호
Real number (ℝ)

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean25
Minimum1
Maximum49
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T21:32:40.222901image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile3.4
Q113
median25
Q337
95-th percentile46.6
Maximum49
Range48
Interquartile range (IQR)24

Descriptive statistics

Standard deviation14.28869
Coefficient of variation (CV)0.57154761
Kurtosis-1.2
Mean25
Median Absolute Deviation (MAD)12
Skewness0
Sum1225
Variance204.16667
MonotonicityStrictly increasing
2023-12-10T21:32:40.489967image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=49)
ValueCountFrequency (%)
1 1
 
2.0%
38 1
 
2.0%
28 1
 
2.0%
29 1
 
2.0%
30 1
 
2.0%
31 1
 
2.0%
32 1
 
2.0%
33 1
 
2.0%
34 1
 
2.0%
35 1
 
2.0%
Other values (39) 39
79.6%
ValueCountFrequency (%)
1 1
2.0%
2 1
2.0%
3 1
2.0%
4 1
2.0%
5 1
2.0%
6 1
2.0%
7 1
2.0%
8 1
2.0%
9 1
2.0%
10 1
2.0%
ValueCountFrequency (%)
49 1
2.0%
48 1
2.0%
47 1
2.0%
46 1
2.0%
45 1
2.0%
44 1
2.0%
43 1
2.0%
42 1
2.0%
41 1
2.0%
40 1
2.0%

화력발전소 명
Text

UNIQUE 

Distinct49
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-10T21:32:40.851224image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length10
Median length9
Mean length8.0204082
Min length4

Characters and Unicode

Total characters393
Distinct characters66
Distinct categories2 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique49 ?
Unique (%)100.0%

Sample

1st row광양복합화력발전소
2nd row나주열병합발전소
3rd row당진복합화력발전소
4th row북평화력발전소
5th row대산복합화력발전소
ValueCountFrequency (%)
광양복합화력발전소 1
 
2.0%
신인천복합화력발전소 1
 
2.0%
남제주화력발전소 1
 
2.0%
영월복합화력발전소 1
 
2.0%
삼척그린파워발전소 1
 
2.0%
안동복합화력발전소 1
 
2.0%
삼천포발전본부 1
 
2.0%
영흥발전본부 1
 
2.0%
분당발전본부 1
 
2.0%
영동발전본부 1
 
2.0%
Other values (39) 39
79.6%
2023-12-10T21:32:41.426864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
48
 
12.2%
48
 
12.2%
43
 
10.9%
31
 
7.9%
31
 
7.9%
24
 
6.1%
17
 
4.3%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (56) 124
31.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 390
99.2%
Uppercase Letter 3
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
48
 
12.3%
48
 
12.3%
43
 
11.0%
31
 
7.9%
31
 
7.9%
24
 
6.2%
17
 
4.4%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (53) 121
31.0%
Uppercase Letter
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
L 1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 390
99.2%
Latin 3
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
48
 
12.3%
48
 
12.3%
43
 
11.0%
31
 
7.9%
31
 
7.9%
24
 
6.2%
17
 
4.4%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (53) 121
31.0%
Latin
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
L 1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 390
99.2%
ASCII 3
 
0.8%

Most frequent character per block

Hangul
ValueCountFrequency (%)
48
 
12.3%
48
 
12.3%
43
 
11.0%
31
 
7.9%
31
 
7.9%
24
 
6.2%
17
 
4.4%
13
 
3.3%
7
 
1.8%
7
 
1.8%
Other values (53) 121
31.0%
ASCII
ValueCountFrequency (%)
G 1
33.3%
N 1
33.3%
L 1
33.3%
Distinct48
Distinct (%)98.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
2023-12-10T21:32:42.049572image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length32
Median length24
Mean length20.183673
Min length15

Characters and Unicode

Total characters989
Distinct characters130
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique47 ?
Unique (%)95.9%

Sample

1st row전라남도 광양시 제철로 2148-567
2nd row전라남도 나주시 산포면 신도산단길 65 (신도리 1304)
3rd row충남 당진시 송악읍 부곡공단로 241
4th row강원도 동해시 공단 2로 15-5(구호동)
5th row충남 서산시 대산읍 독곶1로 82
ValueCountFrequency (%)
경기도 13
 
5.6%
강원도 6
 
2.6%
전라남도 5
 
2.1%
충청남도 5
 
2.1%
인천광역시 5
 
2.1%
서구 4
 
1.7%
남구 3
 
1.3%
분당로 2
 
0.9%
201 2
 
0.9%
분당구 2
 
0.9%
Other values (166) 186
79.8%
2023-12-10T21:32:42.841383image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
184
 
18.6%
45
 
4.6%
40
 
4.0%
36
 
3.6%
1 28
 
2.8%
5 25
 
2.5%
3 23
 
2.3%
2 21
 
2.1%
21
 
2.1%
19
 
1.9%
Other values (120) 547
55.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 611
61.8%
Space Separator 184
 
18.6%
Decimal Number 176
 
17.8%
Dash Punctuation 11
 
1.1%
Open Punctuation 3
 
0.3%
Close Punctuation 3
 
0.3%
Other Punctuation 1
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
45
 
7.4%
40
 
6.5%
36
 
5.9%
21
 
3.4%
19
 
3.1%
19
 
3.1%
17
 
2.8%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (105) 368
60.2%
Decimal Number
ValueCountFrequency (%)
1 28
15.9%
5 25
14.2%
3 23
13.1%
2 21
11.9%
7 18
10.2%
0 16
9.1%
4 14
8.0%
6 12
6.8%
9 12
6.8%
8 7
 
4.0%
Space Separator
ValueCountFrequency (%)
184
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 11
100.0%
Open Punctuation
ValueCountFrequency (%)
( 3
100.0%
Close Punctuation
ValueCountFrequency (%)
) 3
100.0%
Other Punctuation
ValueCountFrequency (%)
. 1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 611
61.8%
Common 378
38.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
45
 
7.4%
40
 
6.5%
36
 
5.9%
21
 
3.4%
19
 
3.1%
19
 
3.1%
17
 
2.8%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (105) 368
60.2%
Common
ValueCountFrequency (%)
184
48.7%
1 28
 
7.4%
5 25
 
6.6%
3 23
 
6.1%
2 21
 
5.6%
7 18
 
4.8%
0 16
 
4.2%
4 14
 
3.7%
6 12
 
3.2%
9 12
 
3.2%
Other values (5) 25
 
6.6%

Most occurring blocks

ValueCountFrequency (%)
Hangul 611
61.8%
ASCII 378
38.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
184
48.7%
1 28
 
7.4%
5 25
 
6.6%
3 23
 
6.1%
2 21
 
5.6%
7 18
 
4.8%
0 16
 
4.2%
4 14
 
3.7%
6 12
 
3.2%
9 12
 
3.2%
Other values (5) 25
 
6.6%
Hangul
ValueCountFrequency (%)
45
 
7.4%
40
 
6.5%
36
 
5.9%
21
 
3.4%
19
 
3.1%
19
 
3.1%
17
 
2.8%
17
 
2.8%
15
 
2.5%
14
 
2.3%
Other values (105) 368
60.2%

화력발전소 미세먼지 수치
Real number (ℝ)

HIGH CORRELATION 

Distinct37
Distinct (%)75.5%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean36.588327
Minimum28.399
Maximum49.335
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T21:32:43.094548image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum28.399
5-th percentile30.0466
Q134.955
median35.441
Q338.399
95-th percentile42.9082
Maximum49.335
Range20.936
Interquartile range (IQR)3.444

Descriptive statistics

Standard deviation4.162426
Coefficient of variation (CV)0.11376377
Kurtosis1.5988633
Mean36.588327
Median Absolute Deviation (MAD)1.856
Skewness0.74163811
Sum1792.828
Variance17.325791
MonotonicityNot monotonic
2023-12-10T21:32:43.324892image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=37)
ValueCountFrequency (%)
35.339 7
 
14.3%
34.153 2
 
4.1%
37.297 2
 
4.1%
39.646 2
 
4.1%
38.399 2
 
4.1%
41.203 2
 
4.1%
28.399 2
 
4.1%
34.955 1
 
2.0%
35.294 1
 
2.0%
42.412 1
 
2.0%
Other values (27) 27
55.1%
ValueCountFrequency (%)
28.399 2
4.1%
29.515 1
2.0%
30.844 1
2.0%
31.396 1
2.0%
32.028 1
2.0%
33.139 1
2.0%
33.256 1
2.0%
34.153 2
4.1%
34.606 1
2.0%
34.84 1
2.0%
ValueCountFrequency (%)
49.335 1
2.0%
47.537 1
2.0%
42.983 1
2.0%
42.796 1
2.0%
42.412 1
2.0%
41.568 1
2.0%
41.203 2
4.1%
39.646 2
4.1%
38.691 1
2.0%
38.576 1
2.0%

전국 미세먼지 수치
Categorical

CONSTANT 

Distinct1
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size524.0 B
36.588
49 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row36.588
2nd row36.588
3rd row36.588
4th row36.588
5th row36.588

Common Values

ValueCountFrequency (%)
36.588 49
100.0%

Length

2023-12-10T21:32:43.533786image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2023-12-10T21:32:43.666075image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
36.588 49
100.0%

화력발전소 미세먼지 비율
Real number (ℝ)

HIGH CORRELATION 

Distinct35
Distinct (%)71.4%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean0.99995918
Minimum0.776
Maximum1.348
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size573.0 B
2023-12-10T21:32:43.821422image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0.776
5-th percentile0.8214
Q10.955
median0.969
Q31.049
95-th percentile1.173
Maximum1.348
Range0.572
Interquartile range (IQR)0.094

Descriptive statistics

Standard deviation0.11373561
Coefficient of variation (CV)0.11374026
Kurtosis1.5944543
Mean0.99995918
Median Absolute Deviation (MAD)0.05
Skewness0.740123
Sum48.998
Variance0.01293579
MonotonicityNot monotonic
2023-12-10T21:32:44.019751image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=35)
ValueCountFrequency (%)
0.966 8
 
16.3%
1.049 2
 
4.1%
1.126 2
 
4.1%
0.776 2
 
4.1%
1.012 2
 
4.1%
1.019 2
 
4.1%
1.084 2
 
4.1%
0.933 2
 
4.1%
0.807 1
 
2.0%
0.963 1
 
2.0%
Other values (25) 25
51.0%
ValueCountFrequency (%)
0.776 2
4.1%
0.807 1
2.0%
0.8429999999999999 1
2.0%
0.8579999999999999 1
2.0%
0.875 1
2.0%
0.906 1
2.0%
0.909 1
2.0%
0.933 2
4.1%
0.946 1
2.0%
0.952 1
2.0%
ValueCountFrequency (%)
1.348 1
2.0%
1.299 1
2.0%
1.175 1
2.0%
1.17 1
2.0%
1.159 1
2.0%
1.136 1
2.0%
1.126 2
4.1%
1.084 2
4.1%
1.057 1
2.0%
1.054 1
2.0%

Interactions

2023-12-10T21:32:39.360611image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:38.468952image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:38.943859image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2023-12-10T21:32:39.482391image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/