Overview

Dataset statistics

Number of variables6
Number of observations349
Missing cells210
Missing cells (%)10.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory16.8 KiB
Average record size in memory49.4 B

Variable types

Categorical1
Text2
Numeric1
DateTime2

Dataset

Description태양광발전 시설 현황(발전소명, 설비용량, 발전소주소, 최초허가일, 사업개시일)에 대한 정보를 제공하고 있습니다.
Author충청남도
URLhttps://alldam.chungnam.go.kr/index.chungnam?menuCd=DOM_000000201001001001&st=&cds=&orgCd=&apiType=&isOpen=Y&pageIndex=433&beforeMenuCd=DOM_000000201001001000&publicdatapk=15033988

Alerts

구분 has constant value ""Constant
사업개시일 has 210 (60.2%) missing valuesMissing

Reproduction

Analysis started2024-01-09 20:50:59.031035
Analysis finished2024-01-09 20:50:59.464537
Duration0.43 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

구분
Categorical

CONSTANT 

Distinct1
Distinct (%)0.3%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
전기사업허가
349 

Length

Max length6
Median length6
Mean length6
Min length6

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row전기사업허가
2nd row전기사업허가
3rd row전기사업허가
4th row전기사업허가
5th row전기사업허가

Common Values

ValueCountFrequency (%)
전기사업허가 349
100.0%

Length

2024-01-10T05:50:59.519908image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-01-10T05:50:59.598500image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전기사업허가 349
100.0%
Distinct330
Distinct (%)94.6%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-01-10T05:50:59.914535image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length18
Median length16
Mean length11.232092
Min length2

Characters and Unicode

Total characters3920
Distinct characters235
Distinct categories7 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique317 ?
Unique (%)90.8%

Sample

1st row마정2
2nd row윤경
3rd row원북면장대1리 태양광발전소
4th row신두3리 다목적회관
5th row방갈2리 다목적회관
ValueCountFrequency (%)
태양광발전소 303
41.8%
쏠라포스 23
 
3.2%
2호 8
 
1.1%
3호 7
 
1.0%
1호 6
 
0.8%
㈜썬솔라에너지 6
 
0.8%
다도 5
 
0.7%
4호 5
 
0.7%
유한회사 5
 
0.7%
방갈1리 4
 
0.6%
Other values (317) 353
48.7%
2024-01-10T05:51:00.399691image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
382
 
9.7%
376
 
9.6%
347
 
8.9%
333
 
8.5%
330
 
8.4%
330
 
8.4%
329
 
8.4%
190
 
4.8%
1 73
 
1.9%
59
 
1.5%
Other values (225) 1171
29.9%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3250
82.9%
Space Separator 376
 
9.6%
Decimal Number 265
 
6.8%
Other Symbol 14
 
0.4%
Uppercase Letter 11
 
0.3%
Open Punctuation 2
 
0.1%
Close Punctuation 2
 
0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
382
11.8%
347
10.7%
333
 
10.2%
330
 
10.2%
330
 
10.2%
329
 
10.1%
190
 
5.8%
59
 
1.8%
49
 
1.5%
36
 
1.1%
Other values (205) 865
26.6%
Decimal Number
ValueCountFrequency (%)
1 73
27.5%
2 52
19.6%
3 35
13.2%
4 33
12.5%
5 21
 
7.9%
6 14
 
5.3%
7 10
 
3.8%
8 10
 
3.8%
9 9
 
3.4%
0 8
 
3.0%
Uppercase Letter
ValueCountFrequency (%)
S 3
27.3%
B 3
27.3%
K 2
18.2%
R 1
 
9.1%
C 1
 
9.1%
M 1
 
9.1%
Space Separator
ValueCountFrequency (%)
376
100.0%
Other Symbol
ValueCountFrequency (%)
14
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3264
83.3%
Common 645
 
16.5%
Latin 11
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
382
11.7%
347
 
10.6%
333
 
10.2%
330
 
10.1%
330
 
10.1%
329
 
10.1%
190
 
5.8%
59
 
1.8%
49
 
1.5%
36
 
1.1%
Other values (206) 879
26.9%
Common
ValueCountFrequency (%)
376
58.3%
1 73
 
11.3%
2 52
 
8.1%
3 35
 
5.4%
4 33
 
5.1%
5 21
 
3.3%
6 14
 
2.2%
7 10
 
1.6%
8 10
 
1.6%
9 9
 
1.4%
Other values (3) 12
 
1.9%
Latin
ValueCountFrequency (%)
S 3
27.3%
B 3
27.3%
K 2
18.2%
R 1
 
9.1%
C 1
 
9.1%
M 1
 
9.1%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3250
82.9%
ASCII 656
 
16.7%
None 14
 
0.4%

Most frequent character per block

Hangul
ValueCountFrequency (%)
382
11.8%
347
10.7%
333
 
10.2%
330
 
10.2%
330
 
10.2%
329
 
10.1%
190
 
5.8%
59
 
1.8%
49
 
1.5%
36
 
1.1%
Other values (205) 865
26.6%
ASCII
ValueCountFrequency (%)
376
57.3%
1 73
 
11.1%
2 52
 
7.9%
3 35
 
5.3%
4 33
 
5.0%
5 21
 
3.2%
6 14
 
2.1%
7 10
 
1.5%
8 10
 
1.5%
9 9
 
1.4%
Other values (9) 23
 
3.5%
None
ValueCountFrequency (%)
14
100.0%

설비용량
Real number (ℝ)

Distinct112
Distinct (%)32.1%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean296.12444
Minimum11.83
Maximum1951.2
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size3.2 KiB
2024-01-10T05:51:00.532536image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum11.83
5-th percentile19.587
Q198.1
median99.68
Q3499.8
95-th percentile998.4
Maximum1951.2
Range1939.37
Interquartile range (IQR)401.7

Descriptive statistics

Standard deviation356.14882
Coefficient of variation (CV)1.2026998
Kurtosis0.93350455
Mean296.12444
Median Absolute Deviation (MAD)43.12
Skewness1.4072339
Sum103347.43
Variance126841.98
MonotonicityNot monotonic
2024-01-10T05:51:00.668819image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
99.68 42
 
12.0%
99.76 26
 
7.4%
998.4 26
 
7.4%
99.44 25
 
7.2%
499.96 21
 
6.0%
99.6 20
 
5.7%
98.1 14
 
4.0%
99.84 14
 
4.0%
96.3 7
 
2.0%
991.2 5
 
1.4%
Other values (102) 149
42.7%
ValueCountFrequency (%)
11.83 1
0.3%
12.6 1
0.3%
15.0 1
0.3%
15.3 1
0.3%
15.47 1
0.3%
16.02 1
0.3%
16.2 1
0.3%
17.85 2
0.6%
18.06 1
0.3%
19.04 1
0.3%
ValueCountFrequency (%)
1951.2 1
 
0.3%
999.9 1
 
0.3%
999.18 2
 
0.6%
999.0 5
 
1.4%
998.64 3
 
0.9%
998.4 26
7.4%
997.92 5
 
1.4%
997.5 1
 
0.3%
996.84 4
 
1.1%
991.2 5
 
1.4%
Distinct220
Distinct (%)63.0%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
2024-01-10T05:51:00.880209image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length104
Median length51
Mean length25.808023
Min length13

Characters and Unicode

Total characters9007
Distinct characters98
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique172 ?
Unique (%)49.3%

Sample

1st row태안군 안면읍 중장리 425-348, 425-349
2nd row태안군 원북면 장대리 7-2 제2동
3rd row태안군 원북면 장대리 233-2
4th row태안군 원북면 신두리 1221--12
5th row태안군 원북면 방갈리 515-131(건물위)
ValueCountFrequency (%)
태안군 343
19.2%
178
 
10.0%
안면읍 135
 
7.6%
정당리 90
 
5.0%
소원면 80
 
4.5%
원북면 48
 
2.7%
모항리 35
 
2.0%
태안읍 35
 
2.0%
중장리 34
 
1.9%
충청남도 32
 
1.8%
Other values (333) 778
43.5%
2024-01-10T05:51:01.209387image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1443
 
16.0%
1 588
 
6.5%
514
 
5.7%
- 481
 
5.3%
378
 
4.2%
2 360
 
4.0%
348
 
3.9%
343
 
3.8%
315
 
3.5%
6 298
 
3.3%
Other values (88) 3939
43.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3865
42.9%
Decimal Number 2595
28.8%
Space Separator 1443
 
16.0%
Dash Punctuation 481
 
5.3%
Open Punctuation 212
 
2.4%
Close Punctuation 212
 
2.4%
Other Punctuation 199
 
2.2%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
514
13.3%
378
 
9.8%
348
 
9.0%
343
 
8.9%
315
 
8.2%
188
 
4.9%
170
 
4.4%
139
 
3.6%
138
 
3.6%
107
 
2.8%
Other values (72) 1225
31.7%
Decimal Number
ValueCountFrequency (%)
1 588
22.7%
2 360
13.9%
6 298
11.5%
5 286
11.0%
7 270
10.4%
4 229
 
8.8%
3 183
 
7.1%
0 158
 
6.1%
9 116
 
4.5%
8 107
 
4.1%
Other Punctuation
ValueCountFrequency (%)
, 192
96.5%
/ 7
 
3.5%
Space Separator
ValueCountFrequency (%)
1443
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 481
100.0%
Open Punctuation
ValueCountFrequency (%)
( 212
100.0%
Close Punctuation
ValueCountFrequency (%)
) 212
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 5142
57.1%
Hangul 3865
42.9%

Most frequent character per script

Hangul
ValueCountFrequency (%)
514
13.3%
378
 
9.8%
348
 
9.0%
343
 
8.9%
315
 
8.2%
188
 
4.9%
170
 
4.4%
139
 
3.6%
138
 
3.6%
107
 
2.8%
Other values (72) 1225
31.7%
Common
ValueCountFrequency (%)
1443
28.1%
1 588
11.4%
- 481
 
9.4%
2 360
 
7.0%
6 298
 
5.8%
5 286
 
5.6%
7 270
 
5.3%
4 229
 
4.5%
( 212
 
4.1%
) 212
 
4.1%
Other values (6) 763
14.8%

Most occurring blocks

ValueCountFrequency (%)
ASCII 5142
57.1%
Hangul 3865
42.9%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1443
28.1%
1 588
11.4%
- 481
 
9.4%
2 360
 
7.0%
6 298
 
5.8%
5 286
 
5.6%
7 270
 
5.3%
4 229
 
4.5%
( 212
 
4.1%
) 212
 
4.1%
Other values (6) 763
14.8%
Hangul
ValueCountFrequency (%)
514
13.3%
378
 
9.8%
348
 
9.0%
343
 
8.9%
315
 
8.2%
188
 
4.9%
170
 
4.4%
139
 
3.6%
138
 
3.6%
107
 
2.8%
Other values (72) 1225
31.7%
Distinct112
Distinct (%)32.1%
Missing0
Missing (%)0.0%
Memory size2.9 KiB
Minimum2014-02-04 00:00:00
Maximum2023-11-30 00:00:00
2024-01-10T05:51:01.328118image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:51:01.441743image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

사업개시일
Date

MISSING 

Distinct60
Distinct (%)43.2%
Missing210
Missing (%)60.2%
Memory size2.9 KiB
Minimum2015-03-12 00:00:00
Maximum2025-07-21 00:00:00
2024-01-10T05:51:01.552019image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
2024-01-10T05:51:01.665730image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)

Interactions

2024-01-10T05:50:59.246470image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Correlations

2024-01-10T05:51:01.743592image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
설비용량사업개시일
설비용량1.0000.976
사업개시일0.9761.000

Missing values

2024-01-10T05:50:59.345836image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/