Overview

Dataset statistics

Number of variables30
Number of observations155
Missing cells2468
Missing cells (%)53.1%
Duplicate rows1
Duplicate rows (%)0.6%
Total size in memory36.6 KiB
Average record size in memory241.9 B

Variable types

Text19
Categorical10
Unsupported1

Alerts

Unnamed: 1 has constant value ""Constant
Unnamed: 18 has constant value ""Constant
Unnamed: 25 has constant value ""Constant
Unnamed: 26 has constant value ""Constant
Unnamed: 27 has constant value ""Constant
Unnamed: 28 has constant value ""Constant
Dataset has 1 (0.6%) duplicate rowsDuplicates
Unnamed: 12 is highly imbalanced (51.9%)Imbalance
Unnamed: 13 is highly imbalanced (64.8%)Imbalance
Unnamed: 14 is highly imbalanced (69.1%)Imbalance
Unnamed: 24 is highly imbalanced (75.2%)Imbalance
사회복지법인 현황 (전라북도) has 36 (23.2%) missing valuesMissing
Unnamed: 1 has 154 (99.4%) missing valuesMissing
Unnamed: 5 has 37 (23.9%) missing valuesMissing
Unnamed: 6 has 38 (24.5%) missing valuesMissing
Unnamed: 8 has 38 (24.5%) missing valuesMissing
Unnamed: 10 has 39 (25.2%) missing valuesMissing
Unnamed: 15 has 149 (96.1%) missing valuesMissing
Unnamed: 16 has 152 (98.1%) missing valuesMissing
Unnamed: 17 has 152 (98.1%) missing valuesMissing
Unnamed: 18 has 154 (99.4%) missing valuesMissing
Unnamed: 19 has 152 (98.1%) missing valuesMissing
Unnamed: 20 has 149 (96.1%) missing valuesMissing
Unnamed: 21 has 149 (96.1%) missing valuesMissing
Unnamed: 22 has 149 (96.1%) missing valuesMissing
Unnamed: 23 has 149 (96.1%) missing valuesMissing
Unnamed: 25 has 154 (99.4%) missing valuesMissing
Unnamed: 26 has 154 (99.4%) missing valuesMissing
Unnamed: 27 has 154 (99.4%) missing valuesMissing
Unnamed: 28 has 154 (99.4%) missing valuesMissing
Unnamed: 29 has 155 (100.0%) missing valuesMissing
Unnamed: 29 is an unsupported type, check if it needs cleaning or further analysisUnsupported

Reproduction

Analysis started2024-03-14 01:17:08.524685
Analysis finished2024-03-14 01:17:09.143818
Duration0.62 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct119
Distinct (%)100.0%
Missing36
Missing (%)23.2%
Memory size1.3 KiB
2024-03-14T10:17:09.352628image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length20
Median length2
Mean length2.2184874
Min length1

Characters and Unicode

Total characters264
Distinct characters27
Distinct categories4 ?
Distinct scripts2 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique119 ?
Unique (%)100.0%

Sample

1st row※ 2017.6월말 기준 사회복지법인
2nd row연번
3rd row총계
4th row1
5th row2
ValueCountFrequency (%)
1
 
0.8%
59 1
 
0.8%
84 1
 
0.8%
83 1
 
0.8%
82 1
 
0.8%
81 1
 
0.8%
80 1
 
0.8%
79 1
 
0.8%
78 1
 
0.8%
77 1
 
0.8%
Other values (112) 112
91.8%
2024-03-14T10:17:09.755397image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1 47
17.8%
2 23
8.7%
6 23
8.7%
0 22
8.3%
7 22
8.3%
5 22
8.3%
4 22
8.3%
3 22
8.3%
9 21
8.0%
8 21
8.0%
Other values (17) 19
7.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 245
92.8%
Other Letter 14
 
5.3%
Space Separator 3
 
1.1%
Other Punctuation 2
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%
Decimal Number
ValueCountFrequency (%)
1 47
19.2%
2 23
9.4%
6 23
9.4%
0 22
9.0%
7 22
9.0%
5 22
9.0%
4 22
9.0%
3 22
9.0%
9 21
8.6%
8 21
8.6%
Other Punctuation
ValueCountFrequency (%)
1
50.0%
. 1
50.0%
Space Separator
ValueCountFrequency (%)
3
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 250
94.7%
Hangul 14
 
5.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%
Common
ValueCountFrequency (%)
1 47
18.8%
2 23
9.2%
6 23
9.2%
0 22
8.8%
7 22
8.8%
5 22
8.8%
4 22
8.8%
3 22
8.8%
9 21
8.4%
8 21
8.4%
Other values (3) 5
 
2.0%

Most occurring blocks

ValueCountFrequency (%)
ASCII 249
94.3%
Hangul 14
 
5.3%
Punctuation 1
 
0.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
1 47
18.9%
2 23
9.2%
6 23
9.2%
0 22
8.8%
7 22
8.8%
5 22
8.8%
4 22
8.8%
3 22
8.8%
9 21
8.4%
8 21
8.4%
Other values (2) 4
 
1.6%
Hangul
ValueCountFrequency (%)
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
1
 
7.1%
Other values (4) 4
28.6%
Punctuation
ValueCountFrequency (%)
1
100.0%

Unnamed: 1
Text

CONSTANT  MISSING 

Distinct1
Distinct (%)100.0%
Missing154
Missing (%)99.4%
Memory size1.3 KiB
2024-03-14T10:17:09.854976image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length3
Median length3
Mean length3
Min length3

Characters and Unicode

Total characters3
Distinct characters3
Distinct categories1 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1 ?
Unique (%)100.0%

Sample

1st row담당자
ValueCountFrequency (%)
담당자 1
100.0%
2024-03-14T10:17:10.272586image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Unnamed: 2
Categorical

Distinct17
Distinct (%)11.0%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
노인
46 
<NA>
38 
장애인
24 
아동
11 
사회복귀시설운영
Other values (12)
27 

Length

Max length28
Median length9
Mean length3.7096774
Min length2

Unique

Unique6 ?
Unique (%)3.9%

Sample

1st row<NA>
2nd row<NA>
3rd row주요 목적사업 <노인,아동,장애인 등으로만 기재>
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
노인 46
29.7%
<NA> 38
24.5%
장애인 24
15.5%
아동 11
 
7.1%
사회복귀시설운영 9
 
5.8%
보육/노인 5
 
3.2%
한부모 5
 
3.2%
사회복지 4
 
2.6%
정신요양시설운영 3
 
1.9%
사회복지 2
 
1.3%
Other values (7) 8
 
5.2%

Length

2024-03-14T10:17:10.383218image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
노인 46
28.7%
na 38
23.8%
장애인 24
15.0%
아동 11
 
6.9%
사회복귀시설운영 9
 
5.6%
사회복지 6
 
3.8%
보육/노인 5
 
3.1%
한부모 5
 
3.1%
정신요양시설운영 3
 
1.9%
노인/보육 2
 
1.2%
Other values (11) 11
 
6.9%

Unnamed: 3
Categorical

Distinct3
Distinct (%)1.9%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
전북
117 
<NA>
37 
시도
 
1

Length

Max length4
Median length2
Mean length2.4774194
Min length2

Unique

Unique1 ?
Unique (%)0.6%

Sample

1st row<NA>
2nd row<NA>
3rd row시도
4th row<NA>
5th row전북

Common Values

ValueCountFrequency (%)
전북 117
75.5%
<NA> 37
 
23.9%
시도 1
 
0.6%

Length

2024-03-14T10:17:10.483574image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T10:17:10.575143image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
전북 117
75.5%
na 37
 
23.9%
시도 1
 
0.6%

Unnamed: 4
Categorical

Distinct15
Distinct (%)9.7%
Missing0
Missing (%)0.0%
Memory size1.3 KiB
<NA>
38 
전주시
28 
익산시
17 
완주군
16 
군산시
14 
Other values (10)
42 

Length

Max length4
Median length3
Mean length3.2451613
Min length3

Unique

Unique2 ?
Unique (%)1.3%

Sample

1st row<NA>
2nd row<NA>
3rd row시군구
4th row<NA>
5th row<NA>

Common Values

ValueCountFrequency (%)
<NA> 38
24.5%
전주시 28
18.1%
익산시 17
11.0%
완주군 16
10.3%
군산시 14
 
9.0%
남원시 10
 
6.5%
정읍시 9
 
5.8%
김제시 7
 
4.5%
고창군 4
 
2.6%
순창군 3
 
1.9%
Other values (5) 9
 
5.8%

Length

2024-03-14T10:17:10.686115image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
na 38
24.5%
전주시 28
18.1%
익산시 17
11.0%
완주군 16
10.3%
군산시 14
 
9.0%
남원시 10
 
6.5%
정읍시 9
 
5.8%
김제시 7
 
4.5%
고창군 4
 
2.6%
순창군 3
 
1.9%
Other values (5) 9
 
5.8%

Unnamed: 5
Text

MISSING 

Distinct118
Distinct (%)100.0%
Missing37
Missing (%)23.9%
Memory size1.3 KiB
2024-03-14T10:17:10.959933image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length12
Mean length5.6610169
Min length1

Characters and Unicode

Total characters668
Distinct characters171
Distinct categories5 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique118 ?
Unique (%)100.0%

Sample

1st row법 인 명
2nd row116개 법인
3rd row전라북도사회복지협의회
4th row참사랑복지회
5th row천주교성가복지회
ValueCountFrequency (%)
임마누엘 1
 
0.8%
평화 1
 
0.8%
김제가나안복지재단 1
 
0.8%
유한복지재단 1
 
0.8%
햇빛 1
 
0.8%
우리원 1
 
0.8%
예닮문화복지재단 1
 
0.8%
서남행복원 1
 
0.8%
서남 1
 
0.8%
상초 1
 
0.8%
Other values (119) 119
92.2%
2024-03-14T10:17:11.297845image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
63
 
9.4%
60
 
9.0%
43
 
6.4%
43
 
6.4%
39
 
5.8%
22
 
3.3%
17
 
2.5%
15
 
2.2%
11
 
1.6%
9
 
1.3%
Other values (161) 346
51.8%

Most occurring categories

ValueCountFrequency (%)
Other Letter 644
96.4%
Space Separator 17
 
2.5%
Decimal Number 3
 
0.4%
Close Punctuation 2
 
0.3%
Open Punctuation 2
 
0.3%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
63
 
9.8%
60
 
9.3%
43
 
6.7%
43
 
6.7%
39
 
6.1%
22
 
3.4%
15
 
2.3%
11
 
1.7%
9
 
1.4%
9
 
1.4%
Other values (156) 330
51.2%
Decimal Number
ValueCountFrequency (%)
1 2
66.7%
6 1
33.3%
Space Separator
ValueCountFrequency (%)
17
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 644
96.4%
Common 24
 
3.6%

Most frequent character per script

Hangul
ValueCountFrequency (%)
63
 
9.8%
60
 
9.3%
43
 
6.7%
43
 
6.7%
39
 
6.1%
22
 
3.4%
15
 
2.3%
11
 
1.7%
9
 
1.4%
9
 
1.4%
Other values (156) 330
51.2%
Common
ValueCountFrequency (%)
17
70.8%
) 2
 
8.3%
( 2
 
8.3%
1 2
 
8.3%
6 1
 
4.2%

Most occurring blocks

ValueCountFrequency (%)
Hangul 644
96.4%
ASCII 24
 
3.6%

Most frequent character per block

Hangul
ValueCountFrequency (%)
63
 
9.8%
60
 
9.3%
43
 
6.7%
43
 
6.7%
39
 
6.1%
22
 
3.4%
15
 
2.3%
11
 
1.7%
9
 
1.4%
9
 
1.4%
Other values (156) 330
51.2%
ASCII
ValueCountFrequency (%)
17
70.8%
) 2
 
8.3%
( 2
 
8.3%
1 2
 
8.3%
6 1
 
4.2%

Unnamed: 6
Text

MISSING 

Distinct115
Distinct (%)98.3%
Missing38
Missing (%)24.5%
Memory size1.3 KiB
2024-03-14T10:17:11.579204image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/