Overview

Dataset statistics

Number of variables: 29
Number of observations: 121
Missing cells: 2034
Missing cells (%): 58.0%
Duplicate rows: 1
Duplicate rows (%): 0.8%
Total size in memory: 27.5 KiB
Average record size in memory: 233.1 B
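
The percentage figures above follow from the raw counts. A minimal sketch, assuming missing cells are divided by the total cell count (variables × observations) and duplicate rows by the number of observations; the variable names are illustrative and not part of the report:

```python
# Counts copied from the dataset statistics above.
n_variables = 29
n_observations = 121
missing_cells = 2034
duplicate_rows = 1

# Assumption: percentages are taken over all cells and all rows respectively.
total_cells = n_variables * n_observations
missing_pct = round(100 * missing_cells / total_cells, 1)
duplicate_pct = round(100 * duplicate_rows / n_observations, 1)

print(total_cells, missing_pct, duplicate_pct)
```

Under these assumptions the results agree with the 58.0% and 0.8% shown in the report.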

Variable types

Unsupported: 18
Text: 8
Categorical: 3

Alerts

Unnamed: 1 has constant value "" [Constant]
Unnamed: 18 has constant value "" [Constant]
Unnamed: 25 has constant value "" [Constant]
Unnamed: 26 has constant value "" [Constant]
Unnamed: 27 has constant value "" [Constant]
Unnamed: 28 has constant value "" [Constant]
Dataset has 1 (0.8%) duplicate rows [Duplicates]
Unnamed: 3 is highly imbalanced (85.1%) [Imbalance]
사회복지법인 현황 (전라북도) has 2 (1.7%) missing values [Missing]
Unnamed: 1 has 120 (99.2%) missing values [Missing]
Unnamed: 5 has 3 (2.5%) missing values [Missing]
Unnamed: 6 has 4 (3.3%) missing values [Missing]
Unnamed: 7 has 4 (3.3%) missing values [Missing]
Unnamed: 8 has 4 (3.3%) missing values [Missing]
Unnamed: 9 has 4 (3.3%) missing values [Missing]
Unnamed: 10 has 5 (4.1%) missing values [Missing]
Unnamed: 11 has 4 (3.3%) missing values [Missing]
Unnamed: 12 has 60 (49.6%) missing values [Missing]
Unnamed: 13 has 91 (75.2%) missing values [Missing]
Unnamed: 14 has 97 (80.2%) missing values [Missing]
Unnamed: 15 has 115 (95.0%) missing values [Missing]
Unnamed: 16 has 118 (97.5%) missing values [Missing]
Unnamed: 17 has 118 (97.5%) missing values [Missing]
Unnamed: 18 has 120 (99.2%) missing values [Missing]
Unnamed: 19 has 118 (97.5%) missing values [Missing]
Unnamed: 20 has 115 (95.0%) missing values [Missing]
Unnamed: 21 has 115 (95.0%) missing values [Missing]
Unnamed: 22 has 115 (95.0%) missing values [Missing]
Unnamed: 23 has 115 (95.0%) missing values [Missing]
Unnamed: 24 has 107 (88.4%) missing values [Missing]
Unnamed: 25 has 120 (99.2%) missing values [Missing]
Unnamed: 26 has 120 (99.2%) missing values [Missing]
Unnamed: 27 has 120 (99.2%) missing values [Missing]
Unnamed: 28 has 120 (99.2%) missing values [Missing]
사회복지법인 현황 (전라북도) is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 5 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 7 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 9 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 10 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 11 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 12 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 13 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 14 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 15 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 16 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 17 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 19 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 20 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 21 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 22 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 23 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 24 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
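
The per-column missing counts in the alerts can be cross-checked against the dataset-level total. A minimal sketch, with the counts copied from the alerts above (columns without a missing-value alert are taken to have zero missing values); the 95% cut-off for "mostly empty" is an illustrative choice, not a threshold defined by the report:

```python
# Missing-value counts per column, as listed in the alerts.
missing_per_column = {
    "사회복지법인 현황 (전라북도)": 2,
    "Unnamed: 1": 120, "Unnamed: 5": 3, "Unnamed: 6": 4, "Unnamed: 7": 4,
    "Unnamed: 8": 4, "Unnamed: 9": 4, "Unnamed: 10": 5, "Unnamed: 11": 4,
    "Unnamed: 12": 60, "Unnamed: 13": 91, "Unnamed: 14": 97,
    "Unnamed: 15": 115, "Unnamed: 16": 118, "Unnamed: 17": 118,
    "Unnamed: 18": 120, "Unnamed: 19": 118, "Unnamed: 20": 115,
    "Unnamed: 21": 115, "Unnamed: 22": 115, "Unnamed: 23": 115,
    "Unnamed: 24": 107, "Unnamed: 25": 120, "Unnamed: 26": 120,
    "Unnamed: 27": 120, "Unnamed: 28": 120,
}
n_observations = 121

# The sum should reproduce the "Missing cells" figure in the overview.
total_missing = sum(missing_per_column.values())

# Hypothetical cut-off: columns missing in at least 95% of rows.
mostly_empty = [
    name for name, count in missing_per_column.items()
    if count / n_observations >= 0.95
]

print(total_missing, len(mostly_empty))
```

The sum matches the 2034 missing cells reported in the overview, and the columns flagged by the cut-off are natural candidates to review before any further analysis.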

Reproduction

Analysis started: 2024-03-14 01:17:20.359358
Analysis finished: 2024-03-14 01:17:20.646884
Duration: 0.29 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
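
The Unnamed: N column names, together with label-like sample values such as 시도, 시군구, 대표자명 and 주소지, suggest that the real header row sits a few lines below the title row of the source file. A minimal sketch of locating and promoting that row before profiling, using only the standard library; the inline sample, the `promote_header` helper and the choice of 시도 as the marker cell are illustrative assumptions, not taken from the original file:

```python
import csv
import io

# Hypothetical miniature of the source layout: a title row, empty rows,
# then the real header row followed by data rows.
raw = (
    "사회복지법인 현황 (전라북도),,,\n"
    ",,,\n"
    ",,,\n"
    ",시도,시군구,주소지\n"
    ",전북,전주시,(address)\n"
)

def promote_header(rows, marker):
    """Return (header, data_rows), using the first row containing `marker` as header."""
    for index, row in enumerate(rows):
        if marker in row:
            return row, rows[index + 1:]
    raise ValueError(f"no row contains {marker!r}")

rows = list(csv.reader(io.StringIO(raw)))
header, data = promote_header(rows, "시도")

# Each data row becomes a mapping from header label to cell value.
records = [dict(zip(header, row)) for row in data]
print(records[0]["시군구"])
```

Re-running the profile on data loaded this way would likely replace most of the Unnamed: N entries with meaningful variable names.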

Variables

사회복지법인 현황 (전라북도)
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 2
Missing (%): 1.7%
Memory size: 1.1 KiB

Unnamed: 1
Text

CONSTANT  MISSING 

Distinct: 1
Distinct (%): 100.0%
Missing: 120
Missing (%): 99.2%
Memory size: 1.1 KiB

Length

Max length: 3
Median length: 3
Mean length: 3
Min length: 3

Characters and Unicode

Total characters: 3
Distinct characters: 3
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): 100.0%

Sample

1st row: 담당자

Value | Count | Frequency (%)
담당자 | 1 | 100.0%

Most occurring characters

ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring categories

ValueCountFrequency (%)
Other Letter 3
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring scripts

ValueCountFrequency (%)
Hangul 3
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 3
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
33.3%
1
33.3%
1
33.3%

Unnamed: 2
Categorical

Distinct: 16
Distinct (%): 13.2%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
노인 | 46
장애인 | 24
아동 | 11
사회복귀시설운영 | 9
사회복지 | 6
Other values (11) | 25

Length

Max length: 28
Median length: 9
Mean length: 3.5950413
Min length: 2

Unique

Unique: 6
Unique (%): 5.0%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 주요 목적사업 <노인,아동,장애인 등으로만 기재>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
노인 | 46 | 38.0%
장애인 | 24 | 19.8%
아동 | 11 | 9.1%
사회복귀시설운영 | 9 | 7.4%
사회복지 | 6 | 5.0%
보육/노인 | 5 | 4.1%
한부모 | 5 | 4.1%
<NA> | 4 | 3.3%
정신요양시설운영 | 3 | 2.5%
노인/보육 | 2 | 1.7%
Other values (6) | 6 | 5.0%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
노인 | 46 | 36.5%
장애인 | 24 | 19.0%
아동 | 11 | 8.7%
사회복귀시설운영 | 9 | 7.1%
사회복지 | 6 | 4.8%
보육/노인 | 5 | 4.0%
한부모 | 5 | 4.0%
na | 4 | 3.2%
정신요양시설운영 | 3 | 2.4%
노인/보육 | 2 | 1.6%
Other values (11) | 11 | 8.7%

Unnamed: 3
Categorical

IMBALANCE 

Distinct: 3
Distinct (%): 2.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
전북 | 117
<NA> | 3
시도 | 1

Length

Max length: 4
Median length: 2
Mean length: 2.0495868
Min length: 2

Unique

Unique: 1
Unique (%): 0.8%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 시도
4th row: <NA>
5th row: 전북

Common Values

Value | Count | Frequency (%)
전북 | 117 | 96.7%
<NA> | 3 | 2.5%
시도 | 1 | 0.8%
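
The 85.1% imbalance figure reported for this variable is consistent with one minus the normalized Shannon entropy of the value counts. The sketch below assumes that definition, which the report itself does not state; `imbalance_score` is an illustrative helper, not the profiler's own function:

```python
import math

# Value counts from the Common Values table above.
counts = {"전북": 117, "<NA>": 3, "시도": 1}

def imbalance_score(value_counts):
    """Return 1 - H / log2(k), where H is the Shannon entropy over k classes."""
    total = sum(value_counts.values())
    n_classes = len(value_counts)
    if n_classes < 2:
        return 0.0  # a single class has no imbalance to measure
    entropy = -sum(
        (c / total) * math.log2(c / total)
        for c in value_counts.values()
        if c > 0
    )
    return 1 - entropy / math.log2(n_classes)

score = imbalance_score(counts)
print(f"{score:.1%}")
```

A score near 1 means a single value dominates, as 전북 does here; a score near 0 means the values are spread evenly.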

Length

Histogram of lengths of the category

Common Values (Plot)

Value | Count | Frequency (%)
전북 | 117 | 96.7%
na | 3 | 2.5%
시도 | 1 | 0.8%

Unnamed: 4
Categorical

Distinct: 15
Distinct (%): 12.4%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Value | Count
전주시 | 28
익산시 | 17
완주군 | 16
군산시 | 14
남원시 | 10
Other values (10) | 36

Length

Max length: 4
Median length: 3
Mean length: 3.0330579
Min length: 3

Unique

Unique: 2
Unique (%): 1.7%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: 시군구
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
전주시 | 28 | 23.1%
익산시 | 17 | 14.0%
완주군 | 16 | 13.2%
군산시 | 14 | 11.6%
남원시 | 10 | 8.3%
정읍시 | 9 | 7.4%
김제시 | 7 | 5.8%
<NA> | 4 | 3.3%
고창군 | 4 | 3.3%
순창군 | 3 | 2.5%
Other values (5) | 9 | 7.4%

Length

Histogram of lengths of the category

Value | Count | Frequency (%)
전주시 | 28 | 23.1%
익산시 | 17 | 14.0%
완주군 | 16 | 13.2%
군산시 | 14 | 11.6%
남원시 | 10 | 8.3%
정읍시 | 9 | 7.4%
김제시 | 7 | 5.8%
na | 4 | 3.3%
고창군 | 4 | 3.3%
순창군 | 3 | 2.5%
Other values (5) | 9 | 7.4%

Unnamed: 5
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 3
Missing (%): 2.5%
Memory size: 1.1 KiB

Unnamed: 6
Text

MISSING 

Distinct: 115
Distinct (%): 98.3%
Missing: 4
Missing (%): 3.3%
Memory size: 1.1 KiB

Length

Max length: 4
Median length: 3
Mean length: 3.017094
Min length: 2

Characters and Unicode

Total characters: 353
Distinct characters: 117
Distinct categories: 2
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 114
Unique (%): 97.4%
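
The distinct and unique percentages for this variable appear to be taken over non-missing values only, and the mean length over the same base. A minimal sketch of that arithmetic, assuming this denominator; the variable names are illustrative:

```python
# Figures copied from the statistics for Unnamed: 6.
n_observations = 121
missing = 4
distinct = 115
unique = 114
total_characters = 353

# Assumption: percentages and mean length use non-missing values as the base.
non_missing = n_observations - missing
distinct_pct = round(100 * distinct / non_missing, 1)
unique_pct = round(100 * unique / non_missing, 1)
mean_length = round(total_characters / non_missing, 6)

print(non_missing, distinct_pct, unique_pct, mean_length)
```

With 117 non-missing values, the results agree with the 98.3%, 97.4% and 3.017094 shown in the report.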

Sample

1st row: 대표자명
2nd row: 차종선
3rd row: 양기승
4th row: 이병호
5th row: 김정석

Value | Count | Frequency (%)
이병호 | 3 | 2.5%
이인재 | 1 | 0.8%
안준언 | 1 | 0.8%
박춘아 | 1 | 0.8%
김영식 | 1 | 0.8%
온주현 | 1 | 0.8%
전유권 | 1 | 0.8%
최규순 | 1 | 0.8%
김상태 | 1 | 0.8%
임안희 | 1 | 0.8%
Other values (107) | 107 | 89.9%

Most occurring characters

ValueCountFrequency (%)
25
 
7.1%
15
 
4.2%
15
 
4.2%
11
 
3.1%
10
 
2.8%
8
 
2.3%
7
 
2.0%
7
 
2.0%
6
 
1.7%
6
 
1.7%
Other values (107) 243
68.8%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 349 | 98.9%
Space Separator | 4 | 1.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
25
 
7.2%
15
 
4.3%
15
 
4.3%
11
 
3.2%
10
 
2.9%
8
 
2.3%
7
 
2.0%
7
 
2.0%
6
 
1.7%
6
 
1.7%
Other values (106) 239
68.5%
Space Separator
ValueCountFrequency (%)
4
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 349 | 98.9%
Common | 4 | 1.1%

Most frequent character per script

Hangul
ValueCountFrequency (%)
25
 
7.2%
15
 
4.3%
15
 
4.3%
11
 
3.2%
10
 
2.9%
8
 
2.3%
7
 
2.0%
7
 
2.0%
6
 
1.7%
6
 
1.7%
Other values (106) 239
68.5%
Common
ValueCountFrequency (%)
4
100.0%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 349 | 98.9%
ASCII | 4 | 1.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
25
 
7.2%
15
 
4.3%
15
 
4.3%
11
 
3.2%
10
 
2.9%
8
 
2.3%
7
 
2.0%
7
 
2.0%
6
 
1.7%
6
 
1.7%
Other values (106) 239
68.5%
ASCII
ValueCountFrequency (%)
4
100.0%

Unnamed: 7
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 4
Missing (%): 3.3%
Memory size: 1.1 KiB

Unnamed: 8
Text

MISSING 

Distinct: 117
Distinct (%): 100.0%
Missing: 4
Missing (%): 3.3%
Memory size: 1.1 KiB

Length

Max length: 22
Median length: 19
Mean length: 14.965812
Min length: 3

Characters and Unicode

Total characters: 1751
Distinct characters: 175
Distinct categories: 6
Distinct scripts: 2
Distinct blocks: 2
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
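
As a small illustration of those character properties, the sketch below tallies Unicode general categories for one sample address from this variable using Python's standard `unicodedata` module. It covers only the category property; script and block lookups are not available in the standard library and are left out. The report's labels correspond to the two-letter codes returned here (Other Letter is `Lo`, Decimal Number is `Nd`, Space Separator is `Zs`).

```python
import unicodedata
from collections import Counter

# One sample value listed for this variable.
sample = "전주시 덕진구 전주천동로 483"

# Map every character to its Unicode general category and count them.
category_counts = Counter(unicodedata.category(ch) for ch in sample)

for category, count in category_counts.most_common():
    print(category, count)
```

Applying the same tally to every non-missing value would give per-category totals comparable to the category table for this variable.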

Unique

Unique: 117
Unique (%): 100.0%

Sample

1st row: 주소지
2nd row: 전주시 덕진구 전주천동로 483
3rd row: 전주시 완산구 바람쐬는길 152
4th row: 전주시 완산구 서노송동 560-6
5th row: 전주시 완산구 전주객사 2길 12-8

Value | Count | Frequency (%)
전주시 | 28 | 6.4%
완산구 | 19 | 4.3%
완주군 | 16 | 3.7%
익산시 | 16 | 3.7%
군산시 | 14 | 3.2%
남원시 | 10 | 2.3%
덕진구 | 9 | 2.1%
정읍시 | 9 | 2.1%
김제시 | 7 | 1.6%
소양면 | 5 | 1.1%
Other values (265) | 304 | 69.6%

Most occurring characters

ValueCountFrequency (%)
320
 
18.3%
84
 
4.8%
1 67
 
3.8%
62
 
3.5%
59
 
3.4%
59
 
3.4%
2 53
 
3.0%
52
 
3.0%
- 47
 
2.7%
4 44
 
2.5%
Other values (165) 904
51.6%

Most occurring categories

Value | Count | Frequency (%)
Other Letter | 1009 | 57.6%
Decimal Number | 371 | 21.2%
Space Separator | 320 | 18.3%
Dash Punctuation | 47 | 2.7%
Open Punctuation | 2 | 0.1%
Close Punctuation | 2 | 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
84
 
8.3%
62
 
6.1%
59
 
5.8%
59
 
5.8%
52
 
5.2%
43
 
4.3%
40
 
4.0%
35
 
3.5%
33
 
3.3%
33
 
3.3%
Other values (151) 509
50.4%
Decimal Number
ValueCountFrequency (%)
1 67
18.1%
2 53
14.3%
4 44
11.9%
3 39
10.5%
7 35
9.4%
6 34
9.2%
5 29
7.8%
9 29
7.8%
8 24
 
6.5%
0 17
 
4.6%
Space Separator
ValueCountFrequency (%)
320
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 47
100.0%
Open Punctuation
ValueCountFrequency (%)
( 2
100.0%
Close Punctuation
ValueCountFrequency (%)
) 2
100.0%

Most occurring scripts

Value | Count | Frequency (%)
Hangul | 1009 | 57.6%
Common | 742 | 42.4%

Most frequent character per script

Hangul
ValueCountFrequency (%)
84
 
8.3%
62
 
6.1%
59
 
5.8%
59
 
5.8%
52
 
5.2%
43
 
4.3%
40
 
4.0%
35
 
3.5%
33
 
3.3%
33
 
3.3%
Other values (151) 509
50.4%
Common
ValueCountFrequency (%)
320
43.1%
1 67
 
9.0%
2 53
 
7.1%
- 47
 
6.3%
4 44
 
5.9%
3 39
 
5.3%
7 35
 
4.7%
6 34
 
4.6%
5 29
 
3.9%
9 29
 
3.9%
Other values (4) 45
 
6.1%

Most occurring blocks

Value | Count | Frequency (%)
Hangul | 1009 | 57.6%
ASCII | 742 | 42.4%

Most frequent character per block

ASCII
ValueCountFrequency (%)
320
43.1%
1 67
 
9.0%
2 53
 
7.1%
- 47
 
6.3%
4 44
 
5.9%
3 39
 
5.3%
7 35
 
4.7%
6 34
 
4.6%
5 29
 
3.9%
9 29
 
3.9%
Other values (4) 45
 
6.1%
Hangul
ValueCountFrequency (%)
84
 
8.3%
62
 
6.1%
59
 
5.8%
59
 
5.8%
52
 
5.2%
43
 
4.3%
40
 
4.0%
35
 
3.5%
33
 
3.3%
33
 
3.3%
Other values (151) 509
50.4%

Unnamed: 9
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 4
Missing (%): 3.3%
Memory size: 1.1 KiB

Unnamed: 10
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 5
Missing (%): 4.1%
Memory size: 1.1 KiB

Unnamed: 11
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 4
Missing (%): 3.3%
Memory size: 1.1 KiB

Unnamed: 12
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 60
Missing (%): 49.6%
Memory size: 1.1 KiB

Unnamed: 13
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 91
Missing (%): 75.2%
Memory size: 1.1 KiB

Unnamed: 14
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 97
Missing (%): 80.2%
Memory size: 1.1 KiB

Unnamed: 15
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 115
Missing (%): 95.0%
Memory size: 1.1 KiB

Unnamed: 16
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 118
Missing (%): 97.5%
Memory size: 1.1 KiB

Unnamed: 17
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 118
Missing (%): 97.5%
Memory size: 1.1 KiB

Unnamed: 18
Text

CONSTANT  MISSING 

Distinct: 1
Distinct (%): 100.0%
Missing: 120
Missing (%): 99.2%
Memory size: 1.1 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Characters and Unicode

Total characters: 4
Distinct characters: 4
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): 100.0%

Sample

1st row: 결핵한센

Value | Count | Frequency (%)
결핵한센 | 1 | 100.0%

Most occurring characters

ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 4
100.0%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per script

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 4
100.0%

Most frequent character per block

Hangul
ValueCountFrequency (%)
1
25.0%
1
25.0%
1
25.0%
1
25.0%

Unnamed: 19
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 118
Missing (%): 97.5%
Memory size: 1.1 KiB

Unnamed: 20
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 115
Missing (%): 95.0%
Memory size: 1.1 KiB

Unnamed: 21
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 115
Missing (%): 95.0%
Memory size: 1.1 KiB

Unnamed: 22
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 115
Missing (%): 95.0%
Memory size: 1.1 KiB

Unnamed: 23
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 115
Missing (%): 95.0%
Memory size: 1.1 KiB

Unnamed: 24
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 107
Missing (%): 88.4%
Memory size: 1.1 KiB

Unnamed: 25
Text

CONSTANT  MISSING 

Distinct: 1
Distinct (%): 100.0%
Missing: 120
Missing (%): 99.2%
Memory size: 1.1 KiB

Length

Max length: 4
Median length: 4
Mean length: 4
Min length: 4

Characters and Unicode

Total characters: 4
Distinct characters: 4
Distinct categories: 1
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 1
Unique (%): 100.0%

Sample

1st row: 보육시설

Value | Count | Frequency (%)
보육시설 | 1 | 100.0%