Overview

Dataset statistics

Number of variables: 33
Number of observations: 130
Missing cells: 1590
Missing cells (%): 37.1%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 36.6 KiB
Average record size in memory: 288.0 B
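
As a quick sanity check, the missing-cell percentage can be recomputed from the totals reported above. The sketch below uses only those three figures; the variable names are illustrative and not part of the report.

```python
# Totals copied from the dataset statistics.
n_variables = 33
n_observations = 130
missing_cells = 1590

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"{total_cells} cells, {missing_pct:.1f}% missing")  # 4290 cells, 37.1% missing
```

Ten of the 33 variables are entirely empty (1300 of the 1590 missing cells), which is why the overall share is so high.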

Variable types

Numeric: 8
Categorical: 10
Text: 4
Unsupported: 10
DateTime: 1

Dataset

Description: 2021-12-01
Author: 지방행정인허가공개데이터 (Local Administration Licensing Open Data)
URL: https://bigdata.busan.go.kr/data/bigDataDetailView.do?menuCode=M00000000007&hdfs_file_sn=20230901050101123193

Alerts

개방서비스명 has constant value "요양보호사교육기관" [Constant]
개방서비스id has constant value "11_49_01_P" [Constant]
개방자치단체코드 has constant value "6260000" [Constant]
휴업종료일자 is highly imbalanced (93.5%) [Imbalance]
재개업일자 is highly imbalanced (93.5%) [Imbalance]
인허가취소일자 has 130 (100.0%) missing values [Missing]
폐업일자 has 57 (43.8%) missing values [Missing]
휴업시작일자 has 130 (100.0%) missing values [Missing]
소재지전화 has 10 (7.7%) missing values [Missing]
소재지면적 has 130 (100.0%) missing values [Missing]
소재지우편번호 has 130 (100.0%) missing values [Missing]
소재지전체주소 has 18 (13.8%) missing values [Missing]
도로명전체주소 has 13 (10.0%) missing values [Missing]
도로명우편번호 has 94 (72.3%) missing values [Missing]
업태구분명 has 130 (100.0%) missing values [Missing]
좌표정보(x) has 49 (37.7%) missing values [Missing]
좌표정보(y) has 49 (37.7%) missing values [Missing]
보유자격증명 has 130 (100.0%) missing values [Missing]
국비지원여부 has 130 (100.0%) missing values [Missing]
담당직원내용 has 130 (100.0%) missing values [Missing]
삭제일자 has 130 (100.0%) missing values [Missing]
Unnamed: 32 has 130 (100.0%) missing values [Missing]
번호 has unique values [Unique]
관리번호 has unique values [Unique]
최종수정시점 has unique values [Unique]
인허가취소일자 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
휴업시작일자 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
소재지면적 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
소재지우편번호 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
업태구분명 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
보유자격증명 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
국비지원여부 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
담당직원내용 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
삭제일자 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]
Unnamed: 32 is an unsupported type, check if it needs cleaning or further analysis [Unsupported]

Reproduction

Analysis started: 2024-04-20 14:32:23.533631
Analysis finished: 2024-04-20 14:32:24.342318
Duration: 0.81 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

번호
Real number (ℝ)

UNIQUE 

Distinct: 130
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 65.5
Minimum: 1
Maximum: 130
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 1
5-th percentile: 7.45
Q1: 33.25
Median: 65.5
Q3: 97.75
95-th percentile: 123.55
Maximum: 130
Range: 129
Interquartile range (IQR): 64.5

Descriptive statistics

Standard deviation: 37.671829
Coefficient of variation (CV): 0.57514242
Kurtosis: -1.2
Mean: 65.5
Median Absolute Deviation (MAD): 32.5
Skewness: 0
Sum: 8515
Variance: 1419.1667
Monotonicity: Strictly increasing
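
Because 번호 is unique, strictly increasing, and runs from 1 to 130, it looks like a plain row counter, and its statistics can be reproduced without the data file. The following is a minimal sketch that assumes the column is exactly the integers 1 through 130; it also shows that the reported variance corresponds to the sample (n - 1) denominator rather than the population one.

```python
import statistics

# Assumption: the column holds 1, 2, ..., 130 with no gaps.
values = list(range(1, 131))

total = sum(values)
mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance, n - 1 denominator
std_dev = statistics.stdev(values)
median = statistics.median(values)
# Median absolute deviation around the median.
mad = statistics.median(abs(v - median) for v in values)

print(total, mean, round(variance, 4), round(std_dev, 6), mad)
```

The population variance of the same values would be 1408.25, so the 1419.1667 in the report is only consistent with the sample formula.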
Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
1 | 1 | 0.8%
99 | 1 | 0.8%
97 | 1 | 0.8%
96 | 1 | 0.8%
95 | 1 | 0.8%
94 | 1 | 0.8%
93 | 1 | 0.8%
92 | 1 | 0.8%
91 | 1 | 0.8%
90 | 1 | 0.8%
Other values (120) | 120 | 92.3%

Minimum 10 values
Value | Count | Frequency (%)
1 | 1 | 0.8%
2 | 1 | 0.8%
3 | 1 | 0.8%
4 | 1 | 0.8%
5 | 1 | 0.8%
6 | 1 | 0.8%
7 | 1 | 0.8%
8 | 1 | 0.8%
9 | 1 | 0.8%
10 | 1 | 0.8%

Maximum 10 values
Value | Count | Frequency (%)
130 | 1 | 0.8%
129 | 1 | 0.8%
128 | 1 | 0.8%
127 | 1 | 0.8%
126 | 1 | 0.8%
125 | 1 | 0.8%
124 | 1 | 0.8%
123 | 1 | 0.8%
122 | 1 | 0.8%
121 | 1 | 0.8%

개방서비스명
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
요양보호사교육기관: 130

Length

Max length: 9
Median length: 9
Mean length: 9
Min length: 9

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 요양보호사교육기관
2nd row: 요양보호사교육기관
3rd row: 요양보호사교육기관
4th row: 요양보호사교육기관
5th row: 요양보호사교육기관

Common Values

Value | Count | Frequency (%)
요양보호사교육기관 | 130 | 100.0%

Length

Histogram of lengths of the category (figure omitted)

Common Values (Plot)

Plot of the counts in the Common Values table (figure omitted)

개방서비스id
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
11_49_01_P: 130

Length

Max length: 10
Median length: 10
Mean length: 10
Min length: 10

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 11_49_01_P
2nd row: 11_49_01_P
3rd row: 11_49_01_P
4th row: 11_49_01_P
5th row: 11_49_01_P

Common Values

Value | Count | Frequency (%)
11_49_01_P | 130 | 100.0%

Length

Histogram of lengths of the category (figure omitted)

Common Values (Plot)

Plot of the counts in the Common Values table (figure omitted)

개방자치단체코드
Categorical

CONSTANT 

Distinct: 1
Distinct (%): 0.8%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
6260000: 130

Length

Max length: 7
Median length: 7
Mean length: 7
Min length: 7

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 6260000
2nd row: 6260000
3rd row: 6260000
4th row: 6260000
5th row: 6260000

Common Values

Value | Count | Frequency (%)
6260000 | 130 | 100.0%

Length

Histogram of lengths of the category (figure omitted)

Common Values (Plot)

Plot of the counts in the Common Values table (figure omitted)

관리번호
Text

UNIQUE 

Distinct: 130
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB

Length

Max length: 27
Median length: 27
Mean length: 27
Min length: 27

Characters and Unicode

Total characters: 3510
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 130
Unique (%): 100.0%

Sample

1st row: 201062600000040045-20100706
2nd row: 201062600000040071-20101011
3rd row: 201562600000040004-20150713
4th row: 201562600000040006-20151214
5th row: 201062600000040007-20100604
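
The sampled 관리번호 values all share one shape: 18 digits, a dash, then 8 digits that read as a YYYYMMDD date. Within the 18-digit prefix, the first four digits look like a year and the next seven match the constant 개방자치단체코드 value 6260000. The sketch below splits a value along those lines. The field layout is inferred from the five samples only and is not documented in this report, so the names `ManagementNumber`, `authority_code`, and `serial` are hypothetical.

```python
from datetime import date, datetime
from typing import NamedTuple


class ManagementNumber(NamedTuple):
    year: int             # first 4 digits (assumed to be a year)
    authority_code: str   # next 7 digits (assumed to be 개방자치단체코드)
    serial: str           # remaining 7 digits (assumed serial number)
    suffix_date: date     # 8 digits after the dash, read as YYYYMMDD


def parse_management_number(raw: str) -> ManagementNumber:
    """Split a 27-character 관리번호 under the assumed layout."""
    prefix, _, suffix = raw.partition("-")
    if len(prefix) != 18 or len(suffix) != 8 or not (prefix + suffix).isdigit():
        raise ValueError(f"unexpected format: {raw!r}")
    return ManagementNumber(
        year=int(prefix[:4]),
        authority_code=prefix[4:11],
        serial=prefix[11:],
        suffix_date=datetime.strptime(suffix, "%Y%m%d").date(),
    )


parsed = parse_management_number("201062600000040045-20100706")
print(parsed)
```

Raising on any value that does not fit the pattern keeps the guess honest: if other rows use a different layout, the parser fails loudly instead of returning misaligned fields.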
Value | Count | Frequency (%)
201062600000040045-20100706 | 1 | 0.8%
201062600000040028-20100622 | 1 | 0.8%
201062600000040031-20100622 | 1 | 0.8%
201062600000040032-20100622 | 1 | 0.8%
201062600000040033-20100622 | 1 | 0.8%
201062600000040020-20100611 | 1 | 0.8%
201062600000040018-20100611 | 1 | 0.8%
201062600000040088-20101019 | 1 | 0.8%
201062600000040087-20101019 | 1 | 0.8%
201062600000040084-20101019 | 1 | 0.8%
Other values (120) | 120 | 92.3%

Most occurring characters

Value | Count | Frequency (%)
0 | 1717 | 48.9%
2 | 484 | 13.8%
1 | 413 | 11.8%
6 | 356 | 10.1%
4 | 184 | 5.2%
- | 130 | 3.7%
5 | 62 | 1.8%
7 | 52 | 1.5%
9 | 44 | 1.3%
8 | 37 | 1.1%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 3380 | 96.3%
Dash Punctuation | 130 | 3.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
0 | 1717 | 50.8%
2 | 484 | 14.3%
1 | 413 | 12.2%
6 | 356 | 10.5%
4 | 184 | 5.4%
5 | 62 | 1.8%
7 | 52 | 1.5%
9 | 44 | 1.3%
8 | 37 | 1.1%
3 | 31 | 0.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 130 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 3510 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
0 | 1717 | 48.9%
2 | 484 | 13.8%
1 | 413 | 11.8%
6 | 356 | 10.1%
4 | 184 | 5.2%
- | 130 | 3.7%
5 | 62 | 1.8%
7 | 52 | 1.5%
9 | 44 | 1.3%
8 | 37 | 1.1%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 3510 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
0 | 1717 | 48.9%
2 | 484 | 13.8%
1 | 413 | 11.8%
6 | 356 | 10.1%
4 | 184 | 5.2%
- | 130 | 3.7%
5 | 62 | 1.8%
7 | 52 | 1.5%
9 | 44 | 1.3%
8 | 37 | 1.1%

인허가일자
Real number (ℝ)

Distinct: 44
Distinct (%): 33.8%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 20113542
Minimum: 20100603
Maximum: 20201126
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 20100603
5-th percentile: 20100604
Q1: 20100622
Median: 20100917
Q3: 20101025
95-th percentile: 20161076
Maximum: 20201126
Range: 100523
Interquartile range (IQR): 403

Descriptive statistics

Standard deviation: 24219.622
Coefficient of variation (CV): 0.0012041451
Kurtosis: 1.0792062
Mean: 20113542
Median Absolute Deviation (MAD): 295
Skewness: 1.5773915
Sum: 2.6147605 × 10^9
Variance: 5.8659011 × 10^8
Monotonicity: Not monotonic
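
The report profiles 인허가일자 as a real number, so every statistic above is computed on the raw YYYYMMDD integers rather than on calendar dates. The sketch below, which assumes the values are meant to be read as YYYYMMDD dates, shows two consequences using figures from the quantile table: the interquartile range of 403 is not a number of days, and the interpolated 95-th percentile is not a valid date at all. The helper names are illustrative.

```python
from datetime import date, datetime


def parse_yyyymmdd(value: int) -> date:
    """Read an integer such as 20100622 as a calendar date."""
    return datetime.strptime(str(value), "%Y%m%d").date()


def is_calendar_date(value: int) -> bool:
    """Return True only if the integer maps to a real calendar date."""
    try:
        parse_yyyymmdd(value)
    except ValueError:
        return False
    return True


q1 = parse_yyyymmdd(20100622)   # Q1 from the quantile table
q3 = parse_yyyymmdd(20101025)   # Q3 from the quantile table
iqr_days = (q3 - q1).days

print(iqr_days)                     # 125 days, versus the integer IQR of 403
print(is_calendar_date(20161076))   # False: month 10 has no day 76
```

Converting the column to a date type before profiling would make the mean, spread, and percentiles interpretable; the same caveat applies to the other date-like columns that the report treats as numbers.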
Histogram with fixed size bins (bins=44)

Common values
Value | Count | Frequency (%)
20100622 | 13 | 10.0%
20101011 | 12 | 9.2%
20101019 | 12 | 9.2%
20100611 | 11 | 8.5%
20100605 | 8 | 6.2%
20100706 | 7 | 5.4%
20100902 | 7 | 5.4%
20100604 | 7 | 5.4%
20101025 | 7 | 5.4%
20100811 | 4 | 3.1%
Other values (34) | 42 | 32.3%

Minimum 10 values
Value | Count | Frequency (%)
20100603 | 1 | 0.8%
20100604 | 7 | 5.4%
20100605 | 8 | 6.2%
20100611 | 11 | 8.5%
20100622 | 13 | 10.0%
20100706 | 7 | 5.4%
20100715 | 3 | 2.3%
20100803 | 3 | 2.3%
20100811 | 4 | 3.1%
20100902 | 7 | 5.4%

Maximum 10 values
Value | Count | Frequency (%)
20201126 | 1 | 0.8%
20170724 | 2 | 1.5%
20170411 | 1 | 0.8%
20170222 | 1 | 0.8%
20170102 | 1 | 0.8%
20161116 | 1 | 0.8%
20161028 | 1 | 0.8%
20160824 | 1 | 0.8%
20160725 | 1 | 0.8%
20160622 | 1 | 0.8%

인허가취소일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 130
Missing (%): 100.0%
Memory size: 1.3 KiB

Distinct: 2
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
3: 75
1: 55

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 1
2nd row: 1
3rd row: 1
4th row: 1
5th row: 1

Common Values

Value | Count | Frequency (%)
3 | 75 | 57.7%
1 | 55 | 42.3%

Length

Histogram of lengths of the category (figure omitted)

Common Values (Plot)

Plot of the counts in the Common Values table (figure omitted)

영업상태명
Categorical

Distinct: 2
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
폐업: 75
영업/정상: 55

Length

Max length: 5
Median length: 2
Mean length: 3.2692308
Min length: 2

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 영업/정상
2nd row: 영업/정상
3rd row: 영업/정상
4th row: 영업/정상
5th row: 영업/정상

Common Values

Value | Count | Frequency (%)
폐업 | 75 | 57.7%
영업/정상 | 55 | 42.3%

Length

Histogram of lengths of the category (figure omitted)

Common Values (Plot)

Plot of the counts in the Common Values table (figure omitted)

Distinct: 3
Distinct (%): 2.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
1: 75
0: 54
3: 1

Length

Max length: 1
Median length: 1
Mean length: 1
Min length: 1

Unique

Unique: 1
Unique (%): 0.8%

Sample

1st row: 0
2nd row: 0
3rd row: 0
4th row: 0
5th row: 0

Common Values

Value | Count | Frequency (%)
1 | 75 | 57.7%
0 | 54 | 41.5%
3 | 1 | 0.8%

Length

Histogram of lengths of the category (figure omitted)

Common Values (Plot)

Plot of the counts in the Common Values table (figure omitted)

Distinct: 3
Distinct (%): 2.3%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
폐지: 75
운영: 54
재개: 1

Length

Max length: 2
Median length: 2
Mean length: 2
Min length: 2

Unique

Unique: 1
Unique (%): 0.8%

Sample

1st row: 운영
2nd row: 운영
3rd row: 운영
4th row: 운영
5th row: 운영

Common Values

Value | Count | Frequency (%)
폐지 | 75 | 57.7%
운영 | 54 | 41.5%
재개 | 1 | 0.8%

Length

Histogram of lengths of the category (figure omitted)

Common Values (Plot)

Plot of the counts in the Common Values table (figure omitted)

폐업일자
Real number (ℝ)

MISSING 

Distinct: 61
Distinct (%): 83.6%
Missing: 57
Missing (%): 43.8%
Infinite: 0
Infinite (%): 0.0%
Mean: 20136302
Minimum: 20100803
Maximum: 20201126
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB

Quantile statistics

Minimum: 20100803
5-th percentile: 20101222
Q1: 20111006
Median: 20121023
Q3: 20160630
95-th percentile: 20180207
Maximum: 20201126
Range: 100323
Interquartile range (IQR): 49624

Descriptive statistics

Standard deviation: 26454.475
Coefficient of variation (CV): 0.0013137703
Kurtosis: -1.0983662
Mean: 20136302
Median Absolute Deviation (MAD): 19392
Skewness: 0.43263637
Sum: 1.46995 × 10^9
Variance: 6.9983926 × 10^8
Monotonicity: Not monotonic

Histogram with fixed size bins (bins=50)

Common values
Value | Count | Frequency (%)
20150731 | 7 | 5.4%
20140415 | 2 | 1.5%
20110119 | 2 | 1.5%
20170724 | 2 | 1.5%
20171012 | 2 | 1.5%
20161230 | 2 | 1.5%
20101222 | 2 | 1.5%
20150930 | 1 | 0.8%
20120705 | 1 | 0.8%
20151210 | 1 | 0.8%
Other values (51) | 51 | 39.2%
(Missing) | 57 | 43.8%

Minimum 10 values
Value | Count | Frequency (%)
20100803 | 1 | 0.8%
20101013 | 1 | 0.8%
20101119 | 1 | 0.8%
20101222 | 2 | 1.5%
20110119 | 2 | 1.5%
20110210 | 1 | 0.8%
20110214 | 1 | 0.8%
20110331 | 1 | 0.8%
20110412 | 1 | 0.8%
20110518 | 1 | 0.8%

Maximum 10 values
Value | Count | Frequency (%)
20201126 | 1 | 0.8%
20181129 | 1 | 0.8%
20180411 | 1 | 0.8%
20180209 | 1 | 0.8%
20180206 | 1 | 0.8%
20180110 | 1 | 0.8%
20171229 | 1 | 0.8%
20171112 | 1 | 0.8%
20171012 | 2 | 1.5%
20170918 | 1 | 0.8%

휴업시작일자
Unsupported

MISSING  REJECTED  UNSUPPORTED 

Missing: 130
Missing (%): 100.0%
Memory size: 1.3 KiB

휴업종료일자
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
<NA>: 129
20171231: 1

Length

Max length: 8
Median length: 4
Mean length: 4.0307692
Min length: 4

Unique

Unique: 1
Unique (%): 0.8%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 129 | 99.2%
20171231 | 1 | 0.8%
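
The 93.5% imbalance reported for this variable is consistent with a normalised-entropy score: one minus the Shannon entropy of the class frequencies divided by the maximum entropy for that number of classes. The sketch below assumes that definition; it reproduces the figure from the 129 / 1 split above, but the report itself does not state the formula, and `imbalance_score` is an illustrative name.

```python
import math
from typing import Sequence


def imbalance_score(counts: Sequence[int]) -> float:
    """1 - H / log2(k): 0 for a uniform split, approaching 1 when one class dominates."""
    k = len(counts)
    if k <= 1:
        return 0.0  # a single class has no imbalance to measure
    total = sum(counts)
    entropy = -sum((c / total) * math.log2(c / total) for c in counts if c > 0)
    return 1 - entropy / math.log2(k)


score = imbalance_score([129, 1])
print(f"{score:.1%}")  # 93.5%
```

A perfectly balanced two-class column would score 0%, so 93.5% here simply reflects that all but one row carries the same placeholder value.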

Length

Histogram of lengths of the category (figure omitted)

Common Values (Plot)

Plot of the counts in the Common Values table (figure omitted)

재개업일자
Categorical

IMBALANCE 

Distinct: 2
Distinct (%): 1.5%
Missing: 0
Missing (%): 0.0%
Memory size: 1.1 KiB
<NA>: 129
20160513: 1

Length

Max length: 8
Median length: 4
Mean length: 4.0307692
Min length: 4

Unique

Unique: 1
Unique (%): 0.8%

Sample

1st row: <NA>
2nd row: <NA>
3rd row: <NA>
4th row: <NA>
5th row: <NA>

Common Values

Value | Count | Frequency (%)
<NA> | 129 | 99.2%
20160513 | 1 | 0.8%
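
The report lists 0 missing values for this variable, yet 129 of the 130 entries are the literal text `<NA>`, which suggests a missing-value marker was stored as a string before profiling. The following is a minimal sketch of normalising such markers so that they count as missing. The placeholder set and the order of values in the example column are assumptions; only the 129 / 1 counts come from the table above.

```python
from typing import List, Optional

# Strings treated as "missing"; an assumed set, extend it as the data requires.
PLACEHOLDERS = {"<NA>", "", "nan", "NaN"}


def normalise_missing(values: List[str]) -> List[Optional[str]]:
    """Replace placeholder strings with None and keep real values unchanged."""
    return [None if v in PLACEHOLDERS else v for v in values]


# Reconstructed from the Common Values counts; row order is hypothetical.
column = ["<NA>"] * 129 + ["20160513"]
cleaned = normalise_missing(column)

missing = sum(v is None for v in cleaned)
print(f"{missing} of {len(cleaned)} missing ({missing / len(cleaned):.1%})")
```

With the markers normalised, the variable would be reported as 99.2% missing rather than as a highly imbalanced category.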

Length

Histogram of lengths of the category (figure omitted)

Common Values (Plot)

Plot of the counts in the Common Values table (figure omitted)

소재지전화
Real number (ℝ)

MISSING 

Distinct: 115
Distinct (%): 95.8%
Missing: 10
Missing (%): 7.7%
Infinite: 0
Infinite (%): 0.0%
Mean: 7.188821 × 10^8
Minimum: 5.120034 × 10^8
Maximum: 7.0872506 × 10^9
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 1.3 KiB
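
소재지전화 holds telephone numbers, but it was read as a real number, so any leading zero is gone and the mean and other numeric statistics carry no meaning. The sketch below restores a string form under the assumption that every original number began with exactly one leading zero, as Korean area codes do. The input value is hypothetical, chosen only to be of the same magnitude as the reported minimum, because the report shows rounded figures rather than exact digits.

```python
import math
from typing import Optional


def restore_phone(value: Optional[float]) -> Optional[str]:
    """Turn a phone number that was parsed as a float back into digits.

    Assumes exactly one leading zero was lost during numeric parsing.
    """
    if value is None or math.isnan(value):
        return None  # keep missing entries missing
    return "0" + str(int(value))


print(restore_phone(512003400.0))    # hypothetical input -> "0512003400"
print(restore_phone(float("nan")))   # None
```

Reading the column as text in the first place avoids the problem entirely and lets the profiler treat it as a text variable.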