Overview

Dataset statistics

Number of variables: 15
Number of observations: 306
Missing cells: 340
Missing cells (%): 7.4%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 37.5 KiB
Average record size in memory: 125.4 B
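
The missing-cell percentage follows from the three counts above. A minimal sketch, using only the figures printed in this report (it does not read the dataset itself):

```python
# Figures copied from the dataset statistics; nothing here touches the real data.
n_variables = 15
n_observations = 306
missing_cells = 340

# Every variable contributes one cell per observation.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

print(f"Missing cells: {missing_pct:.1f}% of {total_cells} cells")
```

The same division by 306 observations reproduces the per-column figures in the alerts, for example 289 missing values in 전화번호 is about 94.4%.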

Variable types

Numeric: 5
Categorical: 4
Text: 5
DateTime: 1

Dataset

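Description: Data on the placement status of waste-medicine collection boxes in Seo-gu, Daejeon Metropolitan City. Covers the Seo-gu Office, the administrative welfare centers of 23 dong, and the Seo-gu Public Health Center (including branch offices).
Author: Seo-gu, Daejeon Metropolitan City (대전광역시 서구)
URL: https://www.data.go.kr/data/15077806/fileData.do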

Alerts

데이터기준일자 has constant value "" (Constant)
비고 is highly overall correlated with 순번 and 7 other fields (High correlation)
행정동명 is highly overall correlated with 행정동코드 and 5 other fields (High correlation)
법정동명 is highly overall correlated with 행정동코드 and 5 other fields (High correlation)
수거장소구분명 is highly overall correlated with 순번 and 1 other fields (High correlation)
순번 is highly overall correlated with 수거장소구분명 and 1 other fields (High correlation)
행정동코드 is highly overall correlated with 법정동코드 and 3 other fields (High correlation)
법정동코드 is highly overall correlated with 행정동코드 and 3 other fields (High correlation)
위도 is highly overall correlated with 행정동명 and 2 other fields (High correlation)
경도 is highly overall correlated with 행정동명 and 2 other fields (High correlation)
수거장소구분명 is highly imbalanced (53.1%) (Imbalance)
비고 is highly imbalanced (55.9%) (Imbalance)
개설자명 has 45 (14.7%) missing values (Missing)
전화번호 has 289 (94.4%) missing values (Missing)
순번 has unique values (Unique)

Reproduction

Analysis started: 2024-03-15 02:21:14.367291
Analysis finished: 2024-03-15 02:21:23.994664
Duration: 9.63 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json
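
The reported duration can be checked from the two timestamps. A small sketch, assuming both are in the same time zone and use the format shown:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"  # format of the timestamps as printed in this report
started = datetime.strptime("2024-03-15 02:21:14.367291", FMT)
finished = datetime.strptime("2024-03-15 02:21:23.994664", FMT)

# Subtracting two datetimes gives a timedelta; convert it to seconds.
duration_s = (finished - started).total_seconds()
print(f"Duration: {duration_s:.2f} seconds")
```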

Variables

순번
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct: 306
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 153.5
Minimum: 1
Maximum: 306
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 2.8 KiB

Quantile statistics

Minimum: 1
5-th percentile: 16.25
Q1: 77.25
Median: 153.5
Q3: 229.75
95-th percentile: 290.75
Maximum: 306
Range: 305
Interquartile range (IQR): 152.5

Descriptive statistics

Standard deviation: 88.478811
Coefficient of variation (CV): 0.57640919
Kurtosis: -1.2
Mean: 153.5
Median Absolute Deviation (MAD): 76.5
Skewness: 0
Sum: 46971
Variance: 7828.5
Monotonicity: Strictly increasing
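
These figures are what a plain row counter would produce. The sketch below assumes 순번 is exactly the sequence 1, 2, ..., 306 (consistent with the minimum, maximum, distinct count and sum above, but not confirmed by the report) and recomputes several statistics with the standard library. The `percentile` helper is a hypothetical name and uses linear interpolation, which is an assumption about how the quantiles were derived.

```python
import statistics

values = list(range(1, 307))  # assumption: 순번 is exactly 1, 2, ..., 306

def percentile(sorted_values, q):
    """Linear-interpolation percentile for q in [0, 1] (hypothetical helper)."""
    pos = (len(sorted_values) - 1) * q
    lower = int(pos)
    upper = min(lower + 1, len(sorted_values) - 1)
    step = sorted_values[upper] - sorted_values[lower]
    return sorted_values[lower] + step * (pos - lower)

mean = statistics.mean(values)
variance = statistics.variance(values)  # sample variance (n - 1 denominator)
std = statistics.stdev(values)
cv = std / mean                         # coefficient of variation
median = statistics.median(values)
mad = statistics.median([abs(v - median) for v in values])
q1 = percentile(values, 0.25)
q3 = percentile(values, 0.75)

print(sum(values), mean, variance, round(std, 6), round(cv, 8), mad, q1, q3)
```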
[Figure: Histogram with fixed size bins (bins=50)]

Value               Count  Frequency (%)
1                   1      0.3%
203                 1      0.3%
210                 1      0.3%
209                 1      0.3%
208                 1      0.3%
207                 1      0.3%
206                 1      0.3%
205                 1      0.3%
204                 1      0.3%
202                 1      0.3%
Other values (296)  296    96.7%

Value  Count  Frequency (%)
1      1      0.3%
2      1      0.3%
3      1      0.3%
4      1      0.3%
5      1      0.3%
6      1      0.3%
7      1      0.3%
8      1      0.3%
9      1      0.3%
10     1      0.3%

Value  Count  Frequency (%)
306    1      0.3%
305    1      0.3%
304    1      0.3%
303    1      0.3%
302    1      0.3%
301    1      0.3%
300    1      0.3%
299    1      0.3%
298    1      0.3%
297    1      0.3%

수거장소구분명
Categorical

HIGH CORRELATION  IMBALANCE 

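Distinct: 3
Distinct (%): 1.0%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Value (English gloss): Count
약국 (pharmacy): 261
행정복지센터,보건소 (administrative welfare center, public health center): 28
복지관,성당,체육시설,기타 (welfare center, Catholic church, sports facility, other): 17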

Length

Max length: 14
Median length: 2
Mean length: 3.3986928
Min length: 2
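
The length statistics can be recovered from the three category values and their counts. A minimal sketch, assuming length is measured in Unicode code points (which is what Python's `len` counts) and using the counts listed for this variable:

```python
from statistics import mean, median

# Category values and counts as listed for 수거장소구분명 in this report.
counts = {
    "약국": 261,
    "행정복지센터,보건소": 28,
    "복지관,성당,체육시설,기타": 17,
}

# One length per observation: repeat each value's length by its count.
lengths = [len(value) for value, n in counts.items() for _ in range(n)]

max_len, min_len = max(lengths), min(lengths)
median_len = median(lengths)
mean_len = mean(lengths)
print(max_len, median_len, round(mean_len, 7), min_len)
```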

Unique

Unique: 0
Unique (%): 0.0%

Sample

1st row: 약국
2nd row: 약국
3rd row: 약국
4th row: 약국
5th row: 약국

Common Values

Value                       Count  Frequency (%)
약국                          261    85.3%
행정복지센터,보건소                28     9.2%
복지관,성당,체육시설,기타            17     5.6%
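
The 53.1% imbalance figure in the alerts is consistent with a score of one minus the normalized Shannon entropy of these counts. The sketch below assumes that definition; the report itself does not state the formula, so treat it as a plausible reconstruction rather than the tool's documented method.

```python
import math

# Counts for 약국; 행정복지센터,보건소; 복지관,성당,체육시설,기타.
counts = [261, 28, 17]
total = sum(counts)

# Shannon entropy of the class distribution, in bits.
entropy = -sum((c / total) * math.log2(c / total) for c in counts)

# Assumed definition: 1 - entropy / maximum entropy for this number of classes.
# 0 means perfectly balanced classes, 1 means a single class holds everything.
imbalance = 1 - entropy / math.log2(len(counts))
print(f"Imbalance: {imbalance:.1%}")
```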

Length

[Figure: Histogram of lengths of the category]

Common Values (Plot)

[Figure: plot of the three category counts listed under Common Values]
Distinct: 297
Distinct (%): 97.1%
Missing: 0
Missing (%): 0.0%
Memory size: 2.5 KiB

Length

Max length: 10
Median length: 9
Mean length: 5.624183
Min length: 3

Characters and Unicode

Total characters: 1721
Distinct characters: 232
Distinct categories: 3
Distinct scripts: 2
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 289
Unique (%): 94.4%

Sample

1st row: 건강약국
2nd row: 가장약국
3rd row: 가장태평양약국
4th row: 갈마드림약국
5th row: 갈마약국
Value               Count  Frequency (%)
성당                  7      2.2%
탄방우리약국              3      1.0%
누리약국                2      0.6%
더좋은약국               2      0.6%
메디팜우리약국             2      0.6%
연합약국                2      0.6%
새봄약국                2      0.6%
선사프라임약국             2      0.6%
탄방대한약국              2      0.6%
이화약국                1      0.3%
Other values (289)  289    92.0%

Most occurring characters

Value               Count  Frequency (%)
                    263    15.3%
                    262    15.2%
                    44     2.6%
                    40     2.3%
                    37     2.1%
                    36     2.1%
                    36     2.1%
                    28     1.6%
                    25     1.5%
                    24     1.4%
Other values (222)  926    53.8%

Most occurring categories

Value            Count  Frequency (%)
Other Letter     1693   98.4%
Decimal Number   20     1.2%
Space Separator  8      0.5%

Most frequent character per category

Other Letter
Value               Count  Frequency (%)
                    263    15.5%
                    262    15.5%
                    44     2.6%
                    40     2.4%
                    37     2.2%
                    36     2.1%
                    36     2.1%
                    28     1.7%
                    25     1.5%
                    24     1.4%
Other values (216)  898    53.0%

Decimal Number
Value  Count  Frequency (%)
2      6      30.0%
1      6      30.0%
3      4      20.0%
5      2      10.0%
6      2      10.0%

Space Separator
Value    Count  Frequency (%)
(space)  8      100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  1693   98.4%
Common  28     1.6%

Most frequent character per script

Hangul
Value               Count  Frequency (%)
                    263    15.5%
                    262    15.5%
                    44     2.6%
                    40     2.4%
                    37     2.2%
                    36     2.1%
                    36     2.1%
                    28     1.7%
                    25     1.5%
                    24     1.4%
Other values (216)  898    53.0%

Common
Value    Count  Frequency (%)
(space)  8      28.6%
2        6      21.4%
1        6      21.4%
3        4      14.3%
5        2      7.1%
6        2      7.1%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  1693   98.4%
ASCII   28     1.6%

Most frequent character per block

Hangul
Value               Count  Frequency (%)
                    263    15.5%
                    262    15.5%
                    44     2.6%
                    40     2.4%
                    37     2.2%
                    36     2.1%
                    36     2.1%
                    28     1.7%
                    25     1.5%
                    24     1.4%
Other values (216)  898    53.0%

ASCII
Value    Count  Frequency (%)
(space)  8      28.6%
2        6      21.4%
1        6      21.4%
3        4      14.3%
5        2      7.1%
6        2      7.1%
Distinct: 284
Distinct (%): 93.1%
Missing: 1
Missing (%): 0.3%
Memory size: 2.5 KiB

Length

Max length: 29
Median length: 28
Mean length: 17.508197
Min length: 14

Characters and Unicode

Total characters: 5340
Distinct characters: 109
Distinct categories: 4
Distinct scripts: 2
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 265
Unique (%): 86.9%

Sample

1st row: 대전광역시 서구 가수원동 765-6
2nd row: 대전광역시 서구 가장동 45-9 가장크리닉
3rd row: 대전광역시 서구 가장동 32-23 온누리크리닉
4th row: 대전광역시 서구 갈마동 393-13
5th row: 대전광역시 서구 갈마동 261-14
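
The four character categories reported for this variable can be seen directly in the first sample row. A minimal sketch using Python's standard `unicodedata` module; it classifies only that one string, not the full column:

```python
import unicodedata
from collections import Counter

sample = "대전광역시 서구 가수원동 765-6"  # first sample row of this variable

# General category of each code point:
# Lo = Other Letter, Nd = Decimal Number, Zs = Space Separator, Pd = Dash Punctuation.
categories = Counter(unicodedata.category(ch) for ch in sample)

for code, n in categories.most_common():
    print(code, n)
```

Summing such counts over every row would give totals of the kind listed under the character category tables for this variable.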
Value               Count  Frequency (%)
대전광역시               305    24.6%
서구                  305    24.6%
둔산동                 80     6.4%
탄방동                 38     3.1%
관저동                 38     3.1%
도마동                 27     2.2%
월평동                 26     2.1%
갈마동                 16     1.3%
괴정동                 13     1.0%
정림동                 10     0.8%
Other values (307)  384    30.9%

Most occurring characters

Value              Count  Frequency (%)
(space)            939    17.6%
                   306    5.7%
                   306    5.7%
                   306    5.7%
                   305    5.7%
                   305    5.7%
                   305    5.7%
                   305    5.7%
                   305    5.7%
1                  280    5.2%
Other values (99)  1678   31.4%

Most occurring categories

Value             Count  Frequency (%)
Other Letter      3145   58.9%
Decimal Number    1144   21.4%
Space Separator   939    17.6%
Dash Punctuation  112    2.1%

Most frequent character per category

Other Letter
Value              Count  Frequency (%)
                   306    9.7%
                   306    9.7%
                   306    9.7%
                   305    9.7%
                   305    9.7%
                   305    9.7%
                   305    9.7%
                   305    9.7%
                   83     2.6%
                   81     2.6%
Other values (87)  538    17.1%

Decimal Number
Value  Count  Frequency (%)
1      280    24.5%
2      116    10.1%
3      107    9.4%
4      104    9.1%
9      101    8.8%
8      97     8.5%
5      97     8.5%
0      94     8.2%
7      75     6.6%
6      73     6.4%

Space Separator
Value    Count  Frequency (%)
(space)  939    100.0%

Dash Punctuation
Value  Count  Frequency (%)
-      112    100.0%

Most occurring scripts

Value   Count  Frequency (%)
Hangul  3145   58.9%
Common  2195   41.1%

Most frequent character per script

Hangul
Value              Count  Frequency (%)
                   306    9.7%
                   306    9.7%
                   306    9.7%
                   305    9.7%
                   305    9.7%
                   305    9.7%
                   305    9.7%
                   305    9.7%
                   83     2.6%
                   81     2.6%
Other values (87)  538    17.1%

Common
Value             Count  Frequency (%)
(space)           939    42.8%
1                 280    12.8%
2                 116    5.3%
-                 112    5.1%
3                 107    4.9%
4                 104    4.7%
9                 101    4.6%
8                 97     4.4%
5                 97     4.4%
0                 94     4.3%
Other values (2)  148    6.7%

Most occurring blocks

Value   Count  Frequency (%)
Hangul  3145   58.9%
ASCII   2195   41.1%

Most frequent character per block

ASCII
Value             Count  Frequency (%)
(space)           939    42.8%
1                 280    12.8%
2                 116    5.3%
-                 112    5.1%
3                 107    4.9%
4                 104    4.7%
9                 101    4.6%
8                 97     4.4%
5                 97     4.4%
0                 94     4.3%
Other values (2)  148    6.7%

Hangul
Value              Count  Frequency (%)
                   306    9.7%
                   306    9.7%
                   306    9.7%
                   305    9.7%
                   305    9.7%
                   305    9.7%
                   305    9.7%
                   305    9.7%
                   83     2.6%
                   81     2.6%
Other values (87)  538    17.1%
Distinct: 284
Distinct (%): 93.1%
Missing: 1
Missing (%): 0.3%
Memory size: 2.5 KiB

Length

Max length: 30
Median length: 27
Mean length: 17.898361
Min length: 14

Characters and Unicode

Total characters: 5459
Distinct characters: 94
Distinct categories: 7
Distinct scripts: 2
Distinct blocks: 2

The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 265
Unique (%): 86.9%

Sample

1st row: 대전광역시 서구 계백로 1166-12(가수원동)
2nd row