Overview

Dataset statistics

Number of variables6
Number of observations2451
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory117.4 KiB
Average record size in memory49.1 B

Variable types

Text2
Categorical3
Numeric1

Dataset

Description전라북도 정읍시 소재한 가축사육업 현황중 (농장명, 축종, 사육수, 소재지, 담당부서)등의 정보를 제공합니다.
Author전라북도
URLhttps://www.bigdatahub.go.kr/index.jeonbuk?startPage=3&menuCd=DOM_000000103007001000&pListTypeStr=&pId=15034191

Alerts

담당부서 has constant value ""Constant
데이터기준일자 has constant value ""Constant
축종 is highly imbalanced (63.5%)Imbalance
사육수 has 50 (2.0%) zerosZeros

Reproduction

Analysis started2024-03-14 00:48:01.665085
Analysis finished2024-03-14 00:48:02.201912
Duration0.54 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

Distinct1958
Distinct (%)79.9%
Missing0
Missing (%)0.0%
Memory size19.3 KiB
2024-03-14T09:48:02.415907image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length17
Median length4
Mean length4.3986128
Min length2

Characters and Unicode

Total characters10781
Distinct characters409
Distinct categories10 ?
Distinct scripts3 ?
Distinct blocks3 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique1637 ?
Unique (%)66.8%

Sample

1st row구면농장
2nd row삼영농장
3rd row전원농장
4th row만수양계장
5th row방장목장
ValueCountFrequency (%)
농장 17
 
0.7%
우리농장 10
 
0.4%
대성농장 8
 
0.3%
신성농장 7
 
0.3%
희망농장 7
 
0.3%
형제농장 7
 
0.3%
영철농장 6
 
0.2%
하늘농장 6
 
0.2%
영원농장 6
 
0.2%
동진농장 6
 
0.2%
Other values (1956) 2428
96.8%
2024-03-14T09:48:02.776420image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
2341
21.7%
2172
20.1%
180
 
1.7%
173
 
1.6%
164
 
1.5%
133
 
1.2%
124
 
1.2%
122
 
1.1%
116
 
1.1%
115
 
1.1%
Other values (399) 5141
47.7%

Most occurring categories

ValueCountFrequency (%)
Other Letter 10441
96.8%
Decimal Number 141
 
1.3%
Space Separator 57
 
0.5%
Open Punctuation 53
 
0.5%
Close Punctuation 53
 
0.5%
Uppercase Letter 29
 
0.3%
Lowercase Letter 3
 
< 0.1%
Other Punctuation 2
 
< 0.1%
Dash Punctuation 1
 
< 0.1%
Letter Number 1
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
2341
22.4%
2172
20.8%
180
 
1.7%
173
 
1.7%
164
 
1.6%
133
 
1.3%
124
 
1.2%
122
 
1.2%
116
 
1.1%
115
 
1.1%
Other values (376) 4801
46.0%
Uppercase Letter
ValueCountFrequency (%)
O 4
13.8%
A 4
13.8%
M 4
13.8%
C 3
10.3%
K 3
10.3%
E 2
6.9%
R 2
6.9%
D 2
6.9%
B 2
6.9%
G 2
6.9%
Decimal Number
ValueCountFrequency (%)
2 100
70.9%
1 33
 
23.4%
3 8
 
5.7%
Lowercase Letter
ValueCountFrequency (%)
k 1
33.3%
c 1
33.3%
u 1
33.3%
Space Separator
ValueCountFrequency (%)
57
100.0%
Open Punctuation
ValueCountFrequency (%)
( 53
100.0%
Close Punctuation
ValueCountFrequency (%)
) 53
100.0%
Other Punctuation
ValueCountFrequency (%)
. 2
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 1
100.0%
Letter Number
ValueCountFrequency (%)
1
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 10441
96.8%
Common 307
 
2.8%
Latin 33
 
0.3%

Most frequent character per script

Hangul
ValueCountFrequency (%)
2341
22.4%
2172
20.8%
180
 
1.7%
173
 
1.7%
164
 
1.6%
133
 
1.3%
124
 
1.2%
122
 
1.2%
116
 
1.1%
115
 
1.1%
Other values (376) 4801
46.0%
Latin
ValueCountFrequency (%)
O 4
12.1%
A 4
12.1%
M 4
12.1%
C 3
9.1%
K 3
9.1%
E 2
 
6.1%
R 2
 
6.1%
D 2
 
6.1%
B 2
 
6.1%
G 2
 
6.1%
Other values (5) 5
15.2%
Common
ValueCountFrequency (%)
2 100
32.6%
57
18.6%
( 53
17.3%
) 53
17.3%
1 33
 
10.7%
3 8
 
2.6%
. 2
 
0.7%
- 1
 
0.3%

Most occurring blocks

ValueCountFrequency (%)
Hangul 10441
96.8%
ASCII 339
 
3.1%
Number Forms 1
 
< 0.1%

Most frequent character per block

Hangul
ValueCountFrequency (%)
2341
22.4%
2172
20.8%
180
 
1.7%
173
 
1.7%
164
 
1.6%
133
 
1.3%
124
 
1.2%
122
 
1.2%
116
 
1.1%
115
 
1.1%
Other values (376) 4801
46.0%
ASCII
ValueCountFrequency (%)
2 100
29.5%
57
16.8%
( 53
15.6%
) 53
15.6%
1 33
 
9.7%
3 8
 
2.4%
O 4
 
1.2%
A 4
 
1.2%
M 4
 
1.2%
C 3
 
0.9%
Other values (12) 20
 
5.9%
Number Forms
ValueCountFrequency (%)
1
100.0%

축종
Categorical

IMBALANCE 

Distinct12
Distinct (%)0.5%
Missing0
Missing (%)0.0%
Memory size19.3 KiB
한우
1916 
육계
 
168
돼지
 
150
젖소
 
85
오리
 
55
Other values (7)
 
77

Length

Max length6
Median length2
Mean length2.0150959
Min length2

Unique

Unique2 ?
Unique (%)0.1%

Sample

1st row돼지
2nd row종계/산란계
3rd row육계
4th row육계
5th row젖소

Common Values

ValueCountFrequency (%)
한우 1916
78.2%
육계 168
 
6.9%
돼지 150
 
6.1%
젖소 85
 
3.5%
오리 55
 
2.2%
염소 30
 
1.2%
육우 16
 
0.7%
산양 16
 
0.7%
종계/산란계 9
 
0.4%
타조 4
 
0.2%
Other values (2) 2
 
0.1%

Length

2024-03-14T09:48:02.889768image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category
ValueCountFrequency (%)
한우 1916
78.2%
육계 168
 
6.9%
돼지 150
 
6.1%
젖소 85
 
3.5%
오리 55
 
2.2%
염소 30
 
1.2%
육우 16
 
0.7%
산양 16
 
0.7%
종계/산란계 9
 
0.4%
타조 4
 
0.2%
Other values (2) 2
 
0.1%

사육수
Real number (ℝ)

ZEROS 

Distinct334
Distinct (%)13.6%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean4431.9372
Minimum0
Maximum290000
Zeros50
Zeros (%)2.0%
Negative0
Negative (%)0.0%
Memory size21.7 KiB
2024-03-14T09:48:02.993256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum0
5-th percentile3
Q111
median36
Q3118
95-th percentile35000
Maximum290000
Range290000
Interquartile range (IQR)107

Descriptive statistics

Standard deviation17585.561
Coefficient of variation (CV)3.9679175
Kurtosis71.456843
Mean4431.9372
Median Absolute Deviation (MAD)29
Skewness6.8694164
Sum10862678
Variance3.0925196 × 108
MonotonicityNot monotonic
2024-03-14T09:48:03.101442image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
10 105
 
4.3%
20 81
 
3.3%
5 80
 
3.3%
30 76
 
3.1%
50 66
 
2.7%
4 61
 
2.5%
15 53
 
2.2%
7 53
 
2.2%
6 51
 
2.1%
3 51
 
2.1%
Other values (324) 1774
72.4%
ValueCountFrequency (%)
0 50
2.0%
1 9
 
0.4%
2 43
1.8%
3 51
2.1%
4 61
2.5%
5 80
3.3%
6 51
2.1%
7 53
2.2%
8 48
2.0%
9 45
1.8%
ValueCountFrequency (%)
290000 1
 
< 0.1%
270000 1
 
< 0.1%
200000 2
0.1%
145000 1
 
< 0.1%
130000 1
 
< 0.1%
110000 4
0.2%
105000 1
 
< 0.1%
100000 2
0.1%
95000 2
0.1%
92250 1
 
< 0.1%
Distinct2389
Distinct (%)97.5%
Missing0
Missing (%)0.0%
Memory size19.3 KiB
2024-03-14T09:48:03.393950image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length83
Median length71
Mean length27.575683
Min length4

Characters and Unicode

Total characters67588
Distinct characters168
Distinct categories7 ?
Distinct scripts2 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique2332 ?
Unique (%)95.1%

Sample

1st row전라북도 정읍시 입암면 신면리 791번지 4호
2nd row전라북도 정읍시 고부면 장문리 195번지 3호
3rd row전라북도 정읍시 입암면 연월리 115번지 2호
4th row전라북도 정읍시 고부면 만수리 60번지 3호
5th row전라북도 정읍시 입암면 연월리 507번지 1호 외 2필지,507-2
ValueCountFrequency (%)
전라북도 2448
 
16.2%
정읍시 2448
 
16.2%
642
 
4.3%
1호 504
 
3.3%
1필지 308
 
2.0%
산외면 221
 
1.5%
덕천면 216
 
1.4%
태인면 212
 
1.4%
정우면 205
 
1.4%
이평면 189
 
1.3%
Other values (1665) 7674
50.9%
2024-03-14T09:48:03.924362image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
16702
24.7%
3140
 
4.6%
2796
 
4.1%
2619
 
3.9%
2531
 
3.7%
2528
 
3.7%
1 2476
 
3.7%
2472
 
3.7%
2462
 
3.6%
2448
 
3.6%
Other values (158) 27414
40.6%

Most occurring categories

ValueCountFrequency (%)
Other Letter 39740
58.8%
Space Separator 16702
24.7%
Decimal Number 10683
 
15.8%
Dash Punctuation 221
 
0.3%
Other Punctuation 215
 
0.3%
Open Punctuation 14
 
< 0.1%
Close Punctuation 13
 
< 0.1%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
3140
 
7.9%
2796
 
7.0%
2619
 
6.6%
2531
 
6.4%
2528
 
6.4%
2472
 
6.2%
2462
 
6.2%
2448
 
6.2%
2423
 
6.1%
2331
 
5.9%
Other values (143) 13990
35.2%
Decimal Number
ValueCountFrequency (%)
1 2476
23.2%
2 1316
12.3%
3 1099
10.3%
4 1011
9.5%
5 955
 
8.9%
6 889
 
8.3%
8 830
 
7.8%
7 795
 
7.4%
0 663
 
6.2%
9 649
 
6.1%
Space Separator
ValueCountFrequency (%)
16702
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 221
100.0%
Other Punctuation
ValueCountFrequency (%)
, 215
100.0%
Open Punctuation
ValueCountFrequency (%)
( 14
100.0%
Close Punctuation
ValueCountFrequency (%)
) 13
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 39740
58.8%
Common 27848
41.2%

Most frequent character per script

Hangul
ValueCountFrequency (%)
3140
 
7.9%
2796
 
7.0%
2619
 
6.6%
2531
 
6.4%
2528
 
6.4%
2472
 
6.2%
2462
 
6.2%
2448
 
6.2%
2423
 
6.1%
2331
 
5.9%
Other values (143) 13990
35.2%
Common
ValueCountFrequency (%)
16702
60.0%
1 2476
 
8.9%
2 1316
 
4.7%
3 1099
 
3.9%
4 1011
 
3.6%
5 955
 
3.4%
6 889
 
3.2%
8 830
 
3.0%
7 795
 
2.9%
0 663
 
2.4%
Other values (5) 1112
 
4.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 39740
58.8%
ASCII 27848
41.2%

Most frequent character per block

ASCII
ValueCountFrequency (%)
16702
60.0%
1 2476
 
8.9%
2 1316
 
4.7%
3 1099
 
3.9%
4 1011
 
3.6%
5 955
 
3.4%
6 889
 
3.2%
8 830
 
3.0%
7 795
 
2.9%
0 663
 
2.4%
Other values (5) 1112
 
4.0%
Hangul
ValueCountFrequency (%)
3140
 
7.9%
2796
 
7.0%
2619
 
6.6%
2531
 
6.4%
2528
 
6.4%
2472
 
6.2%
2462
 
6.2%
2448
 
6.2%
2423
 
6.1%
2331
 
5.9%
Other values (143) 13990
35.2%

담당부서
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.3 KiB
정읍시 축산과
2451 

Length

Max length7
Median length7
Mean length7
Min length7

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row정읍시 축산과
2nd row정읍시 축산과
3rd row정읍시 축산과
4th row정읍시 축산과
5th row정읍시 축산과

Common Values

ValueCountFrequency (%)
정읍시 축산과 2451
100.0%

Length

2024-03-14T09:48:04.042813image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T09:48:04.108591image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
정읍시 2451
50.0%
축산과 2451
50.0%

데이터기준일자
Categorical

CONSTANT 

Distinct1
Distinct (%)< 0.1%
Missing0
Missing (%)0.0%
Memory size19.3 KiB
2021-12-20
2451 

Length

Max length10
Median length10
Mean length10
Min length10

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row2021-12-20
2nd row2021-12-20
3rd row2021-12-20
4th row2021-12-20
5th row2021-12-20

Common Values

ValueCountFrequency (%)
2021-12-20 2451
100.0%

Length

2024-03-14T09:48:04.176415image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-03-14T09:48:04.258864image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
2021-12-20 2451
100.0%

Interactions

2024-03-14T09:48:01.987464image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/