Overview

Dataset statistics

Number of variables: 15
Number of observations: 500
Missing cells: 658
Missing cells (%): 8.8%
Duplicate rows: 0
Duplicate rows (%): 0.0%
Total size in memory: 63.1 KiB
Average record size in memory: 129.3 B

Variable types

Numeric: 8
Text: 6
Categorical: 1

Dataset

Description: Sample data
Author: 빅밸류
URL: https://bigdata.seoul.go.kr/data/selectSampleData.do?sample_data_seq=325

Alerts

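Constant: 대지구분(DAEJI) [land category] has constant value ""
Missing: 건물이름(BLDNAME) [building name] has 248 (49.6%) missing values
Missing: 건물(동)이름(DONGNAME) [building block (dong) name] has 379 (75.8%) missing values
Missing: 호_이름(HONAME) [unit (ho) name] has 31 (6.2%) missing values
Unique: 전유부_키코드(PKCODE2) [exclusive-use section key code] has unique values
Zeros: 부번(BUNJI2) [sub-lot number] has 24 (4.8%) zeros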
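
The missing-value alerts can be reconciled with the "Missing cells" figures in the dataset statistics. The sketch below is only an arithmetic check; it assumes the three columns flagged above are the only columns with missing values, which the report does not state explicitly but which the totals are consistent with.

```python
# Per-column missing counts, copied from the alerts above.
missing_by_column = {"BLDNAME": 248, "DONGNAME": 379, "HONAME": 31}
n_rows, n_cols = 500, 15  # observations and variables from the overview

total_missing = sum(missing_by_column.values())
total_cells = n_rows * n_cols
missing_pct = 100 * total_missing / total_cells

# Per-column share of missing rows, then the dataset-wide share of missing cells.
for name, count in missing_by_column.items():
    print(f"{name}: {100 * count / n_rows:.1f}% missing")
print(f"total: {total_missing} of {total_cells} cells ({missing_pct:.1f}%)")
```

Under that assumption the per-column counts add up to the 658 missing cells reported in the overview.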

Reproduction

Analysis started: 2023-12-10 15:05:16.058367
Analysis finished: 2023-12-10 15:05:32.111487
Duration: 16.05 seconds
Software version: ydata-profiling v4.5.1
Download configuration: config.json

Variables

PNU코드(PNU) [PNU code]
Real number (ℝ)

Distinct: 498
Distinct (%): 99.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 1.1465548 × 10^18
Minimum: 1.1110109 × 10^18
Maximum: 1.1740109 × 10^18
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.5 KiB

Quantile statistics

Minimum: 1.1110109 × 10^18
5-th percentile: 1.117013 × 10^18
Q1: 1.1305103 × 10^18
Median: 1.1500102 × 10^18
Q3: 1.1620101 × 10^18
95-th percentile: 1.1710114 × 10^18
Maximum: 1.1740109 × 10^18
Range: 6.3 × 10^16
Interquartile range (IQR): 3.14998 × 10^16

Descriptive statistics

Standard deviation: 1.7663572 × 10^16
Coefficient of variation (CV): 0.015405781
Kurtosis: -1.0758781
Mean: 1.1465548 × 10^18
Median Absolute Deviation (MAD): 1.499975 × 10^16
Skewness: -0.14383761
Sum: 1.4283498 × 10^18
Variance: 3.1200179 × 10^32
Monotonicity: Not monotonic
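
The reported Sum is smaller than 500 times the reported Mean, which is worth a sanity check. The sketch below tests one possible explanation, signed 64-bit integer wraparound during summation. This is an assumption, not something the report states; `wrap_int64` is a hypothetical helper, and the inputs are the rounded figures printed above.

```python
# Figures copied from the PNU statistics above (all rounded by the report).
N_ROWS = 500
REPORTED_MEAN = 1_146_554_800_000_000_000   # 1.1465548 × 10^18
REPORTED_SUM = 1_428_349_800_000_000_000    # 1.4283498 × 10^18

def wrap_int64(value: int) -> int:
    """Return the value a signed 64-bit integer would hold after wraparound."""
    return (value + 2**63) % 2**64 - 2**63

# Python integers are arbitrary precision, so this product is exact (about 5.73 × 10^20).
expected_sum = N_ROWS * REPORTED_MEAN
wrapped_sum = wrap_int64(expected_sum)

# The mean is rounded to 8 significant digits; allow for that error times N_ROWS.
tolerance = N_ROWS * 5 * 10**10

print("exceeds int64 range:", expected_sum > 2**63 - 1)
print("wrapped sum matches report:", abs(wrapped_sum - REPORTED_SUM) <= tolerance)
```

Both checks pass with the rounded inputs, so the Sum row is consistent with an overflowed integer total. Treat it with caution and recompute it with arbitrary-precision integers if the true total matters.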
Histogram with fixed size bins (bins=50)
Common values
Value | Count | Frequency (%)
1165010200103670000 | 2 | 0.4%
1123010600103210010 | 2 | 0.4%
1126010300104560000 | 1 | 0.2%
1171010500101720016 | 1 | 0.2%
1138010400104010009 | 1 | 0.2%
1150010300100610034 | 1 | 0.2%
1150010600107000015 | 1 | 0.2%
1171011200101160013 | 1 | 0.2%
1129013800102190257 | 1 | 0.2%
1171011300102920005 | 1 | 0.2%
Other values (488) | 488 | 97.6%

Minimum 10 values
Value | Count | Frequency (%)
1111010900100170000 | 1 | 0.2%
1111010900101000001 | 1 | 0.2%
1111010900101660037 | 1 | 0.2%
1111010900101660251 | 1 | 0.2%
1111011500102620022 | 1 | 0.2%
1111017300100230000 | 1 | 0.2%
1111018200101100018 | 1 | 0.2%
1111018200101390021 | 1 | 0.2%
1111018300102930046 | 1 | 0.2%
1111018600101360013 | 1 | 0.2%

Maximum 10 values
Value | Count | Frequency (%)
1174010900105610000 | 1 | 0.2%
1174010900103620038 | 1 | 0.2%
1174010900103310002 | 1 | 0.2%
1174010900103170010 | 1 | 0.2%
1174010900103150008 | 1 | 0.2%
1174010900103100017 | 1 | 0.2%
1174010900103090016 | 1 | 0.2%
1174010900101850035 | 1 | 0.2%
1174010900101670117 | 1 | 0.2%
1174010900100900002 | 1 | 0.2%
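
PNU values are 19-digit identifiers rather than quantities, so treating them as real numbers is lossy. The sketch below shows one way to handle them as strings. It assumes the commonly documented PNU layout (10-digit legal-dong code, 1-digit land type, 4-digit main lot number, 4-digit sub lot number), which this report does not itself describe; `Pnu` and `parse_pnu` are hypothetical names.

```python
from typing import NamedTuple

class Pnu(NamedTuple):
    legal_dong_code: str  # assumed: first 10 digits, administrative area code
    land_type: str        # assumed: 1 digit
    main_number: int      # assumed: 4 digits, main lot number
    sub_number: int       # assumed: 4 digits, sub lot number

def parse_pnu(pnu: str) -> Pnu:
    """Split a 19-digit PNU string into its assumed components."""
    if len(pnu) != 19 or not pnu.isdigit():
        raise ValueError(f"expected 19 digits, got {pnu!r}")
    return Pnu(pnu[:10], pnu[10], int(pnu[11:15]), int(pnu[15:19]))

# A value taken from the common-values table above.
raw = "1129013800102190257"
parsed = parse_pnu(raw)
print(parsed)

# float64 has a 53-bit significand, so a 19-digit code does not survive a round trip.
round_trip = int(float(int(raw)))
print("float round trip preserved the code:", round_trip == int(raw))
```

Reading the column as a string (or an integer type that is never cast to float) keeps every digit intact, and statistics such as mean or skewness can then be skipped for this variable.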

기준년월(KEYMONTH) [reference year-month]
Real number (ℝ)

Distinct: 18
Distinct (%): 3.6%
Missing: 0
Missing (%): 0.0%
Infinite: 0
Infinite (%): 0.0%
Mean: 202034.46
Minimum: 201912
Maximum: 202105
Zeros: 0
Zeros (%): 0.0%
Negative: 0
Negative (%): 0.0%
Memory size: 4.5 KiB

Quantile statistics

Minimum: 201912
5-th percentile: 202001
Q1: 202005
Median: 202009
Q3: 202101
95-th percentile: 202104.05
Maximum: 202105
Range: 193
Interquartile range (IQR): 96

Descriptive statistics

Standard deviation: 44.245084
Coefficient of variation (CV): 0.00021899771
Kurtosis: -1.0839686
Mean: 202034.46
Median Absolute Deviation (MAD): 5
Skewness: 0.85494202
Sum: 1.0101723 × 10^8
Variance: 1957.6275
Monotonicity: Not monotonic
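
KEYMONTH holds YYYYMM codes, so the numeric Range and IQR above mix year and month digits and do not measure elapsed time. As a rough sketch, assuming every value is a valid YYYYMM integer, the codes can be mapped to a running month count before differencing; `month_index` is a hypothetical helper.

```python
def month_index(yyyymm: int) -> int:
    """Convert a YYYYMM code to a count of months since year 0."""
    year, month = divmod(yyyymm, 100)
    if not 1 <= month <= 12:
        raise ValueError(f"not a valid YYYYMM code: {yyyymm}")
    return year * 12 + (month - 1)

# Minimum, Q1, Q3 and maximum as reported above.
minimum, q1, q3, maximum = 201912, 202005, 202101, 202105

numeric_range = maximum - minimum                           # what the report calls Range
months_spanned = month_index(maximum) - month_index(minimum)
iqr_in_months = month_index(q3) - month_index(q1)

print(f"numeric range: {numeric_range}, elapsed months: {months_spanned}")
print(f"numeric IQR: {q3 - q1}, IQR in months: {iqr_in_months}")
```

The span of 17 elapsed months agrees with the 18 distinct values reported for this variable, whereas the numeric range of 193 has no calendar meaning.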
Histogram with fixed size bins (bins=18)
Common values
Value | Count | Frequency (%)
202101 | 39 | 7.8%
202009 | 38 | 7.6%
202012 | 32 | 6.4%
202006 | 31 | 6.2%
202002 | 31 | 6.2%
202010 | 31 | 6.2%
202001 | 30 | 6.0%
202005 | 30 | 6.0%
202103 | 30 | 6.0%
202008 | 29 | 5.8%
Other values (8) | 179 | 35.8%

Minimum 10 values
Value | Count | Frequency (%)
201912 | 1 | 0.2%
202001 | 30 | 6.0%
202002 | 31 | 6.2%
202003 | 27 | 5.4%
202004 | 28 | 5.6%
202005 | 30 | 6.0%
202006 | 31 | 6.2%
202007 | 22 | 4.4%
202008 | 29 | 5.8%
202009 | 38 | 7.6%

Maximum 10 values
Value | Count | Frequency (%)
202105 | 25 | 5.0%
202104 | 26 | 5.2%
202103 | 30 | 6.0%
202102 | 26 | 5.2%
202101 | 39 | 7.8%
202012 | 32 | 6.4%
202011 | 24 | 4.8%
202010 | 31 | 6.2%
202009 | 38 | 7.6%
202008 | 29 | 5.8%

Distinct: 496
Distinct (%): 99.2%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB

Length

Max length: 15
Median length: 11
Mean length: 12.386
Min length: 9

Characters and Unicode

Total characters: 6193
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.
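
As an illustration of those character properties, the sketch below counts Unicode general categories with Python's standard `unicodedata` module. It uses only the five sample rows listed for this variable, so the counts are illustrative and will not match the full-column totals; the mapping from category codes to the names used in this report is written out by hand.

```python
import unicodedata
from collections import Counter

# The five sample rows shown for this variable.
samples = [
    "11215-100200522",
    "11290-100256230",
    "11290-100245949",
    "11680-7902",
    "11500-17348",
]

# Hand-written mapping from Unicode general-category codes to the report's labels.
CATEGORY_NAMES = {"Nd": "Decimal Number", "Pd": "Dash Punctuation"}

categories = Counter(unicodedata.category(ch) for text in samples for ch in text)
total_chars = sum(categories.values())

for code, count in categories.most_common():
    label = CATEGORY_NAMES.get(code, code)
    print(f"{label}: {count} ({100 * count / total_chars:.1f}%)")
```

The standard library exposes general categories but not scripts or blocks, so reproducing the script and block breakdowns would need a third-party package or the Unicode data files.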

Unique

Unique: 492
Unique (%): 98.4%
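
Distinct and Unique measure different things: Distinct counts the different values present, while Unique counts the values that occur exactly once. The sketch below reproduces the two figures on synthetic placeholder values (not the real keys), assuming 492 values that appear once and 4 values that appear twice, which matches the 500 rows of this dataset.

```python
from collections import Counter
from typing import Iterable, Tuple

def distinct_and_unique(values: Iterable[str]) -> Tuple[int, int]:
    """Return (number of different values, number of values seen exactly once)."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for count in counts.values() if count == 1)
    return distinct, unique

# Synthetic stand-ins: 492 values occurring once and 4 values occurring twice.
values = [f"single-{i}" for i in range(492)] + [f"double-{i}" for i in range(4)] * 2

distinct, unique = distinct_and_unique(values)
print(f"rows={len(values)} distinct={distinct} unique={unique}")
```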

Sample

1st row: 11215-100200522
2nd row: 11290-100256230
3rd row: 11290-100245949
4th row: 11680-7902
5th row: 11500-17348

Value | Count | Frequency (%)
11305-30648 | 2 | 0.4%
11470-14507 | 2 | 0.4%
11680-100272874 | 2 | 0.4%
11410-100223282 | 2 | 0.4%
11650-7460 | 1 | 0.2%
11260-100195287 | 1 | 0.2%
11305-13023 | 1 | 0.2%
11710-21903 | 1 | 0.2%
11470-100224148 | 1 | 0.2%
11710-15374 | 1 | 0.2%
Other values (486) | 486 | 97.2%

Most occurring characters

Value | Count | Frequency (%)
1 | 1675 | 27.0%
0 | 1120 | 18.1%
2 | 591 | 9.5%
- | 500 | 8.1%
5 | 407 | 6.6%
4 | 371 | 6.0%
3 | 368 | 5.9%
6 | 311 | 5.0%
7 | 302 | 4.9%
8 | 277 | 4.5%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 5693 | 91.9%
Dash Punctuation | 500 | 8.1%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 1675 | 29.4%
0 | 1120 | 19.7%
2 | 591 | 10.4%
5 | 407 | 7.1%
4 | 371 | 6.5%
3 | 368 | 6.5%
6 | 311 | 5.5%
7 | 302 | 5.3%
8 | 277 | 4.9%
9 | 271 | 4.8%

Dash Punctuation
Value | Count | Frequency (%)
- | 500 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 6193 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 1675 | 27.0%
0 | 1120 | 18.1%
2 | 591 | 9.5%
- | 500 | 8.1%
5 | 407 | 6.6%
4 | 371 | 6.0%
3 | 368 | 5.9%
6 | 311 | 5.0%
7 | 302 | 4.9%
8 | 277 | 4.5%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 6193 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 1675 | 27.0%
0 | 1120 | 18.1%
2 | 591 | 9.5%
- | 500 | 8.1%
5 | 407 | 6.6%
4 | 371 | 6.0%
3 | 368 | 5.9%
6 | 311 | 5.0%
7 | 302 | 4.9%
8 | 277 | 4.5%

Distinct: 500
Distinct (%): 100.0%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB

Length

Max length: 15
Median length: 12
Mean length: 12.982
Min length: 11

Characters and Unicode

Total characters: 6491
Distinct characters: 11
Distinct categories: 2
Distinct scripts: 1
Distinct blocks: 1
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique: 500
Unique (%): 100.0%

Sample

1st row: 11710-76789
2nd row: 11590-78834
3rd row: 11305-90089
4th row: 11620-100200173
5th row: 11380-100190494

Value | Count | Frequency (%)
11710-76789 | 1 | 0.2%
11215-100252583 | 1 | 0.2%
11215-41449 | 1 | 0.2%
11545-100232153 | 1 | 0.2%
11440-100219159 | 1 | 0.2%
11140-69399 | 1 | 0.2%
11500-100281280 | 1 | 0.2%
11380-100254980 | 1 | 0.2%
11650-130940 | 1 | 0.2%
11380-69734 | 1 | 0.2%
Other values (490) | 490 | 98.0%

Most occurring characters

Value | Count | Frequency (%)
1 | 1749 | 26.9%
0 | 1205 | 18.6%
2 | 548 | 8.4%
- | 500 | 7.7%
5 | 444 | 6.8%
4 | 388 | 6.0%
3 | 370 | 5.7%
7 | 359 | 5.5%
6 | 327 | 5.0%
8 | 309 | 4.8%

Most occurring categories

Value | Count | Frequency (%)
Decimal Number | 5991 | 92.3%
Dash Punctuation | 500 | 7.7%

Most frequent character per category

Decimal Number
Value | Count | Frequency (%)
1 | 1749 | 29.2%
0 | 1205 | 20.1%
2 | 548 | 9.1%
5 | 444 | 7.4%
4 | 388 | 6.5%
3 | 370 | 6.2%
7 | 359 | 6.0%
6 | 327 | 5.5%
8 | 309 | 5.2%
9 | 292 | 4.9%

Dash Punctuation
Value | Count | Frequency (%)
- | 500 | 100.0%

Most occurring scripts

Value | Count | Frequency (%)
Common | 6491 | 100.0%

Most frequent character per script

Common
Value | Count | Frequency (%)
1 | 1749 | 26.9%
0 | 1205 | 18.6%
2 | 548 | 8.4%
- | 500 | 7.7%
5 | 444 | 6.8%
4 | 388 | 6.0%
3 | 370 | 5.7%
7 | 359 | 5.5%
6 | 327 | 5.0%
8 | 309 | 4.8%

Most occurring blocks

Value | Count | Frequency (%)
ASCII | 6491 | 100.0%

Most frequent character per block

ASCII
Value | Count | Frequency (%)
1 | 1749 | 26.9%
0 | 1205 | 18.6%
2 | 548 | 8.4%
- | 500 | 7.7%
5 | 444 | 6.8%
4 | 388 | 6.0%
3 | 370 | 5.7%
7 | 359 | 5.5%
6 | 327 | 5.0%
8 | 309 | 4.8%

Distinct: 429
Distinct (%): 85.8%
Missing: 0
Missing (%): 0.0%
Memory size: 4.0 KiB