Overview

Dataset statistics

Number of variables16
Number of observations100
Missing cells0
Missing cells (%)0.0%
Duplicate rows0
Duplicate rows (%)0.0%
Total size in memory13.8 KiB
Average record size in memory141.3 B

Variable types

Numeric9
Categorical5
Text2

Alerts

도로종류 has constant value ""Constant
측정일 has constant value ""Constant
측정시간 has constant value ""Constant
기본키 is highly overall correlated with 주소High correlation
연장((km)) is highly overall correlated with 주소High correlation
좌표위치위도((°)) is highly overall correlated with 주소High correlation
좌표위치경도((°)) is highly overall correlated with 주소High correlation
co((g/km)) is highly overall correlated with nox((g/km)) and 4 other fieldsHigh correlation
nox((g/km)) is highly overall correlated with co((g/km)) and 3 other fieldsHigh correlation
hc((g/km)) is highly overall correlated with co((g/km)) and 3 other fieldsHigh correlation
pm((g/km)) is highly overall correlated with co((g/km)) and 3 other fieldsHigh correlation
co2((g/km)) is highly overall correlated with co((g/km)) and 4 other fieldsHigh correlation
주소 is highly overall correlated with 기본키 and 5 other fieldsHigh correlation
기본키 has unique valuesUnique
co((g/km)) has unique valuesUnique
nox((g/km)) has unique valuesUnique
co2((g/km)) has unique valuesUnique

Reproduction

Analysis started2024-04-16 09:20:24.184602
Analysis finished2024-04-16 09:20:32.328089
Duration8.14 seconds
Software versionydata-profiling vv4.5.1
Download configurationconfig.json

Variables

기본키
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean50.5
Minimum1
Maximum100
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-16T18:20:32.401715image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum1
5-th percentile5.95
Q125.75
median50.5
Q375.25
95-th percentile95.05
Maximum100
Range99
Interquartile range (IQR)49.5

Descriptive statistics

Standard deviation29.011492
Coefficient of variation (CV)0.57448499
Kurtosis-1.2
Mean50.5
Median Absolute Deviation (MAD)25
Skewness0
Sum5050
Variance841.66667
MonotonicityStrictly increasing
2024-04-16T18:20:32.562256image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
1 1
 
1.0%
65 1
 
1.0%
75 1
 
1.0%
74 1
 
1.0%
73 1
 
1.0%
72 1
 
1.0%
71 1
 
1.0%
70 1
 
1.0%
69 1
 
1.0%
68 1
 
1.0%
Other values (90) 90
90.0%
ValueCountFrequency (%)
1 1
1.0%
2 1
1.0%
3 1
1.0%
4 1
1.0%
5 1
1.0%
6 1
1.0%
7 1
1.0%
8 1
1.0%
9 1
1.0%
10 1
1.0%
ValueCountFrequency (%)
100 1
1.0%
99 1
1.0%
98 1
1.0%
97 1
1.0%
96 1
1.0%
95 1
1.0%
94 1
1.0%
93 1
1.0%
92 1
1.0%
91 1
1.0%

도로종류
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
건기연
100 

Length

Max length3
Median length3
Mean length3
Min length3

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row건기연
2nd row건기연
3rd row건기연
4th row건기연
5th row건기연

Common Values

ValueCountFrequency (%)
건기연 100
100.0%

Length

2024-04-16T18:20:32.675634image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T18:20:32.750831image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
건기연 100
100.0%

지점
Text

Distinct50
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2024-04-16T18:20:32.966044image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length8
Mean length8
Min length8

Characters and Unicode

Total characters800
Distinct characters13
Distinct categories4 ?
Distinct scripts1 ?
Distinct blocks1 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row[0526-3]
2nd row[0526-3]
3rd row[0529-0]
4th row[0529-0]
5th row[0530-0]
ValueCountFrequency (%)
0526-3 2
 
2.0%
4313-0 2
 
2.0%
5601-2 2
 
2.0%
3813-1 2
 
2.0%
3814-0 2
 
2.0%
3818-0 2
 
2.0%
4209-1 2
 
2.0%
4209-2 2
 
2.0%
4212-1 2
 
2.0%
4213-1 2
 
2.0%
Other values (40) 80
80.0%
2024-04-16T18:20:33.251914image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
0 106
13.2%
1 104
13.0%
[ 100
12.5%
- 100
12.5%
] 100
12.5%
3 64
8.0%
2 58
7.2%
4 56
7.0%
6 34
 
4.2%
5 28
 
3.5%
Other values (3) 50
6.2%

Most occurring categories

ValueCountFrequency (%)
Decimal Number 500
62.5%
Open Punctuation 100
 
12.5%
Dash Punctuation 100
 
12.5%
Close Punctuation 100
 
12.5%

Most frequent character per category

Decimal Number
ValueCountFrequency (%)
0 106
21.2%
1 104
20.8%
3 64
12.8%
2 58
11.6%
4 56
11.2%
6 34
 
6.8%
5 28
 
5.6%
7 22
 
4.4%
8 18
 
3.6%
9 10
 
2.0%
Open Punctuation
ValueCountFrequency (%)
[ 100
100.0%
Dash Punctuation
ValueCountFrequency (%)
- 100
100.0%
Close Punctuation
ValueCountFrequency (%)
] 100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Common 800
100.0%

Most frequent character per script

Common
ValueCountFrequency (%)
0 106
13.2%
1 104
13.0%
[ 100
12.5%
- 100
12.5%
] 100
12.5%
3 64
8.0%
2 58
7.2%
4 56
7.0%
6 34
 
4.2%
5 28
 
3.5%
Other values (3) 50
6.2%

Most occurring blocks

ValueCountFrequency (%)
ASCII 800
100.0%

Most frequent character per block

ASCII
ValueCountFrequency (%)
0 106
13.2%
1 104
13.0%
[ 100
12.5%
- 100
12.5%
] 100
12.5%
3 64
8.0%
2 58
7.2%
4 56
7.0%
6 34
 
4.2%
5 28
 
3.5%
Other values (3) 50
6.2%

방향
Categorical

Distinct2
Distinct (%)2.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
1
50 
2
50 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row1
2nd row2
3rd row1
4th row2
5th row1

Common Values

ValueCountFrequency (%)
1 50
50.0%
2 50
50.0%

Length

2024-04-16T18:20:33.361151image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T18:20:33.434240image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
1 50
50.0%
2 50
50.0%
Distinct50
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
2024-04-16T18:20:33.619212image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Length

Max length8
Median length5
Mean length5.18
Min length4

Characters and Unicode

Total characters518
Distinct characters90
Distinct categories3 ?
Distinct scripts3 ?
Distinct blocks2 ?
The Unicode Standard assigns character properties to each code point, which can be used to analyse textual variables.

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row원주-소초
2nd row원주-소초
3rd row공근-동산
4th row공근-동산
5th row횡성-춘천
ValueCountFrequency (%)
원주-소초 2
 
2.0%
문혜-김화 2
 
2.0%
김화-근남 2
 
2.0%
신동-사북 2
 
2.0%
남-사북 2
 
2.0%
도계-고천 2
 
2.0%
원주-새말 2
 
2.0%
문막-원주 2
 
2.0%
안흥-운교 2
 
2.0%
노론-용탄 2
 
2.0%
Other values (40) 80
80.0%
2024-04-16T18:20:33.969642image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Most occurring characters

ValueCountFrequency (%)
- 100
 
19.3%
24
 
4.6%
18
 
3.5%
14
 
2.7%
14
 
2.7%
12
 
2.3%
12
 
2.3%
12
 
2.3%
12
 
2.3%
10
 
1.9%
Other values (80) 290
56.0%

Most occurring categories

ValueCountFrequency (%)
Other Letter 414
79.9%
Dash Punctuation 100
 
19.3%
Uppercase Letter 4
 
0.8%

Most frequent character per category

Other Letter
ValueCountFrequency (%)
24
 
5.8%
18
 
4.3%
14
 
3.4%
14
 
3.4%
12
 
2.9%
12
 
2.9%
12
 
2.9%
12
 
2.9%
10
 
2.4%
10
 
2.4%
Other values (77) 276
66.7%
Uppercase Letter
ValueCountFrequency (%)
C 2
50.0%
I 2
50.0%
Dash Punctuation
ValueCountFrequency (%)
- 100
100.0%

Most occurring scripts

ValueCountFrequency (%)
Hangul 414
79.9%
Common 100
 
19.3%
Latin 4
 
0.8%

Most frequent character per script

Hangul
ValueCountFrequency (%)
24
 
5.8%
18
 
4.3%
14
 
3.4%
14
 
3.4%
12
 
2.9%
12
 
2.9%
12
 
2.9%
12
 
2.9%
10
 
2.4%
10
 
2.4%
Other values (77) 276
66.7%
Latin
ValueCountFrequency (%)
C 2
50.0%
I 2
50.0%
Common
ValueCountFrequency (%)
- 100
100.0%

Most occurring blocks

ValueCountFrequency (%)
Hangul 414
79.9%
ASCII 104
 
20.1%

Most frequent character per block

ASCII
ValueCountFrequency (%)
- 100
96.2%
C 2
 
1.9%
I 2
 
1.9%
Hangul
ValueCountFrequency (%)
24
 
5.8%
18
 
4.3%
14
 
3.4%
14
 
3.4%
12
 
2.9%
12
 
2.9%
12
 
2.9%
12
 
2.9%
10
 
2.4%
10
 
2.4%
Other values (77) 276
66.7%

연장((km))
Real number (ℝ)

HIGH CORRELATION 

Distinct45
Distinct (%)45.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean10.284
Minimum2
Maximum27
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-16T18:20:34.130352image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum2
5-th percentile2.4
Q15.7
median8.95
Q313.3
95-th percentile23
Maximum27
Range25
Interquartile range (IQR)7.6

Descriptive statistics

Standard deviation6.1305448
Coefficient of variation (CV)0.59612454
Kurtosis0.20968578
Mean10.284
Median Absolute Deviation (MAD)4.2
Skewness0.80873663
Sum1028.4
Variance37.58358
MonotonicityNot monotonic
2024-04-16T18:20:34.235106image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=45)
ValueCountFrequency (%)
7.2 4
 
4.0%
12.0 4
 
4.0%
3.5 4
 
4.0%
8.0 4
 
4.0%
8.8 4
 
4.0%
4.8 2
 
2.0%
14.4 2
 
2.0%
3.8 2
 
2.0%
5.9 2
 
2.0%
16.1 2
 
2.0%
Other values (35) 70
70.0%
ValueCountFrequency (%)
2.0 2
2.0%
2.3 2
2.0%
2.4 2
2.0%
2.6 2
2.0%
2.9 2
2.0%
3.5 4
4.0%
3.6 2
2.0%
3.8 2
2.0%
4.0 2
2.0%
4.4 2
2.0%
ValueCountFrequency (%)
27.0 2
2.0%
24.7 2
2.0%
23.0 2
2.0%
22.7 2
2.0%
18.1 2
2.0%
18.0 2
2.0%
17.4 2
2.0%
16.4 2
2.0%
16.1 2
2.0%
15.3 2
2.0%

측정일
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
20210401
100 

Length

Max length8
Median length8
Mean length8
Min length8

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row20210401
2nd row20210401
3rd row20210401
4th row20210401
5th row20210401

Common Values

ValueCountFrequency (%)
20210401 100
100.0%

Length

2024-04-16T18:20:34.329999image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T18:20:34.400658image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
20210401 100
100.0%

측정시간
Categorical

CONSTANT 

Distinct1
Distinct (%)1.0%
Missing0
Missing (%)0.0%
Memory size932.0 B
0
100 

Length

Max length1
Median length1
Mean length1
Min length1

Unique

Unique0 ?
Unique (%)0.0%

Sample

1st row0
2nd row0
3rd row0
4th row0
5th row0

Common Values

ValueCountFrequency (%)
0 100
100.0%

Length

2024-04-16T18:20:34.472050image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram of lengths of the category

Common Values (Plot)

2024-04-16T18:20:34.545725image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
ValueCountFrequency (%)
0 100
100.0%

좌표위치위도((°))
Real number (ℝ)

HIGH CORRELATION 

Distinct50
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean37.701821
Minimum37.08588
Maximum38.38086
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-16T18:20:34.888312image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum37.08588
5-th percentile37.18732
Q137.40743
median37.67506
Q338.04937
95-th percentile38.23094
Maximum38.38086
Range1.29498
Interquartile range (IQR)0.64194

Descriptive statistics

Standard deviation0.35485589
Coefficient of variation (CV)0.0094121684
Kurtosis-1.2690701
Mean37.701821
Median Absolute Deviation (MAD)0.33918
Skewness0.079947021
Sum3770.1821
Variance0.1259227
MonotonicityNot monotonic
2024-04-16T18:20:35.012917image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
37.3551 2
 
2.0%
37.83194 2
 
2.0%
37.25108 2
 
2.0%
37.30412 2
 
2.0%
37.40802 2
 
2.0%
37.32395 2
 
2.0%
37.4163 2
 
2.0%
37.32703 2
 
2.0%
37.4489 2
 
2.0%
37.48273 2
 
2.0%
Other values (40) 80
80.0%
ValueCountFrequency (%)
37.08588 2
2.0%
37.18474 2
2.0%
37.18732 2
2.0%
37.19159 2
2.0%
37.21543 2
2.0%
37.25108 2
2.0%
37.28643 2
2.0%
37.30412 2
2.0%
37.32395 2
2.0%
37.32703 2
2.0%
ValueCountFrequency (%)
38.38086 2
2.0%
38.25247 2
2.0%
38.23094 2
2.0%
38.19136 2
2.0%
38.18502 2
2.0%
38.17869 2
2.0%
38.14989 2
2.0%
38.11527 2
2.0%
38.08778 2
2.0%
38.07373 2
2.0%

좌표위치경도((°))
Real number (ℝ)

HIGH CORRELATION 

Distinct50
Distinct (%)50.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean128.25485
Minimum127.35058
Maximum129.20253
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-16T18:20:35.132752image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/

Quantile statistics

Minimum127.35058
5-th percentile127.47033
Q1127.86266
median128.19891
Q3128.64197
95-th percentile129.07044
Maximum129.20253
Range1.85195
Interquartile range (IQR)0.77931

Descriptive statistics

Standard deviation0.48416528
Coefficient of variation (CV)0.0037750251
Kurtosis-0.95508247
Mean128.25485
Median Absolute Deviation (MAD)0.36049
Skewness0.13492916
Sum12825.485
Variance0.23441602
MonotonicityNot monotonic
2024-04-16T18:20:35.254404image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/
Histogram with fixed size bins (bins=50)
ValueCountFrequency (%)
127.99487 2
 
2.0%
128.01254 2
 
2.0%
128.7796 2
 
2.0%
129.07044 2
 
2.0%
127.99641 2
 
2.0%
127.83897 2
 
2.0%
128.2034 2
 
2.0%
128.51597 2
 
2.0%
128.66017 2
 
2.0%
129.09293 2
 
2.0%
Other values (40) 80
80.0%
ValueCountFrequency (%)
127.35058 2
2.0%
127.41894 2
2.0%
127.47033 2
2.0%
127.60419 2
2.0%
127.62463 2
2.0%
127.63815 2
2.0%
127.67987 2
2.0%
127.77663 2
2.0%
127.81252 2
2.0%
127.81502 2
2.0%
ValueCountFrequency (%)
129.20253 2
2.0%
129.09293 2
2.0%
129.07044 2
2.0%
129.02671 2
2.0%
128.98396 2
2.0%
128.84543 2
2.0%
128.84271 2
2.0%
128.83913 2
2.0%
128.81034 2
2.0%
128.79017 2
2.0%

co((g/km))
Real number (ℝ)

HIGH CORRELATION  UNIQUE 

Distinct100
Distinct (%)100.0%
Missing0
Missing (%)0.0%
Infinite0
Infinite (%)0.0%
Mean2420.4145
Minimum363.68
Maximum9595.97
Zeros0
Zeros (%)0.0%
Negative0
Negative (%)0.0%
Memory size1.0 KiB
2024-04-16T18:20:35.398088image/svg+xmlMatplotlib v3.7.2, https://matplotlib.org/