Dataset statistics
Number of variables | 15 |
---|---|
Number of observations | 546212 |
Missing cells | 48324 |
Missing cells (%) | 0.6% |
Duplicate rows | 0 |
Duplicate rows (%) | 0.0% |
Total size in memory | 62.5 MiB |
Average record size in memory | 120.0 B |
Variable types
Categorical | 9 |
---|---|
Numeric | 6 |
PUNT_LECTURA_CRITICA is highly correlated with PUNT_MATEMATICAS and 4 other fields | High correlation |
PUNT_MATEMATICAS is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_C_NATURALES is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_SOCIALES_CIUDADANAS is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_INGLES is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_GLOBAL is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_LECTURA_CRITICA is highly correlated with PUNT_MATEMATICAS and 4 other fields | High correlation |
PUNT_MATEMATICAS is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_C_NATURALES is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_SOCIALES_CIUDADANAS is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_INGLES is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_GLOBAL is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_LECTURA_CRITICA is highly correlated with PUNT_MATEMATICAS and 4 other fields | High correlation |
PUNT_MATEMATICAS is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_C_NATURALES is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_SOCIALES_CIUDADANAS is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_INGLES is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_GLOBAL is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
ESTU_DEPTO_RESIDE is highly correlated with COLE_DEPTO_UBICACION | High correlation |
FAMI_TIENEINTERNET is highly correlated with ESTU_DEDICACIONLECTURADIARIA and 1 other fields | High correlation |
ESTU_DEDICACIONLECTURADIARIA is highly correlated with FAMI_TIENEINTERNET | High correlation |
COLE_DEPTO_UBICACION is highly correlated with ESTU_DEPTO_RESIDE | High correlation |
FAMI_ESTRATOVIVIENDA is highly correlated with FAMI_TIENEINTERNET | High correlation |
ESTU_DEPTO_RESIDE is highly correlated with COLE_DEPTO_UBICACION | High correlation |
FAMI_ESTRATOVIVIENDA is highly correlated with FAMI_TIENEINTERNET and 1 other fields | High correlation |
FAMI_TIENEINTERNET is highly correlated with FAMI_ESTRATOVIVIENDA and 2 other fields | High correlation |
FAMI_SITUACIONECONOMICA is highly correlated with ESTU_HORASSEMANATRABAJA | High correlation |
ESTU_DEDICACIONLECTURADIARIA is highly correlated with FAMI_ESTRATOVIVIENDA and 1 other fields | High correlation |
ESTU_DEDICACIONINTERNET is highly correlated with FAMI_TIENEINTERNET | High correlation |
ESTU_HORASSEMANATRABAJA is highly correlated with FAMI_SITUACIONECONOMICA | High correlation |
COLE_DEPTO_UBICACION is highly correlated with ESTU_DEPTO_RESIDE | High correlation |
PUNT_LECTURA_CRITICA is highly correlated with PUNT_MATEMATICAS and 4 other fields | High correlation |
PUNT_MATEMATICAS is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_C_NATURALES is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_SOCIALES_CIUDADANAS is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_INGLES is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
PUNT_GLOBAL is highly correlated with PUNT_LECTURA_CRITICA and 4 other fields | High correlation |
FAMI_TIENEINTERNET has 8337 (1.5%) missing values | Missing |
FAMI_SITUACIONECONOMICA has 8259 (1.5%) missing values | Missing |
ESTU_DEDICACIONINTERNET has 30298 (5.5%) missing values | Missing |
Reproduction
Analysis started | 2022-05-24 18:26:41.511446 |
---|---|
Analysis finished | 2022-05-24 18:27:42.292309 |
Duration | 1 minute and 0.78 seconds |
Software version | pandas-profiling v3.2.0 |
Download configuration | config.json |
ESTU_GENERO
Categorical
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.2 MiB |
F | |
---|---|
M | |
- | 121 |
Length
Max length | 1 |
---|---|
Median length | 1 |
Mean length | 1 |
Min length | 1 |
Characters and Unicode
Total characters | 546212 |
---|---|
Distinct characters | 3 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | M |
---|---|
2nd row | M |
3rd row | M |
4th row | M |
5th row | M |
Common Values
Value | Count | Frequency (%) |
F | 295994 | |
M | 250097 | |
- | 121 | < 0.1% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
f | 295994 | |
m | 250097 | |
121 | < 0.1% |
Most occurring characters
Value | Count | Frequency (%) |
F | 295994 | |
M | 250097 | |
- | 121 | < 0.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 546091 | |
Dash Punctuation | 121 | < 0.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
F | 295994 | |
M | 250097 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 121 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 546091 | |
Common | 121 | < 0.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
F | 295994 | |
M | 250097 |
Common
Value | Count | Frequency (%) |
- | 121 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 546212 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
F | 295994 | |
M | 250097 | |
- | 121 | < 0.1% |
Distinct | 34 |
---|---|
Distinct (%) | < 0.1% |
Missing | 377 |
Missing (%) | 0.1% |
Memory size | 4.2 MiB |
BOGOTÁ | |
---|---|
ANTIOQUIA | |
VALLE | |
CUNDINAMARCA | |
ATLANTICO | |
Other values (29) |
Length
Max length | 15 |
---|---|
Median length | 12 |
Mean length | 7.530119908 |
Min length | 4 |
Characters and Unicode
Total characters | 4110203 |
---|---|
Distinct characters | 26 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | MAGDALENA |
---|---|
2nd row | BOGOTÁ |
3rd row | BOLIVAR |
4th row | BOGOTÁ |
5th row | BOGOTÁ |
Common Values
Value | Count | Frequency (%) |
BOGOTÁ | 83600 | |
ANTIOQUIA | 74228 | |
VALLE | 38640 | 7.1% |
CUNDINAMARCA | 36196 | 6.6% |
ATLANTICO | 32179 | 5.9% |
SANTANDER | 25473 | 4.7% |
BOLIVAR | 25232 | 4.6% |
CORDOBA | 20037 | 3.7% |
NARIÑO | 16903 | 3.1% |
BOYACA | 16763 | 3.1% |
Other values (24) | 176584 |
Length
Value | Count | Frequency (%) |
bogotá | 83600 | |
antioquia | 74228 | 13.0% |
santander | 41241 | 7.2% |
valle | 38640 | 6.8% |
cundinamarca | 36196 | 6.3% |
atlantico | 32179 | 5.6% |
bolivar | 25232 | 4.4% |
cordoba | 20037 | 3.5% |
nariño | 16903 | 3.0% |
boyaca | 16763 | 2.9% |
Other values (26) | 186051 |
Most occurring characters
Value | Count | Frequency (%) |
A | 800950 | |
O | 425965 | |
N | 325463 | 7.9% |
I | 323904 | 7.9% |
T | 316893 | 7.7% |
C | 228237 | 5.6% |
R | 221090 | 5.4% |
L | 211817 | 5.2% |
U | 182367 | 4.4% |
E | 161838 | 3.9% |
Other values (16) | 911679 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 4084968 | |
Space Separator | 25235 | 0.6% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 800950 | |
O | 425965 | |
N | 325463 | 8.0% |
I | 323904 | 7.9% |
T | 316893 | 7.8% |
C | 228237 | 5.6% |
R | 221090 | 5.4% |
L | 211817 | 5.2% |
U | 182367 | 4.5% |
E | 161838 | 4.0% |
Other values (15) | 886444 |
Space Separator
Value | Count | Frequency (%) |
25235 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 4084968 | |
Common | 25235 | 0.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 800950 | |
O | 425965 | |
N | 325463 | 8.0% |
I | 323904 | 7.9% |
T | 316893 | 7.8% |
C | 228237 | 5.6% |
R | 221090 | 5.4% |
L | 211817 | 5.2% |
U | 182367 | 4.5% |
E | 161838 | 4.0% |
Other values (15) | 886444 |
Common
Value | Count | Frequency (%) |
25235 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4009700 | |
None | 100503 | 2.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
A | 800950 | |
O | 425965 | |
N | 325463 | |
I | 323904 | 8.1% |
T | 316893 | 7.9% |
C | 228237 | 5.7% |
R | 221090 | 5.5% |
L | 211817 | 5.3% |
U | 182367 | 4.5% |
E | 161838 | 4.0% |
Other values (14) | 811176 |
None
Value | Count | Frequency (%) |
Á | 83600 | |
Ñ | 16903 | 16.8% |
Distinct | 8 |
---|---|
Distinct (%) | < 0.1% |
Missing | 26 |
Missing (%) | < 0.1% |
Memory size | 4.2 MiB |
Estrato 2 | |
---|---|
Estrato 1 | |
Estrato 3 | |
- | |
Estrato 4 | |
Other values (3) |
Length
Max length | 11 |
---|---|
Median length | 9 |
Mean length | 8.557853918 |
Min length | 1 |
Characters and Unicode
Total characters | 4674180 |
---|---|
Distinct characters | 17 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Estrato 3 |
---|---|
2nd row | Estrato 3 |
3rd row | Estrato 1 |
4th row | Estrato 3 |
5th row | Estrato 3 |
Common Values
Value | Count | Frequency (%) |
Estrato 2 | 188314 | |
Estrato 1 | 159977 | |
Estrato 3 | 108692 | |
- | 34481 | 6.3% |
Estrato 4 | 25810 | 4.7% |
Sin Estrato | 17177 | 3.1% |
Estrato 5 | 8024 | 1.5% |
Estrato 6 | 3711 | 0.7% |
(Missing) | 26 | < 0.1% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
estrato | 511705 | |
2 | 188314 | 17.8% |
1 | 159977 | 15.1% |
3 | 108692 | 10.3% |
34481 | 3.3% | |
4 | 25810 | 2.4% |
sin | 17177 | 1.6% |
5 | 8024 | 0.8% |
6 | 3711 | 0.4% |
Most occurring characters
Value | Count | Frequency (%) |
t | 1023410 | |
E | 511705 | |
s | 511705 | |
r | 511705 | |
a | 511705 | |
o | 511705 | |
511705 | ||
2 | 188314 | 4.0% |
1 | 159977 | 3.4% |
3 | 108692 | 2.3% |
Other values (7) | 123557 | 2.6% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 3104584 | |
Uppercase Letter | 528882 | 11.3% |
Space Separator | 511705 | 10.9% |
Decimal Number | 494528 | 10.6% |
Dash Punctuation | 34481 | 0.7% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
t | 1023410 | |
s | 511705 | |
r | 511705 | |
a | 511705 | |
o | 511705 | |
i | 17177 | 0.6% |
n | 17177 | 0.6% |
Decimal Number
Value | Count | Frequency (%) |
2 | 188314 | |
1 | 159977 | |
3 | 108692 | |
4 | 25810 | 5.2% |
5 | 8024 | 1.6% |
6 | 3711 | 0.8% |
Uppercase Letter
Value | Count | Frequency (%) |
E | 511705 | |
S | 17177 | 3.2% |
Space Separator
Value | Count | Frequency (%) |
511705 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 34481 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 3633466 | |
Common | 1040714 | 22.3% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
t | 1023410 | |
E | 511705 | |
s | 511705 | |
r | 511705 | |
a | 511705 | |
o | 511705 | |
S | 17177 | 0.5% |
i | 17177 | 0.5% |
n | 17177 | 0.5% |
Common
Value | Count | Frequency (%) |
511705 | ||
2 | 188314 | 18.1% |
1 | 159977 | 15.4% |
3 | 108692 | 10.4% |
- | 34481 | 3.3% |
4 | 25810 | 2.5% |
5 | 8024 | 0.8% |
6 | 3711 | 0.4% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4674180 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
t | 1023410 | |
E | 511705 | |
s | 511705 | |
r | 511705 | |
a | 511705 | |
o | 511705 | |
511705 | ||
2 | 188314 | 4.0% |
1 | 159977 | 3.4% |
3 | 108692 | 2.3% |
Other values (7) | 123557 | 2.6% |
Distinct | 3 |
---|---|
Distinct (%) | < 0.1% |
Missing | 8337 |
Missing (%) | 1.5% |
Memory size | 4.2 MiB |
Si | |
---|---|
No | |
- | 22634 |
Length
Max length | 2 |
---|---|
Median length | 2 |
Mean length | 1.957919591 |
Min length | 1 |
Characters and Unicode
Total characters | 1053116 |
---|---|
Distinct characters | 5 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Si |
---|---|
2nd row | Si |
3rd row | No |
4th row | No |
5th row | Si |
Common Values
Value | Count | Frequency (%) |
Si | 314042 | |
No | 201199 | |
- | 22634 | 4.1% |
(Missing) | 8337 | 1.5% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
si | 314042 | |
no | 201199 | |
22634 | 4.2% |
Most occurring characters
Value | Count | Frequency (%) |
S | 314042 | |
i | 314042 | |
N | 201199 | |
o | 201199 | |
- | 22634 | 2.1% |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 515241 | |
Lowercase Letter | 515241 | |
Dash Punctuation | 22634 | 2.1% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
S | 314042 | |
N | 201199 |
Lowercase Letter
Value | Count | Frequency (%) |
i | 314042 | |
o | 201199 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 22634 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 1030482 | |
Common | 22634 | 2.1% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
S | 314042 | |
i | 314042 | |
N | 201199 | |
o | 201199 |
Common
Value | Count | Frequency (%) |
- | 22634 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 1053116 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
S | 314042 | |
i | 314042 | |
N | 201199 | |
o | 201199 | |
- | 22634 | 2.1% |
Distinct | 4 |
---|---|
Distinct (%) | < 0.1% |
Missing | 8259 |
Missing (%) | 1.5% |
Memory size | 4.2 MiB |
Igual | |
---|---|
Mejor | |
Peor | |
- | 9616 |
Length
Max length | 5 |
---|---|
Median length | 5 |
Mean length | 4.794429997 |
Min length | 1 |
Characters and Unicode
Total characters | 2579178 |
---|---|
Distinct characters | 12 |
Distinct categories | 3 ? |
Distinct scripts | 2 ? |
Distinct blocks | 1 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Peor |
---|---|
2nd row | Mejor |
3rd row | Igual |
4th row | Igual |
5th row | Mejor |
Common Values
Value | Count | Frequency (%) |
Igual | 322524 | |
Mejor | 133690 | |
Peor | 72123 | 13.2% |
- | 9616 | 1.8% |
(Missing) | 8259 | 1.5% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
igual | 322524 | |
mejor | 133690 | |
peor | 72123 | 13.4% |
9616 | 1.8% |
Most occurring characters
Value | Count | Frequency (%) |
I | 322524 | |
g | 322524 | |
u | 322524 | |
a | 322524 | |
l | 322524 | |
e | 205813 | |
o | 205813 | |
r | 205813 | |
M | 133690 | |
j | 133690 | |
Other values (2) | 81739 | 3.2% |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 2041225 | |
Uppercase Letter | 528337 | 20.5% |
Dash Punctuation | 9616 | 0.4% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
g | 322524 | |
u | 322524 | |
a | 322524 | |
l | 322524 | |
e | 205813 | |
o | 205813 | |
r | 205813 | |
j | 133690 |
Uppercase Letter
Value | Count | Frequency (%) |
I | 322524 | |
M | 133690 | |
P | 72123 | 13.7% |
Dash Punctuation
Value | Count | Frequency (%) |
- | 9616 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2569562 | |
Common | 9616 | 0.4% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
I | 322524 | |
g | 322524 | |
u | 322524 | |
a | 322524 | |
l | 322524 | |
e | 205813 | |
o | 205813 | |
r | 205813 | |
M | 133690 | |
j | 133690 |
Common
Value | Count | Frequency (%) |
- | 9616 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 2579178 |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
I | 322524 | |
g | 322524 | |
u | 322524 | |
a | 322524 | |
l | 322524 | |
e | 205813 | |
o | 205813 | |
r | 205813 | |
M | 133690 | |
j | 133690 | |
Other values (2) | 81739 | 3.2% |
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 627 |
Missing (%) | 0.1% |
Memory size | 4.2 MiB |
30 minutos o menos | |
---|---|
Entre 30 y 60 minutos | |
No leo por entretenimiento | |
Entre 1 y 2 horas | |
- |
Length
Max length | 26 |
---|---|
Median length | 21 |
Mean length | 18.9777175 |
Min length | 1 |
Characters and Unicode
Total characters | 10353958 |
---|---|
Distinct characters | 26 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Entre 30 y 60 minutos |
---|---|
2nd row | Entre 30 y 60 minutos |
3rd row | Entre 30 y 60 minutos |
4th row | 30 minutos o menos |
5th row | No leo por entretenimiento |
Common Values
Value | Count | Frequency (%) |
30 minutos o menos | 199094 | |
Entre 30 y 60 minutos | 144272 | |
No leo por entretenimiento | 95621 | |
Entre 1 y 2 horas | 55480 | 10.2% |
- | 31108 | 5.7% |
Más de 2 horas | 20010 | 3.7% |
(Missing) | 627 | 0.1% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
30 | 343366 | |
minutos | 343366 | |
entre | 199752 | |
y | 199752 | |
o | 199094 | |
menos | 199094 | |
60 | 144272 | 6.3% |
por | 95621 | 4.2% |
entretenimiento | 95621 | 4.2% |
leo | 95621 | 4.2% |
Other values (7) | 373209 |
Most occurring characters
Value | Count | Frequency (%) |
1743183 | ||
o | 1199528 | |
n | 1029075 | |
e | 896961 | |
t | 829981 | |
m | 638081 | 6.2% |
s | 637960 | 6.2% |
i | 534608 | 5.2% |
0 | 487638 | 4.7% |
r | 466484 | 4.5% |
Other values (16) | 1890459 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 7158038 | |
Space Separator | 1743183 | 16.8% |
Decimal Number | 1106246 | 10.7% |
Uppercase Letter | 315383 | 3.0% |
Dash Punctuation | 31108 | 0.3% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 1199528 | |
n | 1029075 | |
e | 896961 | |
t | 829981 | |
m | 638081 | |
s | 637960 | |
i | 534608 | |
r | 466484 | 6.5% |
u | 343366 | 4.8% |
y | 199752 | 2.8% |
Other values (6) | 382242 | 5.3% |
Decimal Number
Value | Count | Frequency (%) |
0 | 487638 | |
3 | 343366 | |
6 | 144272 | 13.0% |
2 | 75490 | 6.8% |
1 | 55480 | 5.0% |
Uppercase Letter
Value | Count | Frequency (%) |
E | 199752 | |
N | 95621 | |
M | 20010 | 6.3% |
Space Separator
Value | Count | Frequency (%) |
1743183 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 31108 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 7473421 | |
Common | 2880537 | 27.8% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 1199528 | |
n | 1029075 | |
e | 896961 | |
t | 829981 | |
m | 638081 | |
s | 637960 | |
i | 534608 | |
r | 466484 | 6.2% |
u | 343366 | 4.6% |
y | 199752 | 2.7% |
Other values (9) | 697625 |
Common
Value | Count | Frequency (%) |
1743183 | ||
0 | 487638 | 16.9% |
3 | 343366 | 11.9% |
6 | 144272 | 5.0% |
2 | 75490 | 2.6% |
1 | 55480 | 1.9% |
- | 31108 | 1.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 10333948 | |
None | 20010 | 0.2% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1743183 | ||
o | 1199528 | |
n | 1029075 | |
e | 896961 | |
t | 829981 | |
m | 638081 | 6.2% |
s | 637960 | 6.2% |
i | 534608 | 5.2% |
0 | 487638 | 4.7% |
r | 466484 | 4.5% |
Other values (15) | 1870449 |
None
Value | Count | Frequency (%) |
á | 20010 |
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 30298 |
Missing (%) | 5.5% |
Memory size | 4.2 MiB |
Entre 1 y 3 horas | |
---|---|
Entre 30 y 60 minutos | |
Más de 3 horas | |
30 minutos o menos | |
No Navega Internet |
Length
Max length | 21 |
---|---|
Median length | 18 |
Mean length | 17.61314095 |
Min length | 1 |
Characters and Unicode
Total characters | 9086866 |
---|---|
Distinct characters | 26 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Entre 30 y 60 minutos |
---|---|
2nd row | Entre 30 y 60 minutos |
3rd row | Más de 3 horas |
4th row | Entre 30 y 60 minutos |
5th row | Más de 3 horas |
Common Values
Value | Count | Frequency (%) |
Entre 1 y 3 horas | 157557 | |
Entre 30 y 60 minutos | 134383 | |
Más de 3 horas | 100134 | |
30 minutos o menos | 90517 | |
No Navega Internet | 30697 | 5.6% |
- | 2626 | 0.5% |
(Missing) | 30298 | 5.5% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
entre | 291940 | |
y | 291940 | |
3 | 257691 | |
horas | 257691 | |
30 | 224900 | |
minutos | 224900 | |
1 | 157557 | |
60 | 134383 | |
más | 100134 | 4.3% |
de | 100134 | 4.3% |
Other values (6) | 275751 |
Most occurring characters
Value | Count | Frequency (%) |
1801107 | ||
o | 694322 | 7.6% |
s | 673242 | 7.4% |
n | 668751 | 7.4% |
r | 580328 | 6.4% |
t | 578234 | 6.4% |
e | 574682 | 6.3% |
3 | 482591 | 5.3% |
0 | 359283 | 4.0% |
a | 319085 | 3.5% |
Other values (16) | 2355241 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 5665154 | |
Space Separator | 1801107 | 19.8% |
Decimal Number | 1133814 | 12.5% |
Uppercase Letter | 484165 | 5.3% |
Dash Punctuation | 2626 | < 0.1% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
o | 694322 | |
s | 673242 | |
n | 668751 | |
r | 580328 | |
t | 578234 | |
e | 574682 | |
a | 319085 | |
m | 315417 | |
y | 291940 | |
h | 257691 | 4.5% |
Other values (6) | 711462 |
Decimal Number
Value | Count | Frequency (%) |
3 | 482591 | |
0 | 359283 | |
1 | 157557 | 13.9% |
6 | 134383 | 11.9% |
Uppercase Letter
Value | Count | Frequency (%) |
E | 291940 | |
M | 100134 | 20.7% |
N | 61394 | 12.7% |
I | 30697 | 6.3% |
Space Separator
Value | Count | Frequency (%) |
1801107 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 2626 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 6149319 | |
Common | 2937547 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
o | 694322 | |
s | 673242 | |
n | 668751 | |
r | 580328 | |
t | 578234 | |
e | 574682 | |
a | 319085 | 5.2% |
m | 315417 | 5.1% |
E | 291940 | 4.7% |
y | 291940 | 4.7% |
Other values (10) | 1161378 |
Common
Value | Count | Frequency (%) |
1801107 | ||
3 | 482591 | 16.4% |
0 | 359283 | 12.2% |
1 | 157557 | 5.4% |
6 | 134383 | 4.6% |
- | 2626 | 0.1% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 8986732 | |
None | 100134 | 1.1% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
1801107 | ||
o | 694322 | 7.7% |
s | 673242 | 7.5% |
n | 668751 | 7.4% |
r | 580328 | 6.5% |
t | 578234 | 6.4% |
e | 574682 | 6.4% |
3 | 482591 | 5.4% |
0 | 359283 | 4.0% |
a | 319085 | 3.6% |
Other values (15) | 2255107 |
None
Value | Count | Frequency (%) |
á | 100134 |
Distinct | 6 |
---|---|
Distinct (%) | < 0.1% |
Missing | 381 |
Missing (%) | 0.1% |
Memory size | 4.2 MiB |
0 | |
---|---|
Menos de 10 horas | |
Entre 11 y 20 horas | |
Más de 30 horas | 19907 |
- | 16509 |
Length
Max length | 19 |
---|---|
Median length | 1 |
Mean length | 6.260397449 |
Min length | 1 |
Characters and Unicode
Total characters | 3417119 |
---|---|
Distinct characters | 19 |
Distinct categories | 5 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | Menos de 10 horas |
---|---|
2nd row | Menos de 10 horas |
3rd row | 0 |
4th row | Más de 30 horas |
5th row | Más de 30 horas |
Common Values
Value | Count | Frequency (%) |
0 | 354503 | |
Menos de 10 horas | 97913 | 17.9% |
Entre 11 y 20 horas | 41729 | 7.6% |
Más de 30 horas | 19907 | 3.6% |
- | 16509 | 3.0% |
Entre 21 y 30 horas | 15270 | 2.8% |
(Missing) | 381 | 0.1% |
Length
Category Frequency Plot
Value | Count | Frequency (%) |
0 | 354503 | |
horas | 174819 | |
de | 117820 | 10.5% |
menos | 97913 | 8.7% |
10 | 97913 | 8.7% |
entre | 56999 | 5.1% |
y | 56999 | 5.1% |
11 | 41729 | 3.7% |
20 | 41729 | 3.7% |
30 | 35177 | 3.1% |
Other values (3) | 51686 | 4.6% |
Most occurring characters
Value | Count | Frequency (%) |
581456 | ||
0 | 529322 | |
s | 292639 | |
e | 272732 | |
o | 272732 | |
r | 231818 | 6.8% |
1 | 196641 | 5.8% |
a | 174819 | 5.1% |
h | 174819 | 5.1% |
n | 154912 | 4.5% |
Other values (9) | 535229 |
Most occurring categories
Value | Count | Frequency (%) |
Lowercase Letter | 1826196 | |
Decimal Number | 818139 | |
Space Separator | 581456 | 17.0% |
Uppercase Letter | 174819 | 5.1% |
Dash Punctuation | 16509 | 0.5% |
Most frequent character per category
Lowercase Letter
Value | Count | Frequency (%) |
s | 292639 | |
e | 272732 | |
o | 272732 | |
r | 231818 | |
a | 174819 | |
h | 174819 | |
n | 154912 | |
d | 117820 | |
t | 56999 | 3.1% |
y | 56999 | 3.1% |
Decimal Number
Value | Count | Frequency (%) |
0 | 529322 | |
1 | 196641 | 24.0% |
2 | 56999 | 7.0% |
3 | 35177 | 4.3% |
Uppercase Letter
Value | Count | Frequency (%) |
M | 117820 | |
E | 56999 |
Space Separator
Value | Count | Frequency (%) |
581456 |
Dash Punctuation
Value | Count | Frequency (%) |
- | 16509 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 2001015 | |
Common | 1416104 |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
s | 292639 | |
e | 272732 | |
o | 272732 | |
r | 231818 | |
a | 174819 | |
h | 174819 | |
n | 154912 | |
M | 117820 | |
d | 117820 | |
E | 56999 | 2.8% |
Other values (3) | 133905 |
Common
Value | Count | Frequency (%) |
581456 | ||
0 | 529322 | |
1 | 196641 | 13.9% |
2 | 56999 | 4.0% |
3 | 35177 | 2.5% |
- | 16509 | 1.2% |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 3397212 | |
None | 19907 | 0.6% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
581456 | ||
0 | 529322 | |
s | 292639 | |
e | 272732 | |
o | 272732 | |
r | 231818 | 6.8% |
1 | 196641 | 5.8% |
a | 174819 | 5.1% |
h | 174819 | 5.1% |
n | 154912 | 4.6% |
Other values (8) | 515322 |
None
Value | Count | Frequency (%) |
á | 19907 |
Distinct | 33 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Memory size | 4.2 MiB |
BOGOTÁ | |
---|---|
ANTIOQUIA | |
VALLE | |
CUNDINAMARCA | |
ATLANTICO | |
Other values (28) |
Length
Max length | 15 |
---|---|
Median length | 12 |
Mean length | 7.541300814 |
Min length | 4 |
Characters and Unicode
Total characters | 4119149 |
---|---|
Distinct characters | 25 |
Distinct categories | 2 ? |
Distinct scripts | 2 ? |
Distinct blocks | 2 ? |
Unique
Unique | 0 ? |
---|---|
Unique (%) | 0.0% |
Sample
1st row | MAGDALENA |
---|---|
2nd row | BOGOTÁ |
3rd row | BOLIVAR |
4th row | BOGOTÁ |
5th row | BOGOTÁ |
Common Values
Value | Count | Frequency (%) |
BOGOTÁ | 82832 | |
ANTIOQUIA | 74182 | |
VALLE | 38664 | 7.1% |
CUNDINAMARCA | 37049 | 6.8% |
ATLANTICO | 32235 | 5.9% |
SANTANDER | 25751 | 4.7% |
BOLIVAR | 25418 | 4.7% |
CORDOBA | 19984 | 3.7% |
NARIÑO | 16933 | 3.1% |
BOYACA | 16737 | 3.1% |
Other values (23) | 176427 |
Length
Value | Count | Frequency (%) |
bogotá | 82832 | |
antioquia | 74182 | 13.0% |
santander | 41671 | 7.3% |
valle | 38664 | 6.8% |
cundinamarca | 37049 | 6.5% |
atlantico | 32235 | 5.6% |
bolivar | 25418 | 4.4% |
cordoba | 19984 | 3.5% |
nariño | 16933 | 3.0% |
boyaca | 16737 | 2.9% |
Other values (25) | 185882 |
Most occurring characters
Value | Count | Frequency (%) |
A | 803921 | |
O | 424463 | |
N | 328006 | 8.0% |
I | 324942 | 7.9% |
T | 316651 | 7.7% |
C | 229766 | 5.6% |
R | 222404 | 5.4% |
L | 211884 | 5.1% |
U | 183212 | 4.4% |
E | 162148 | 3.9% |
Other values (15) | 911752 |
Most occurring categories
Value | Count | Frequency (%) |
Uppercase Letter | 4093774 | |
Space Separator | 25375 | 0.6% |
Most frequent character per category
Uppercase Letter
Value | Count | Frequency (%) |
A | 803921 | |
O | 424463 | |
N | 328006 | 8.0% |
I | 324942 | 7.9% |
T | 316651 | 7.7% |
C | 229766 | 5.6% |
R | 222404 | 5.4% |
L | 211884 | 5.2% |
U | 183212 | 4.5% |
E | 162148 | 4.0% |
Other values (14) | 886377 |
Space Separator
Value | Count | Frequency (%) |
25375 |
Most occurring scripts
Value | Count | Frequency (%) |
Latin | 4093774 | |
Common | 25375 | 0.6% |
Most frequent character per script
Latin
Value | Count | Frequency (%) |
A | 803921 | |
O | 424463 | |
N | 328006 | 8.0% |
I | 324942 | 7.9% |
T | 316651 | 7.7% |
C | 229766 | 5.6% |
R | 222404 | 5.4% |
L | 211884 | 5.2% |
U | 183212 | 4.5% |
E | 162148 | 4.0% |
Other values (14) | 886377 |
Common
Value | Count | Frequency (%) |
25375 |
Most occurring blocks
Value | Count | Frequency (%) |
ASCII | 4019384 | |
None | 99765 | 2.4% |
Most frequent character per block
ASCII
Value | Count | Frequency (%) |
A | 803921 | |
O | 424463 | |
N | 328006 | |
I | 324942 | |
T | 316651 | 7.9% |
C | 229766 | 5.7% |
R | 222404 | 5.5% |
L | 211884 | 5.3% |
U | 183212 | 4.6% |
E | 162148 | 4.0% |
Other values (13) | 811987 |
None
Value | Count | Frequency (%) |
Á | 82832 | |
Ñ | 16933 | 17.0% |
PUNT_LECTURA_CRITICA
Real number (ℝ≥0)
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
Distinct | 65 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 52.15730522 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 127 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 35 |
Q1 | 45 |
median | 52 |
Q3 | 60 |
95-th percentile | 69 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 15 |
Descriptive statistics
Standard deviation | 10.53796336 |
---|---|
Coefficient of variation (CV) | 0.2020419443 |
Kurtosis | -0.2639727926 |
Mean | 52.15730522 |
Median Absolute Deviation (MAD) | 8 |
Skewness | -0.02693031487 |
Sum | 28488946 |
Variance | 111.0486717 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
53 | 19780 | 3.6% |
54 | 19719 | 3.6% |
55 | 19616 | 3.6% |
52 | 19561 | 3.6% |
51 | 19036 | 3.5% |
56 | 18879 | 3.5% |
50 | 18765 | 3.4% |
57 | 18623 | 3.4% |
49 | 18052 | 3.3% |
58 | 17915 | 3.3% |
Other values (55) | 356266 |
Value | Count | Frequency (%) |
0 | 127 | < 0.1% |
20 | 7 | < 0.1% |
21 | 21 | < 0.1% |
22 | 42 | < 0.1% |
23 | 116 | < 0.1% |
24 | 201 | < 0.1% |
25 | 360 | 0.1% |
26 | 577 | |
27 | 942 | |
28 | 1292 |
Value | Count | Frequency (%) |
100 | 221 | < 0.1% |
82 | 5 | < 0.1% |
81 | 44 | < 0.1% |
80 | 261 | < 0.1% |
79 | 526 | 0.1% |
78 | 724 | 0.1% |
77 | 1034 | |
76 | 1404 | |
75 | 1938 | |
74 | 2484 |
PUNT_MATEMATICAS
Real number (ℝ≥0)
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
Distinct | 73 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 50.60634882 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 10 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 31 |
Q1 | 42 |
median | 51 |
Q3 | 59 |
95-th percentile | 70 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 11.99764898 |
---|---|
Coefficient of variation (CV) | 0.237077941 |
Kurtosis | -0.2457941652 |
Mean | 50.60634882 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.05566056913 |
Sum | 27641795 |
Variance | 143.943581 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
52 | 17011 | 3.1% |
51 | 17010 | 3.1% |
53 | 16962 | 3.1% |
50 | 16952 | 3.1% |
49 | 16930 | 3.1% |
54 | 16724 | 3.1% |
48 | 16548 | 3.0% |
55 | 16469 | 3.0% |
56 | 16215 | 3.0% |
47 | 16198 | 3.0% |
Other values (63) | 379193 |
Value | Count | Frequency (%) |
0 | 10 | < 0.1% |
15 | 11 | < 0.1% |
16 | 17 | < 0.1% |
17 | 69 | < 0.1% |
18 | 144 | < 0.1% |
19 | 256 | < 0.1% |
20 | 406 | 0.1% |
21 | 618 | |
22 | 881 | |
23 | 1215 |
Value | Count | Frequency (%) |
100 | 526 | 0.1% |
85 | 9 | < 0.1% |
84 | 89 | < 0.1% |
83 | 385 | 0.1% |
82 | 390 | 0.1% |
81 | 636 | 0.1% |
80 | 816 | |
79 | 1005 | |
78 | 1276 | |
77 | 1597 |
PUNT_C_NATURALES
Real number (ℝ≥0)
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
Distinct | 66 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 48.2347788 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 18 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 31 |
Q1 | 40 |
median | 48 |
Q3 | 56 |
95-th percentile | 67 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 16 |
Descriptive statistics
Standard deviation | 10.7640352 |
---|---|
Coefficient of variation (CV) | 0.2231592115 |
Kurtosis | -0.3215417997 |
Mean | 48.2347788 |
Median Absolute Deviation (MAD) | 8 |
Skewness | 0.2308582355 |
Sum | 26346415 |
Variance | 115.8644538 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
45 | 19175 | 3.5% |
46 | 19056 | 3.5% |
47 | 18927 | 3.5% |
48 | 18744 | 3.4% |
49 | 18420 | 3.4% |
43 | 18323 | 3.4% |
44 | 18309 | 3.4% |
50 | 17871 | 3.3% |
42 | 17768 | 3.3% |
51 | 17579 | 3.2% |
Other values (56) | 362040 |
Value | Count | Frequency (%) |
0 | 18 | < 0.1% |
19 | 6 | < 0.1% |
20 | 49 | < 0.1% |
21 | 136 | < 0.1% |
22 | 311 | 0.1% |
23 | 489 | 0.1% |
24 | 839 | 0.2% |
25 | 1185 | |
26 | 1801 | |
27 | 2471 |
Value | Count | Frequency (%) |
100 | 138 | < 0.1% |
82 | 34 | < 0.1% |
81 | 104 | < 0.1% |
80 | 237 | < 0.1% |
79 | 339 | 0.1% |
78 | 453 | 0.1% |
77 | 641 | |
76 | 881 | |
75 | 1129 | |
74 | 1397 |
PUNT_SOCIALES_CIUDADANAS
Real number (ℝ≥0)
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
HIGH CORRELATION
Distinct | 70 |
---|---|
Distinct (%) | < 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 46.22458862 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 15 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 28 |
Q1 | 37 |
median | 45 |
Q3 | 55 |
95-th percentile | 67 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 18 |
Descriptive statistics
Standard deviation | 12.14058808 |
---|---|
Coefficient of variation (CV) | 0.2626435074 |
Kurtosis | -0.5042016304 |
Mean | 46.22458862 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 0.3156098131 |
Sum | 25248425 |
Variance | 147.393879 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
39 | 16509 | 3.0% |
38 | 16405 | 3.0% |
37 | 16399 | 3.0% |
40 | 16204 | 3.0% |
41 | 16059 | 2.9% |
36 | 16000 | 2.9% |
42 | 15920 | 2.9% |
43 | 15737 | 2.9% |
44 | 15617 | 2.9% |
45 | 15282 | 2.8% |
Other values (60) | 386080 |
Value | Count | Frequency (%) |
0 | 15 | < 0.1% |
16 | 2 | < 0.1% |
17 | 21 | < 0.1% |
18 | 71 | < 0.1% |
19 | 180 | < 0.1% |
20 | 333 | 0.1% |
21 | 677 | 0.1% |
22 | 1143 | |
23 | 1815 | |
24 | 2658 |
Value | Count | Frequency (%) |
100 | 227 | < 0.1% |
83 | 7 | < 0.1% |
82 | 41 | < 0.1% |
81 | 214 | < 0.1% |
80 | 290 | 0.1% |
79 | 498 | 0.1% |
78 | 626 | |
77 | 818 | |
76 | 986 | |
75 | 1328 |
Distinct | 70 |
---|---|
Distinct (%) | < 0.1% |
Missing | 19 |
Missing (%) | < 0.1% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 48.4168911 |
Minimum | 0 |
---|---|
Maximum | 100 |
Zeros | 142 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 30 |
Q1 | 39 |
median | 48 |
Q3 | 56 |
95-th percentile | 71 |
Maximum | 100 |
Range | 100 |
Interquartile range (IQR) | 17 |
Descriptive statistics
Standard deviation | 12.55843838 |
---|---|
Coefficient of variation (CV) | 0.2593813459 |
Kurtosis | 0.1813852678 |
Mean | 48.4168911 |
Median Absolute Deviation (MAD) | 9 |
Skewness | 0.4451975185 |
Sum | 26444967 |
Variance | 157.7143745 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
45 | 16195 | 3.0% |
54 | 16182 | 3.0% |
40 | 16138 | 3.0% |
46 | 16127 | 3.0% |
53 | 16083 | 2.9% |
41 | 15976 | 2.9% |
39 | 15967 | 2.9% |
44 | 15774 | 2.9% |
47 | 15762 | 2.9% |
42 | 15716 | 2.9% |
Other values (60) | 386273 |
Value | Count | Frequency (%) |
0 | 142 | < 0.1% |
20 | 32 | < 0.1% |
21 | 165 | < 0.1% |
22 | 590 | 0.1% |
23 | 1625 | 0.3% |
24 | 2136 | 0.4% |
25 | 2541 | |
26 | 3314 | |
27 | 4766 | |
28 | 5492 |
Value | Count | Frequency (%) |
100 | 1247 | |
87 | 21 | < 0.1% |
86 | 253 | < 0.1% |
85 | 250 | < 0.1% |
84 | 313 | 0.1% |
83 | 404 | 0.1% |
82 | 690 | |
81 | 1367 | |
80 | 1225 | |
79 | 1551 |
Distinct | 389 |
---|---|
Distinct (%) | 0.1% |
Missing | 0 |
Missing (%) | 0.0% |
Infinite | 0 |
Infinite (%) | 0.0% |
Mean | 246.1864642 |
Minimum | 0 |
---|---|
Maximum | 477 |
Zeros | 4 |
Zeros (%) | < 0.1% |
Negative | 0 |
Negative (%) | 0.0% |
Memory size | 4.2 MiB |
Quantile statistics
Minimum | 0 |
---|---|
5-th percentile | 167 |
Q1 | 207 |
median | 243 |
Q3 | 282 |
95-th percentile | 335 |
Maximum | 477 |
Range | 477 |
Interquartile range (IQR) | 75 |
Descriptive statistics
Standard deviation | 51.38685767 |
---|---|
Coefficient of variation (CV) | 0.2087314501 |
Kurtosis | -0.4595364194 |
Mean | 246.1864642 |
Median Absolute Deviation (MAD) | 37 |
Skewness | 0.2625281501 |
Sum | 134470001 |
Variance | 2640.609142 |
Monotonicity | Not monotonic |
Value | Count | Frequency (%) |
230 | 4596 | 0.8% |
235 | 4593 | 0.8% |
238 | 4578 | 0.8% |
237 | 4574 | 0.8% |
245 | 4552 | 0.8% |
233 | 4527 | 0.8% |
240 | 4500 | 0.8% |
243 | 4474 | 0.8% |
242 | 4456 | 0.8% |
227 | 4441 | 0.8% |
Other values (379) | 500921 |
Value | Count | Frequency (%) |
0 | 4 | |
11 | 1 | < 0.1% |
14 | 1 | < 0.1% |
21 | 1 | < 0.1% |
28 | 1 | < 0.1% |
44 | 1 | < 0.1% |
46 | 1 | < 0.1% |
54 | 1 | < 0.1% |
55 | 1 | < 0.1% |
57 | 1 | < 0.1% |
Value | Count | Frequency (%) |
477 | 1 | < 0.1% |
475 | 1 | < 0.1% |
473 | 1 | < 0.1% |
467 | 1 | < 0.1% |
460 | 1 | < 0.1% |
457 | 1 | < 0.1% |
452 | 1 | < 0.1% |
450 | 1 | < 0.1% |
449 | 2 | |
448 | 4 |
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.First rows
ESTU_GENERO | ESTU_DEPTO_RESIDE | FAMI_ESTRATOVIVIENDA | FAMI_TIENEINTERNET | FAMI_SITUACIONECONOMICA | ESTU_DEDICACIONLECTURADIARIA | ESTU_DEDICACIONINTERNET | ESTU_HORASSEMANATRABAJA | COLE_DEPTO_UBICACION | PUNT_LECTURA_CRITICA | PUNT_MATEMATICAS | PUNT_C_NATURALES | PUNT_SOCIALES_CIUDADANAS | PUNT_INGLES | PUNT_GLOBAL | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
0 | M | MAGDALENA | Estrato 3 | Si | Peor | Entre 30 y 60 minutos | Entre 30 y 60 minutos | Menos de 10 horas | MAGDALENA | 47 | 48 | 37 | 30 | 54.0 | 208 |
1 | M | BOGOTÁ | Estrato 3 | Si | Mejor | Entre 30 y 60 minutos | Entre 30 y 60 minutos | Menos de 10 horas | BOGOTÁ | 60 | 65 | 54 | 59 | 63.0 | 299 |
2 | M | BOLIVAR | Estrato 1 | No | Igual | Entre 30 y 60 minutos | Más de 3 horas | 0 | BOLIVAR | 66 | 57 | 41 | 74 | 64.0 | 299 |
3 | M | BOGOTÁ | Estrato 3 | No | Igual | 30 minutos o menos | Entre 30 y 60 minutos | Más de 30 horas | BOGOTÁ | 62 | 54 | 61 | 73 | 53.0 | 309 |
4 | M | BOGOTÁ | Estrato 3 | Si | Mejor | No leo por entretenimiento | Más de 3 horas | Más de 30 horas | BOGOTÁ | 63 | 57 | 55 | 57 | 52.0 | 288 |
5 | M | ATLANTICO | - | - | Mejor | - | NaN | Menos de 10 horas | ATLANTICO | 49 | 29 | 41 | 41 | 35.0 | 198 |
6 | M | VALLE | Estrato 4 | Si | Mejor | 30 minutos o menos | Entre 30 y 60 minutos | 0 | VALLE | 76 | 70 | 70 | 68 | 72.0 | 355 |
7 | M | SANTANDER | Estrato 3 | Si | Igual | Entre 1 y 2 horas | Más de 3 horas | Menos de 10 horas | SANTANDER | 57 | 65 | 63 | 66 | 60.0 | 313 |
8 | M | CUNDINAMARCA | Estrato 3 | Si | Igual | No leo por entretenimiento | Entre 1 y 3 horas | 0 | CUNDINAMARCA | 62 | 62 | 66 | 39 | 63.0 | 288 |
9 | M | SUCRE | Estrato 3 | Si | Igual | Entre 30 y 60 minutos | Entre 30 y 60 minutos | 0 | SUCRE | 68 | 66 | 63 | 77 | 51.0 | 336 |
Last rows
ESTU_GENERO | ESTU_DEPTO_RESIDE | FAMI_ESTRATOVIVIENDA | FAMI_TIENEINTERNET | FAMI_SITUACIONECONOMICA | ESTU_DEDICACIONLECTURADIARIA | ESTU_DEDICACIONINTERNET | ESTU_HORASSEMANATRABAJA | COLE_DEPTO_UBICACION | PUNT_LECTURA_CRITICA | PUNT_MATEMATICAS | PUNT_C_NATURALES | PUNT_SOCIALES_CIUDADANAS | PUNT_INGLES | PUNT_GLOBAL | |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
546202 | F | HUILA | Estrato 3 | Si | Igual | Más de 2 horas | Entre 30 y 60 minutos | 0 | HUILA | 100 | 73 | 70 | 72 | 84.0 | 396 |
546203 | F | GUAVIARE | Estrato 1 | No | Mejor | Entre 30 y 60 minutos | 30 minutos o menos | Menos de 10 horas | GUAVIARE | 71 | 66 | 50 | 56 | 56.0 | 302 |
546204 | F | VICHADA | Estrato 2 | Si | Igual | Entre 1 y 2 horas | Entre 1 y 3 horas | 0 | VICHADA | 71 | 71 | 58 | 75 | 68.0 | 343 |
546205 | M | NORTE SANTANDER | Estrato 2 | No | Mejor | 30 minutos o menos | Entre 30 y 60 minutos | Menos de 10 horas | NORTE SANTANDER | 54 | 58 | 60 | 54 | 49.0 | 280 |
546206 | M | SUCRE | Estrato 2 | Si | Igual | Entre 30 y 60 minutos | Entre 1 y 3 horas | 0 | SUCRE | 100 | 100 | 82 | 75 | 100.0 | 450 |
546207 | M | ANTIOQUIA | Estrato 2 | Si | Igual | No leo por entretenimiento | Más de 3 horas | Menos de 10 horas | ANTIOQUIA | 76 | 78 | 65 | 74 | 58.0 | 360 |
546208 | M | BOGOTÁ | Estrato 3 | Si | Mejor | No leo por entretenimiento | Más de 3 horas | 0 | BOGOTÁ | 75 | 73 | 72 | 67 | 74.0 | 360 |
546209 | M | ARAUCA | Estrato 2 | Si | Igual | 30 minutos o menos | Entre 30 y 60 minutos | 0 | ARAUCA | 72 | 83 | 71 | 77 | 72.0 | 377 |
546210 | M | SANTANDER | Estrato 1 | No | Igual | 30 minutos o menos | Entre 30 y 60 minutos | Más de 30 horas | SANTANDER | 59 | 61 | 54 | 52 | 46.0 | 278 |
546211 | M | BOGOTÁ | Estrato 3 | Si | Igual | Entre 30 y 60 minutos | Entre 1 y 3 horas | 0 | BOGOTÁ | 76 | 73 | 72 | 71 | 74.0 | 365 |