data management summary notes, Cheat Sheet of Database Management Systems (DBMS)

data management summary notes including graphs and the collection of data

Typology: Cheat Sheet

2021/2022

Uploaded on 11/13/2025

bianca-asmr
Section 3: Relevant Points
Mortality rate vs. Covid-19 cases

Country               Mortality Rate    Covid-19 Cases       Model     Distance
                      (Per 1000) (%)    (Per 100,000) (%)
Peru                  5.6               4.9                  2.0484    2.8516
Chile                 6.3               1.2                  1.9882    -0.7882
Brazil                6.6               1.9                  1.9624    -0.0624
Trinidad and Tobago   8.6               2.3                  1.7904    0.5096
Argentina             7.6               1.3                  1.8764    -0.5764
Colombia              5.7               2.2                  2.0398    0.1602
Paraguay              5.6               2.5                  2.0484    0.4516
Mexico                6.1               4.5                  2.0054    2.4946
Uruguay               9.5               0.7                  1.713     -1.013
Ecuador               5.1               3.4                  2.0914    1.3086
Panama                5.1               0.8                  2.0914    -1.2914
Costa Rica            5.2               0.8                  2.0828    -1.2828
Belize                4.8               1                    2.1172    -1.1172
Jamaica               7.6               2.3                  1.8764    0.4236
Guatemala             4.7               1.6                  2.1258    -0.5258
Honduras              4.5               2.4                  2.143     0.257
El Salvador           7.1               2.1                  1.9194    0.1806
Dominican Republic    6.2               0.7                  1.9968    -1.2968
Venezuela             7.1               1.1                  1.9194    -0.8194
Haiti                 8.5               2.5                  1.799     0.701
Nicaragua             5.1               1.6                  2.0914    -0.4914

As seen on the residual plot, the point farthest from the line of best fit is (2.0484, 2.8516). On the linear model this point is (5.6, 4.9) and the country is Peru, which makes it the most outlying point in this data set. Because it lies far from where the rest of the data in the sample predicts it should be, this point affects the model and its outcome.

Mortality rate vs. murder cases

Country               Mortality Rate    Murder Cases         Model     Distance
                      (Per 1000) (%)    (Per 100,000) (%)
Peru                  5.6               8.3                  17.224    -8.924
Chile                 6.3               3.7                  17.952    -14.252
Brazil                6.6               19.3                 18.264    1.036
Trinidad and Tobago   8.6               28.2                 20.344    7.856
Argentina             7.6               4.6                  19.304    -14.704
Colombia              5.7               24.3                 17.328    6.972
Paraguay              5.6               6.6                  17.224    -10.624
Mexico                6.1               27                   17.744    9.256
Uruguay               9.5               9.3                  21.28     -11.98
Ecuador               5.1               7.7                  16.704    -9.004
Panama                5.1               11.6                 16.704    -5.104
Costa Rica            5.2               11.2                 16.808    -5.608
Belize                4.8               24.3                 16.392    7.908
Jamaica               7.6               46.5                 19.304    27.196
Guatemala             4.7               15.3                 16.288    -0.988
Honduras              4.5               37.6                 16.08     21.6
El Salvador           7.1               19.7                 18.784    0.916
Dominican Republic    6.2               9                    17.848    -8.848
Venezuela             7.1               45.6                 18.784    26.816
Haiti                 8.5               13                   20.24     -7.24
Nicaragua             5.1               4.4                  16.704    -12.304

As seen on the residual plot, the point farthest from the line of best fit is (19.304, 27.196). On the linear model this point is (7.6, 46.5) and the country is Jamaica, which makes it the most outlying point in this data set. Because it lies far from where the rest of the data in the sample predicts it should be, this point affects the model and its outcome.
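The Distance column is the residual (observed value minus model value), and the outlier is the row whose residual is largest in absolute value. A minimal Python sketch of that check, using the Model values exactly as listed in the tables; the function names `residuals` and `farthest` are illustrative and not from the source:

```python
# Rows are (country, mortality rate, observed cases, model value), copied from the tables.
covid = [
    ("Peru", 5.6, 4.9, 2.0484), ("Chile", 6.3, 1.2, 1.9882), ("Brazil", 6.6, 1.9, 1.9624),
    ("Trinidad and Tobago", 8.6, 2.3, 1.7904), ("Argentina", 7.6, 1.3, 1.8764),
    ("Colombia", 5.7, 2.2, 2.0398), ("Paraguay", 5.6, 2.5, 2.0484), ("Mexico", 6.1, 4.5, 2.0054),
    ("Uruguay", 9.5, 0.7, 1.713), ("Ecuador", 5.1, 3.4, 2.0914), ("Panama", 5.1, 0.8, 2.0914),
    ("Costa Rica", 5.2, 0.8, 2.0828), ("Belize", 4.8, 1.0, 2.1172), ("Jamaica", 7.6, 2.3, 1.8764),
    ("Guatemala", 4.7, 1.6, 2.1258), ("Honduras", 4.5, 2.4, 2.143), ("El Salvador", 7.1, 2.1, 1.9194),
    ("Dominican Republic", 6.2, 0.7, 1.9968), ("Venezuela", 7.1, 1.1, 1.9194),
    ("Haiti", 8.5, 2.5, 1.799), ("Nicaragua", 5.1, 1.6, 2.0914),
]
murder = [
    ("Peru", 5.6, 8.3, 17.224), ("Chile", 6.3, 3.7, 17.952), ("Brazil", 6.6, 19.3, 18.264),
    ("Trinidad and Tobago", 8.6, 28.2, 20.344), ("Argentina", 7.6, 4.6, 19.304),
    ("Colombia", 5.7, 24.3, 17.328), ("Paraguay", 5.6, 6.6, 17.224), ("Mexico", 6.1, 27.0, 17.744),
    ("Uruguay", 9.5, 9.3, 21.28), ("Ecuador", 5.1, 7.7, 16.704), ("Panama", 5.1, 11.6, 16.704),
    ("Costa Rica", 5.2, 11.2, 16.808), ("Belize", 4.8, 24.3, 16.392), ("Jamaica", 7.6, 46.5, 19.304),
    ("Guatemala", 4.7, 15.3, 16.288), ("Honduras", 4.5, 37.6, 16.08), ("El Salvador", 7.1, 19.7, 18.784),
    ("Dominican Republic", 6.2, 9.0, 17.848), ("Venezuela", 7.1, 45.6, 18.784),
    ("Haiti", 8.5, 13.0, 20.24), ("Nicaragua", 5.1, 4.4, 16.704),
]


def residuals(rows):
    """Return (country, model value, observed - model) for every row."""
    return [(country, model, round(observed - model, 4))
            for country, _mortality, observed, model in rows]


def farthest(rows):
    """Return the row whose residual has the largest absolute value."""
    return max(residuals(rows), key=lambda row: abs(row[2]))


print(farthest(covid))   # ('Peru', 2.0484, 2.8516)
print(farthest(murder))  # ('Jamaica', 19.304, 27.196)
```

Recomputing the residuals this way reproduces the listed Distance values, with one exception: for Honduras in the murder table, 37.6 minus 16.08 gives 21.52, while the table lists 21.6.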
Data set          Mean           Standard Deviation   X-Value   Z-Score         Percentile
Covid-19 cases    1.99047619     1.162714394          5.6       3.104394191     33rd
Murder cases      17.96190476    1.41572798           7.6       -7.319135389    76th
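The z-scores listed here follow from z = (x - mean) / standard deviation. The sketch below reproduces both reported values and also shows where the Covid-19 mean and standard deviation come from: they match the sample mean and sample standard deviation of the Covid-19 cases column. Note that the X-value 5.6 is Peru's mortality rate, while the mean and standard deviation describe Covid-19 cases. The last call is an assumption about what a like-for-like comparison would look like (Peru's Covid-19 value, 4.9, against the Covid-19 statistics); the source does not compute it.

```python
import statistics


def z_score(x, mean, sd):
    """Number of standard deviations that x lies from the mean."""
    return (x - mean) / sd


# Covid-19 cases column, in table order (Peru first, Nicaragua last).
covid_cases = [4.9, 1.2, 1.9, 2.3, 1.3, 2.2, 2.5, 4.5, 0.7, 3.4, 0.8,
               0.8, 1.0, 2.3, 1.6, 2.4, 2.1, 0.7, 1.1, 2.5, 1.6]

covid_mean = statistics.mean(covid_cases)   # matches the listed 1.99047619
covid_sd = statistics.stdev(covid_cases)    # sample SD, matches the listed 1.162714394

# Reproduce the reported z-scores from the listed mean, SD, and X-value.
covid_z = z_score(5.6, 1.99047619, 1.162714394)
murder_z = z_score(7.6, 17.96190476, 1.41572798)
print(round(covid_z, 4), round(murder_z, 4))  # 3.1044 -7.3191

# Assumption: standardize Peru's Covid-19 value against the Covid-19 statistics.
peru_cases_z = z_score(4.9, covid_mean, covid_sd)
print(round(peru_cases_z, 2))
```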
Percentile rank of X = (# of values below x / n) x 100

Covid-19 data set:  Percentile rank of X = 7/21 x 100, so the percentile rank is the 33rd
Murder data set:    Percentile rank of X = 16/21 x 100, so the percentile rank is the 76th

Linear Model Data Without The Outlier (Covid-19 cases):

Country               Mortality Rate    Covid-19 Cases
                      (Per 1000) (%)    (Per 100,000) (%)
Chile                 6.3               1.2
Brazil                6.6               1.9
Trinidad and Tobago   8.6               2.3
Argentina             7.6               1.3
Colombia              5.7               2.2
Paraguay              5.6               2.5
Mexico                6.1               4.5
Uruguay               9.5               0.7
Ecuador               5.1               3.4
Panama                5.1               0.8
Costa Rica            5.2               0.8
Belize                4.8               1
Jamaica               7.6               2.3
Guatemala             4.7               1.6
Honduras              4.5               2.4
El Salvador           7.1               2.1
Dominican Republic    6.2               0.7
Venezuela             7.1               1.1
Haiti                 8.5               2.5
Nicaragua             5.1               1.6

Linear Model Data Without The Outlier (murder cases):

Country               Mortality Rate    Murder Cases
                      (Per 1000) (%)    (Per 100,000) (%)
Peru                  5.6               8.3
Chile                 6.3               3.7
Brazil                6.6               19.3
Trinidad and Tobago   8.6               28.2
Argentina             7.6               4.6
Colombia              5.7               24.3
Paraguay              5.6               6.6
Mexico                6.1               27
Uruguay               9.5               9.3
Ecuador               5.1               7.7
Panama                5.1               11.6
Costa Rica            5.2               11.2
Belize                4.8               24.3
Guatemala             4.7               15.3
Honduras              4.5               37.6
El Salvador           7.1               19.7
Dominican Republic    6.2               9
Venezuela             7.1               45.6
Haiti                 8.5               13
Nicaragua             5.1               4.4

Linear Model With The Outlier: [scatter plots for both data sets; figures not included in this text]
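The percentile rank formula, (# of values below x / n) x 100, can be checked directly against the mortality rate column. This is a small sketch; the function name `percentile_rank` is illustrative. It counts values strictly below x, so ties such as the two 5.6 entries (Peru and Paraguay) are not counted.

```python
# Mortality rates (per 1000) for all 21 countries, in table order.
mortality = [5.6, 6.3, 6.6, 8.6, 7.6, 5.7, 5.6, 6.1, 9.5, 5.1, 5.1,
             5.2, 4.8, 7.6, 4.7, 4.5, 7.1, 6.2, 7.1, 8.5, 5.1]


def percentile_rank(values, x):
    """Percentage of values strictly below x."""
    below = sum(1 for v in values if v < x)
    return below / len(values) * 100


print(round(percentile_rank(mortality, 5.6)))  # 33 (7 of 21 values are below 5.6)
print(round(percentile_rank(mortality, 7.6)))  # 76 (16 of 21 values are below 7.6)
```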
Linear Model Without The Outlier:

Mortality rates and Covid-19 cases: The linear model for mortality rates and Covid-19 cases in third world countries in 2020 changed after the outlier was removed. The correlation has decreased and is now 0 instead of 0.011. This is probably because the outlier lay far from the rest of the data and pulled the line of best fit toward itself, which made the correlation stronger than it is with the outlier removed. After the outlier was removed, most of the points are spread out around the line of best fit, indicating a weaker negative correlation.

Mortality rates and murder cases: The linear model for mortality rates and murder cases in third world countries in 2020 changed after the outlier was removed. The correlation has decreased and is now 0 instead of 0.013. This is probably because the outlier lay far from the rest of the data and pulled the line of best fit toward itself, which made the correlation stronger than it is with the outlier removed. After the outlier was removed, most of the points are spread out around the line of best fit, indicating a weaker correlation.
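The effect of removing the outlier can be checked by computing the Pearson correlation with and without Peru for the Covid-19 data. The sketch below uses plain Python and the values from the tables in this section; the helper name `pearson_r` is illustrative. The murder data can be checked the same way by swapping in that column and dropping Jamaica.

```python
from math import sqrt

# (country, mortality rate, Covid-19 cases), copied from the table in this section.
rows = [
    ("Peru", 5.6, 4.9), ("Chile", 6.3, 1.2), ("Brazil", 6.6, 1.9),
    ("Trinidad and Tobago", 8.6, 2.3), ("Argentina", 7.6, 1.3), ("Colombia", 5.7, 2.2),
    ("Paraguay", 5.6, 2.5), ("Mexico", 6.1, 4.5), ("Uruguay", 9.5, 0.7),
    ("Ecuador", 5.1, 3.4), ("Panama", 5.1, 0.8), ("Costa Rica", 5.2, 0.8),
    ("Belize", 4.8, 1.0), ("Jamaica", 7.6, 2.3), ("Guatemala", 4.7, 1.6),
    ("Honduras", 4.5, 2.4), ("El Salvador", 7.1, 2.1), ("Dominican Republic", 6.2, 0.7),
    ("Venezuela", 7.1, 1.1), ("Haiti", 8.5, 2.5), ("Nicaragua", 5.1, 1.6),
]


def pearson_r(points):
    """Pearson correlation coefficient for a list of (x, y) pairs."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    syy = sum((y - mean_y) ** 2 for _, y in points)
    return sxy / sqrt(sxx * syy)


with_outlier = [(x, y) for _, x, y in rows]
without_outlier = [(x, y) for name, x, y in rows if name != "Peru"]

r_all = pearson_r(with_outlier)
r_trim = pearson_r(without_outlier)
print(f"with Peru:    r = {r_all:.3f}, r^2 = {r_all ** 2:.3f}")
print(f"without Peru: r = {r_trim:.3f}, r^2 = {r_trim ** 2:.3f}")
```

With these 21 rows, r squared for the full data rounds to 0.011, which suggests that the 0.011 quoted in the text is r squared rather than r itself. That reading is an inference from the numbers, not something the source states.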