I've been learning a lot about implementing statistics in real-life situations. In this project, we used Rstudio, a powerful data analysis tool, to explore the average age of death. We found that the global average life expectancy is 72.74 years. Using a 95% confidence level (α = 0.05), we performed a one-sample hypothesis test on a sample of 200 individuals from a dataset of 500.
Our data came from Kaggle, containing records of individuals from various times and places, including their birth and death years and ages. To enhance visualization, we reduced the sample size to 200 individuals. We used tools like Rstudio and Excel for analysis. The project includes the raw and modified datasets, as well as the Rscript file with all calculations.
In summary, we chose to study factors influencing death and their relationships. We used hypothesis testing to compare our dataset's mean age to the global average. We also used correlation to examine age vs. year of death and the impact of quality of life on age at death. Finally, the Chi-Square test helped us assess the significance and independence of variables. This project allowed us to dive into data analysis and gain a deeper understanding of our findings through statistical methods.