Database and R script used for Project 2
-
Download heart.csv
heart.csv Details
- Sunday, 03 July 2022 [11.1KB] -
Download Group3_Project2.R
Group3_Project2.R Details
- Sunday, 03 July 2022 [3.1KB]
Project 2 Presentation
Journey throughout the course
Probability and Statistical Data Analysis is my favorite course this semester alongside Programming Technique 2. A lot of students may wonder why they are even learning this in the first place in computer science. I believe there is a bold reason. Speaking from my knowledge gained throughout this entire course, off the top of my head I can figure out some reasons why this course is so important:
- Computer hardware: how will predict expected lifetime of a computer that is composed of so many components, where each component has their own probabilities of failure?
- Data structures: How do we implement hash tables with the fastest lookup times if we do not know probability?
- Databases: How are people supposed to distribute data between fast memory and cheaper slower memory, so that the best access times are maintained? This course was one of the few building blocks for database new semester.
- Computer algorithms: How must one efficiently test a very large number to verify that it is a prime number without statistics?
These are some of the few reasons why this course was so important. I was in SECI2143-05 under Dr Sharin Hazlin Binti Huspi. I have received some of the best guidance from her in learning this course. I will acknowledge my teacher in the next section. However, she is one of the most friendly, engaging and resourceful teachers I have met so far. She has answered all of my inquiries in the most detailed way possible. Her classes were based on full student to student, teacher to student interaction and I have loved it.
Throughout this course we have done 4 assignments and 2 projects. Project 1's objective were to obtain data from a survey and perform statistical analysis on it. Project 2, the main focus of this reflection, was a project about acquiring a very interesting dataset and performing hypothesis testing, correlation testing and other statistical analysis to obtain meaningful findings. I had some of the best time this semester with my group-mates. The teamwork, communication and cooperation between us was perfect.
Reflection on SECI2143-05 (Probability and Statistical Data Analysis)
RSS
I have learned basic and somewhat intermediate R. I have used the resources my Dr gave us. But I have also read the book "YaRrr! Pirate's Guide to R" until chapter 15. I have learned how to import and tidy datasets, do plotting, t-tests (hypothesis testing), correlation test, regression analysis, goodness of fit test and chi-square test of independence in R. R also serves as a programming language, and I was messing around with it for a while before all the tests.
Dr Sharin is a really fun, friendly and helpful lecturer. She has helped me out with all sorts of confusions I have had. She has helped me by answering all of my questions outside of the class, whenever she was free. She is extremely resourceful and she gave us resources to better understand concepts outside of class. She gave us resources for the projects such as R Cookbook and websites to find databases. In class, she tried to imitate a face to face classroom environment. She encourages a lot of questions and always makes sure that everyone is focused in class.