Publications by Susanna Wong

DATA 622 Assignment 1

09.03.2025

Assignment 1 Prompt. Introduction: This assignment focuses on one of the most important aspects of data science, Exploratory Data Analysis (EDA). Many surveys show that data scientists spend 60-80% of their time on data preparation. EDA allows you to identify data gaps and data imbalances, improve data quality, create better features, and gain a deep un...

7286 sym R (11349 sym/49 pcs) 29 img 1 tbl

DATA 608 Story 3

16.12.2024

Story 3: Do stricter gun laws reduce firearm deaths? This story explores whether stricter firearm control laws are linked to reduced firearm mortality rates in the United States. Using CDC data on firearm deaths per 100,000 people and categorizing states by the strictness of their gun laws, we will analyze the correlation between gun control...

654 sym R (26802 sym/10 pcs) 2 img

DATA 608 STORY 7

16.12.2024

Story 7: “Where Do Strategic Minerals Come From?” The United States’ economic and national security depend on the secure supply of critical minerals, essential for industries such as defense, energy, and technology. Many of these minerals are sourced from countries outside the US, raising concerns about supply chain vulnerabilities during geo...

1545 sym R (7874 sym/11 pcs) 3 img

DATA 608 Story 5

15.12.2024

Story 5: “What Is The Effect Of The Earth’s Temperature on Cyclonic Storms?” As global temperatures rise, we’re seeing more intense and frequent cyclonic storms, including hurricanes, typhoons, and tornadoes. But how are these storms connected to warming temperatures? In this presentation, we’ll explore data from the past to understand ho...

1566 sym R (3849 sym/5 pcs) 6 img

DATA 608 Story 4

13.12.2024

Story 4: “How Much Do We Get Paid?” Understanding salary variations across job roles is essential for career planning. With overlapping responsibilities among roles like Data Scientist, Data Engineer, and Business Analyst, it’s important to explore how pay differs by occupation and location. Using data from the U.S. Bureau of Labor Statistics...

1086 sym R (3898 sym/6 pcs) 7 img

DATA 608 Story 1 (Revised)

25.09.2024

Story 1: Infrastructure Investment & Jobs Act Funding Allocation. The allocation of funds under the Infrastructure Investment & Jobs Act represents a significant federal investment aimed at improving infrastructure across the United States. Understanding the distribution of these funds can shed light on whether allocation is equitable and if it favo...

4433 sym R (3640 sym/12 pcs) 5 img

DATA 608 Story 1

14.09.2024

...

10 sym

DATA 605 Crab Age

26.04.2024

Problem 2 You are to register for Kaggle.com (free) and compete in the Regression with a Crab Age Dataset competition. https://www.kaggle.com/competitions/playground-series-s3e16 I want you to do the following. raw_train <- read.csv("https://raw.githubusercontent.com/suswong/DATA-605/main/crabagetrain.csv") test <- read.csv("https://raw.githubuserc...

7037 sym R (1179610 sym/77 pcs) 9 img 1 tbl

Document

18.12.2023

R Markdown This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com. When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the...

597 sym 1 img

DATA 605 Final Part 1

18.12.2023

Problem 1 Using R, set a random seed equal to 1234 (i.e., set.seed(1234)). Generate a random variable X that has 10,000 continuous random uniform values between 5 and 15. Then generate a random variable Y that has 10,000 random normal values with a mean of 10 and a standard deviation of 2.89. set.seed(1234) X <- runif(n=10000, min = 5, max = 15) Y...

2558 sym 2 img