Publications by Gina Occhipinti

Discussion 7c

01.05.2024

rm(list = ls()) gc() ## used (Mb) gc trigger (Mb) limit (Mb) max used (Mb) ## Ncells 536646 28.7 1198565 64.1 NA 669417 35.8 ## Vcells 990944 7.6 8388608 64.0 16384 1851813 14.2 cat("\f") dev.off ## function (which = dev.cur()) ## { ## if (which == 1) ## stop("cannot shut down device 1 (the null device...

4037 sym R (7130 sym/28 pcs) 7 img

Discussion 7b

28.04.2024

setwd("/Users/ginaocchipinti/Documents/ADEC 7310 Data Analytics/Week 6") # then pull up the data from our WD and assign it to variables resorts and snow for merging resorts <- read.csv("resorts.csv") snow <- read.csv("snow.csv") #apply the merge function resort_snowfall <- merge(resorts, snow, by = c("ID", "ID")) price <- resort_snowfall$Price...

1683 sym Python (1268 sym/7 pcs)

Discussion 7a

28.04.2024

Correlation is about relationships between variables. Correlation tells us how the variables change together and whether change in one affects a change in other and how so. For example, if as one variable changes (increases or decreases), so does another variable in that same direction, that’s a positive correlation. If as one variable increases ...

2392 sym R (11235 sym/23 pcs) 1 img

Week5_HW5

23.04.2024

Homework 5, Data Analysis Resources : Please have a look at re-emphasizing the core concepts of CLT, standard error, hypothesis testing, confidence interval and p-values. Please skim through them (~15 minutes) before attempting the assignment to refresh your memories. Please find the Open Intro Statistics textbook (OpenStat_textbook.pdf) in our Dr...

17792 sym Python (14410 sym/68 pcs) 7 img

Week 5 Discussion 5

21.04.2024

0. Begin with setting seed in R. The recommended way to specify seeds is - set.seed(seed = 42) , where seed can take on any single value that is interpreted as an integer (42 here, but you can put your favorite number instead). set.seed(777) 1. Please Google and describe Law of Large Numbers in your own words. The Law of Large Numbers states that ...

4339 sym Python (5841 sym/23 pcs) 3 img

Week 4 Discussion 4

13.04.2024

I. SAMPLING METHODOLOGIES In your own words, after your online readings, describe what is probability sampling and non-probability sampling (max 5 lines each). You can even talk about the differences between the sampling methodologies. Sampling methods allow us to select a subset of individuals, or a sample, from a larger population, so that we ma...

7964 sym R (3364 sym/7 pcs) 1 img

Week 3 HW 3

07.04.2024

Q1. A researcher wishes to conduct a study of the color preferences of new car buyers. Suppose that 50% of this population prefers the color red. If 20 buyers are randomly selected, what is the probability that between 9 and 12 (both inclusive) buyers would prefer red? Only 2 possible outcomes? yes, buyer likes red or not Outcomes mutually exclus...

4349 sym 2 img

Week 3 Discussion 3

06.04.2024

Discussion 3 A: Please explain each of the 3 distributions in less than 4 sentences. Normal Distribution: Normal distributions are always centered around the average value, and the average or mean, mode and median are all equal. The normal distrubtion helps us understand the majority of values a given data point takes on. For example, adult height...

5489 sym Python (2008 sym/13 pcs) 2 img

Week 2 HW2

24.03.2024

Week 2, HW 2 Q1 What is the probability of rolling a sum of 12 on three rolls of six-sided dice? sides <- 6 sum1 <- 12 all_prob <- expand.grid(rep(list(1:sides), 3)) rowSums(all_prob) ## [1] 3 4 5 6 7 8 4 5 6 7 8 9 5 6 7 8 9 10 6 7 8 9 10 11 7 ## [26] 8 9 10 11 12 8 9 10 11 12 13 4 5 6 7 8 9 5 6 7 8 9 10 ...

3326 sym

Week 2 Discussion 2

23.03.2024

Discussion 2 Q1 Please explain Bayes Theorem in your own words, and give an example. Less than 10 sentences. Also, write out the formula. Pick up on how to to type equations in R Markdown using Latex terminology here Bayes Theorem or Bayes Rule is a way for us to update probabilities given new evidence. It has real world applications such as in m...

2734 sym R (8270 sym/77 pcs) 1 img