Publications by Gina Occhipinti
Discussion 7c
rm(list = ls()) gc() ## used (Mb) gc trigger (Mb) limit (Mb) max used (Mb) ## Ncells 536646 28.7 1198565 64.1 NA 669417 35.8 ## Vcells 990944 7.6 8388608 64.0 16384 1851813 14.2 cat("\f") dev.off ## function (which = dev.cur()) ## { ## if (which == 1) ## stop("cannot shut down device 1 (the null device...
4037 sym R (7130 sym/28 pcs) 7 img
Discussion 7b
setwd("/Users/ginaocchipinti/Documents/ADEC 7310 Data Analytics/Week 6") # then pull up the data from our WD and assign it to variables resorts and snow for merging resorts <- read.csv("resorts.csv") snow <- read.csv("snow.csv") #apply the merge function resort_snowfall <- merge(resorts, snow, by = c("ID", "ID")) price <- resort_snowfall$Price...
1683 sym Python (1268 sym/7 pcs)
Discussion 7a
Correlation is about relationships between variables. Correlation tells us how the variables change together and whether change in one affects a change in other and how so. For example, if as one variable changes (increases or decreases), so does another variable in that same direction, that’s a positive correlation. If as one variable increases ...
2392 sym R (11235 sym/23 pcs) 1 img
Week5_HW5
Homework 5, Data Analysis Resources : Please have a look at re-emphasizing the core concepts of CLT, standard error, hypothesis testing, confidence interval and p-values. Please skim through them (~15 minutes) before attempting the assignment to refresh your memories. Please find the Open Intro Statistics textbook (OpenStat_textbook.pdf) in our Dr...
17792 sym Python (14410 sym/68 pcs) 7 img
Week 5 Discussion 5
0. Begin with setting seed in R. The recommended way to specify seeds is - set.seed(seed = 42) , where seed can take on any single value that is interpreted as an integer (42 here, but you can put your favorite number instead). set.seed(777) 1. Please Google and describe Law of Large Numbers in your own words. The Law of Large Numbers states that ...
4339 sym Python (5841 sym/23 pcs) 3 img
Week 4 Discussion 4
I. SAMPLING METHODOLOGIES In your own words, after your online readings, describe what is probability sampling and non-probability sampling (max 5 lines each). You can even talk about the differences between the sampling methodologies. Sampling methods allow us to select a subset of individuals, or a sample, from a larger population, so that we ma...
7964 sym R (3364 sym/7 pcs) 1 img
Week 3 HW 3
Q1. A researcher wishes to conduct a study of the color preferences of new car buyers. Suppose that 50% of this population prefers the color red. If 20 buyers are randomly selected, what is the probability that between 9 and 12 (both inclusive) buyers would prefer red? Only 2 possible outcomes? yes, buyer likes red or not Outcomes mutually exclus...
4349 sym 2 img
Week 3 Discussion 3
Discussion 3 A: Please explain each of the 3 distributions in less than 4 sentences. Normal Distribution: Normal distributions are always centered around the average value, and the average or mean, mode and median are all equal. The normal distrubtion helps us understand the majority of values a given data point takes on. For example, adult height...
5489 sym Python (2008 sym/13 pcs) 2 img
Week 2 HW2
Week 2, HW 2 Q1 What is the probability of rolling a sum of 12 on three rolls of six-sided dice? sides <- 6 sum1 <- 12 all_prob <- expand.grid(rep(list(1:sides), 3)) rowSums(all_prob) ## [1] 3 4 5 6 7 8 4 5 6 7 8 9 5 6 7 8 9 10 6 7 8 9 10 11 7 ## [26] 8 9 10 11 12 8 9 10 11 12 13 4 5 6 7 8 9 5 6 7 8 9 10 ...
3326 sym
Week 2 Discussion 2
Discussion 2 Q1 Please explain Bayes Theorem in your own words, and give an example. Less than 10 sentences. Also, write out the formula. Pick up on how to to type equations in R Markdown using Latex terminology here Bayes Theorem or Bayes Rule is a way for us to update probabilities given new evidence. It has real world applications such as in m...
2734 sym R (8270 sym/77 pcs) 1 img