Publications by Arvind Sharma
week2_meetup
1 Linear Regression Linear regression quantifies the relationship between two (or more variables). We will cover it more thoroughly in the last week of the course. We can choose an outcome variable, and write it as a function of other variables called “covariates” and estimate the “best fit” line which tells us whether the two variables are...
2132 sym R (4475 sym/36 pcs) 4 img
Updated 03192024
R Markdown This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com. When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the...
664 sym R (54980 sym/62 pcs) 9 img
Draft 03192024
R Markdown This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com. When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within the...
1358 sym R (50177 sym/52 pcs) 7 img
Discussion1
1 Discussion on Iris Dataset In OpenStats Chapter 1, Exercises, Problem 9, there is a reference to Fisher’s iris data. Discuss the solutions to this problem, and then conduct a descriptive analysis of the data which are conveniently available in R. To access the data in R, simply type “iris.” Investigate any additional R libraries that might ...
3388 sym R (11761 sym/68 pcs) 7 img
HW1_solution
Instructions Go to Kaggle.com (owned by Google). Create a free account. Sign up for the “Titanic: Machine Learning through Disaster” competition located here: https://www.kaggle.com/c/titanic/data?select=train.csv Download the train.csv data. Open the train.csv file in R. To do so, use something like mydata <- read.csv('D:/train.csv') but rep...
6364 sym R (23874 sym/86 pcs) 3 img
in_class_MarkdownScript-saving
1 Section 1 This is my code. 2+2 ## [1] 4 1.1 Import Data remove(list = ls()) setwd("/Users/arvindsharma/Dropbox/WCAS/Data Analysis/Data Analysis - Spring II 2024/Data Analysis - Spring II 2024 (shared files)/W1/Week_1-2/titanic") ?read.csv test <- read.csv(file = "test.csv") # explicitly specifying out the default values - gives same output ...
339 sym R (2392 sym/12 pcs)
code_variance_in_R
0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.0.1 1 Prompt Calculate variance and standard deviation directly in R. Calculate variance and standard deviation by hand in R. Compare the numbers and ascertain they are the same. 2 Clear the work space rm(list = ls()) # Clear environment...
2474 sym Python (3461 sym/53 pcs)
Types_of_Data
1 R Inbuilt Data No need to install/load any package. datasets are already pre-installed. ?datasets() library(help = "datasets") ?swiss # give information on data set 'swiss' help(USArrests) # give information on data set 'USArrests' You will find some code that can be used to create graphs in the help files....
280 sym R (904 sym/14 pcs) 2 img
intro_D1_short
1 Install R, then RStudio IDE ( links for installation ) print("hello world !") ## [1] "hello world !" sessionInfo() ## R version 4.2.1 (2022-06-23) ## Platform: x86_64-apple-darwin17.0 (64-bit) ## Running under: macOS Big Sur ... 10.16 ## ## Matrix products: default ## BLAS: /Library/Frameworks/R.framework/Versions/4.2/Resources/lib/libRblas.0....
1484 sym R (6413 sym/60 pcs) 1 img
Higher_Order_Moments
1 Set Up # clear environment rm(list = ls()) # Clear environment gc() # Clear unused memory ## used (Mb) gc trigger (Mb) limit (Mb) max used (Mb) ## Ncells 525594 28.1 1167051 62.4 NA 669291 35.8 ## Vcells 967507 7.4 8388608 64.0 32768 1840395 14.1 cat("\f") # Clear the console graphics.off()...
5277 sym R (5687 sym/47 pcs) 13 img