Publications by Arvind Sharma
random_project_pipeline_simple_linear_regression
Table of contents 1 Setting Up Working directory, clearing all data and memory 1.1 Installing Package (required once only), then loading package 2 Plotting Data 3 Fitting the linear regression model 4 A more interpretable intercept 5 Changing scale 6 Predicting the price of a diamond 7 Properties of the residuals 8 Non-linear data 9 Heteroskeda...
1316 sym Python (5510 sym/33 pcs) 6 img
import_dataset
1 Data Lets try to import data into R using the base command using data() command that loads specified data sets, or list the available data sets. 1.1 Time Series Data Longley’s Economic Regression Data - a macroeconomic data set which provides a well-known example for a highly collinear regression. A data frame with 7 economical variables, obse...
921 sym R (6572 sym/30 pcs) 6 img
datasets_package
Table of contents 1 Datasets Inbuilt Package 2 Quakes Description 3 Carbon Dioxide Uptake in Grass Plants Description 4 Biochemical Oxygen Demand Description 5 Edgar Anderson’s Iris Data Description 6 Intercountry Life-Cycle Savings Data Description 7 Level of Lake Huron 1875–1972 Description 8 Diameter, Height and Volume for Black Cherry Tree...
2486 sym
quarto_setup
Table of contents 1 Introduction 1.1 Quarto 1.2 Running Code 1.3 Reproducible Reports with Quarto and R Markdown Setup 1 Introduction 1.1 Quarto Quarto enables you to weave together content and executable code into a finished document. To learn more about Quarto see https://quarto.org. 1.2 Running Code When you click the Render button a docum...
2289 sym 1 img
computations
Quarto Computations This dataset contains a subset of the fuel economy data from the EPA. Specifically, we use the mpg dataset from the ggplot2 package. library(ggplot2) The visualization below shows a positive, strong, and linear relationship between the city and highway mileage of these cars. Additionally, mileage is higher for cars with fewer...
468 sym 1 img
Math_Review
Table of contents 1 Theory 2 Implementation in R 2.1 Matrix A 2.1.1 Alternative ways to create a matrix 2.2 RConics for adjugate of a matrix 2.3 Matrix B 2.4 Matrix C 2.5 Matrix D 2.6 Matrix E 2.6.1 NA values 3 APPENDIX Math Review Author AS # Clear the work space rm(list = ls()) # Clear environment gc() # Clear unused...
3864 sym
Basic Demonstration
Table of contents Quarto Running Code EXTRA COMMAND testing Quarto Lets install psych package and use the describe function. # install.packages("psych") library(psych) ?psych remove(list=ls()) Installed and loaded - so ready for use. df <- mtcars describe(mtcars) vars n mean sd median trimmed mad min max range skew mp...
678 sym 1 img 1 tbl
Document
Table of contents Setup Clean the environment Load the packages Load the data EDA Summary Statistics Simple Linear Regression Estimating Equation Plot the regression line Explore the model output Multivariate Regression Quick way to create higher order terms Quick way to create interaction terms Declaring factor variables Residual Analysis...
6021 sym R (25900 sym/87 pcs) 7 img 2 tbl
Document
V. Statistical Analysis Author Arvind Sharma packages <- c("AER", "stargazer") ?library for (pkg in packages) { library(package = pkg, character.only = TRUE # whether package can be assumed to be character strings. ) } Loading required package: car Loading required package: carData Loading required package: lmtest Loa...
371 sym Python (7483 sym/22 pcs)
heteroskedasticty_white
1 Estimate a Multivariate Regression Should have atleast 3 independent variables. 1.1 Data Choose your data. I will continue with the data for my OLS point estimates in OLS_matrixVSlm in W1. remove(list=ls()) # install.packages("MASS") library(MASS) help(Boston) str(Boston) # 506 rows and 14 columns. ## 'data.frame': 506 obs. of 14 variab...
7704 sym R (13063 sym/57 pcs) 2 img