Publications by Arvind Sharma

Midterm_Solution

16.04.2024

Question 1: Basic Data Analysis in R (Assignment+Discussion 1) In 1986, the Challenger space shuttle exploded during “throttle up” due to catastrophic failure of o-rings (seals) around the rocket booster. The data (real) on all space shuttle launches prior to the Challenger disaster are in the file challenger.csv. Load the data into R or Python...

9773 sym R (13614 sym/100 pcs) 10 img

CPS_discussion

16.04.2024

PROMPT The Current Population Survey (CPS) is a monthly survey of households conducted by the Bureau of Census for the Bureau of Labor Statistics. It provides a comprehensive body of data on the labor force, employment, unemployment, persons not in the labor force, hours of work, earnings, and other demographic and labor force characteristics. Expl...

5254 sym R (20305 sym/36 pcs) 1 img

CLT

16.04.2024

Central Limit Theorem Definition and Proof The Central Limit Theorem (CLT) is one of the most important theorems in statistics and data science. The CLT states that the sample mean (\(\bar{x}\)) of a probability distribution sample is a random variable with a mean value given by population mean \(\mu\) and standard deviation \(\sigma_{\bar{x}}\) (...

4545 sym R (11847 sym/67 pcs) 11 img

CLT_LLN_relationship

16.04.2024

The Law of Large Numbers (LLN) and the Central Limit Theorem (CLT) are fundamental concepts in probability and statistics. 1 Law of Large Numbers (LLN): Definition: The Law of Large Numbers states that as the sample size increases (or as the number of trials in an experiment increases), the sample mean (or what) converges in probability to the pop...

5607 sym

LLN

16.04.2024

1 Introduction In probability theory, the law of large numbers (LLN) is a theorem that describes the result of performing the same experiment a large number of times. According to the law, the average of the results obtained from a large number of trials should be close to the expected value, and will tend to become closer as more trials are perfor...

3661 sym Python (3271 sym/15 pcs) 2 img

HW5_template

16.04.2024

Homework 5, Data Analysis Resources : Please have a look at re-emphasizing the core concepts of CLT, standard error, hypothesis testing, confidence interval and p-values. Please skim through them (~15 minutes) before attempting the assignment to refresh your memories. Please find the Open Intro Statistics textbook (OpenStat_textbook.pdf) in our Dr...

12575 sym Python (10288 sym/32 pcs) 5 img

Distributions_hypothesis_testing

16.04.2024

0.1 Normal Distribution: 0.1.1 Definition: A normal distribution is a symmetric, bell-shaped probability distribution that is fully characterized by its mean (μ) and standard deviation (σ). In the context of student grades, a normal distribution suggests that a significant number of students perform close to the average grade, with fewer student...

3882 sym R (3364 sym/12 pcs) 4 img

student_t_class_examples

16.04.2024

Standard Error: In statistics, the standard error (SE) is a measure of how much the sample mean is expected to vary from the true population mean. It is a standard deviation of the sampling distribution of the sample mean. The standard error quantifies the precision of your sample mean estimate. The formula for standard error when estimating the po...

6282 sym R (4647 sym/51 pcs) 8 img

ggplot2

16.04.2024

Read this help file for R Markdown. 1 Set Up Clear all environments. Installing and loading all the libraries. Make sure you have the libraries installed. 2 Import Data Now, I will import my data. Make sure you comment out or exclude or do not use View(train) command. df <- read.csv("~/Library/CloudStorage/Dropbox/WCAS/Summer/Data Analysis/share/...

3030 sym Python (15763 sym/65 pcs) 12 img

Z_distribution

16.04.2024

1 Z Scaling Z-scaling, also known as standardization or z-score normalization, is a common preprocessing step in statistics and machine learning. It involves transforming the features of a dataset so that they have a mean of 0 and a standard deviation of 1. The formula for z-scaling is: \[z=\dfrac{x-\mu}{\sigma}\]​ where: z is the standardized v...

4480 sym R (1815 sym/12 pcs) 3 img