Publications by Tony Mei

Data 607 Final Project

11.12.2019

Introduction: There are 6454 observations in the given data set. Each case represents a reported forest fire. This dataset reports the number of forest fires in Brazil, divided by state. The series covers a period of approximately 20 years (1998 to 2017). The data were obtained from the official website of the Brazilian government. My goal ...

2137 sym R (4091 sym/23 pcs) 2 img 1 tbl

Data 606 Final Project

11.12.2019

Introduction: My project is to see whether forest fires occur equally often in all the states of the Amazon. There are 6454 observations in the given data set. Each case represents a reported forest fire. This dataset reports the number of forest fires in Brazil, divided by state. The series covers a period of approximately 20 years (1998 to ...

1934 sym R (3524 sym/23 pcs) 4 img 1 tbl

Data 607 Tony Mei Tidyverse Part1

04.12.2019

Introduction: I am observing the annual returns of the FANG stocks from 2013 to 2016. FANG stocks are the technology companies Facebook, Amazon, Netflix and Google. I would like to study the performance of each stock in each year by looking only at the return rates. The data will be acquired from Yahoo Finance. Loading MyPackage library(tidyverse)...

1306 sym R (2794 sym/28 pcs) 3 img

Data 607 Tony Mei Tidyverse Part2

04.12.2019

mkolp0-" date: “11/30/2019” output: html_document — library(knitr) library(tidyverse) ## -- Attaching packages ----------------------------------------------- tidyverse 1.2.1 -- ## v ggplot2 3.2.1 v purrr 0.3.3 ## v tibble 2.1.3 v dplyr 0.8.3 ## v tidyr 1.0.0 v stringr 1.4.0 ## v readr 1.3.1 v forcats 0.4.0 ## -...

1271 sym R (3821 sym/20 pcs) 1 img 1 tbl

Data 606 Homework 9

24.11.2019

library(ggplot2) Baby weights, Part I. (9.1, p. 350) The Child Health and Development Studies investigate a range of topics. One study considered all pregnancies between 1960 and 1967 among women in the Kaiser Foundation Health Plan in the San Francisco East Bay area. Here, we study the relationship between smoking and weight of the baby. The v...

7688 sym R (1846 sym/21 pcs) 2 img

Data 606 Lab9

24.11.2019

download.file("http://www.openintro.org/stat/data/evals.RData", destfile = "evals.RData") load("evals.RData") Exploring the data Is this an observational study or an experiment? The original research question posed in the paper is whether beauty leads directly to the differences in course evaluations. Given the study design, is it possible to ...

10280 sym R (7644 sym/27 pcs) 13 img

Data 607 Homework 12

27.11.2019

Load Packages library(RMySQL) ## Loading required package: DBI library(mongolite) MySQL Here we pull the data from MySQL. con<-(dbConnect(MySQL(), user="root", password="1234", dbname="flights", host="35.225.71.0")) dbListFields(con, 'Ratings') ## [1] "Person" "TheLionKing" ## [3] "Avengers_Endgame" ...

707 sym R (3283 sym/15 pcs)

Data 605 Discussion 5

26.02.2020

Exercise 1 Modify the program CoinTosses to toss a coin n times and print out after every 100 tosses the proportion of heads minus 1/2. Do these numbers appear to approach 0 as n increases? Modify the program again to print out, every 100 times, both of the following quantities: the proportion of heads minus 1/2, and the number of heads minus hal...
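The exercise above can be sketched in R roughly as follows. This is a minimal illustration, not the original CoinTosses program or the author's solution: the function name `coin_tosses`, its arguments, and the column names are assumptions made for this example, and it assumes `n` is at least `every`.

```r
# Hypothetical helper (name and arguments are assumptions for illustration).
# Tosses a fair coin n times and records, after every `every` tosses,
# the proportion of heads minus 1/2 and the number of heads minus n/2.
coin_tosses <- function(n, every = 100) {
  flips <- sample(c(0L, 1L), size = n, replace = TRUE)  # 1 = heads, 0 = tails
  heads <- cumsum(flips)                                # running count of heads
  checkpoints <- seq(every, n, by = every)              # 100, 200, ..., up to n
  data.frame(
    tosses             = checkpoints,
    prop_minus_half    = heads[checkpoints] / checkpoints - 0.5,
    heads_minus_half_n = heads[checkpoints] - checkpoints / 2
  )
}

set.seed(605)  # arbitrary seed, only so that a run can be repeated
result <- coin_tosses(1000)
print(result)
```

Comparing the two columns across the checkpoints shows how the proportion-based quantity and the count-based quantity behave as the number of tosses grows.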

671 sym R (2359 sym/28 pcs) 2 img

Data_605_Assign5

28.02.2020

Choose independently two numbers B and C at random from the interval [0, 1] with uniform density. Prove that B and C are proper probability distributions. Note that the point (B,C) is then chosen at random in the unit square. Find the probability that # Function to pick a random number from the interval [0,1] 100 times. # runif(n) generates n un...

353 sym R (412 sym/11 pcs)

Data 605 Discussion 8

19.03.2020

1 A fair coin is tossed 100 times. The expected number of heads is 50, and the standard deviation for the number of heads is (100 · 1/2 · 1/2)^(1/2) = 5. What does Chebyshev’s Inequality tell you about the probability that the number of heads that turn up deviates from the expected number 50 by three or more standard deviations (i.e., by at least 15...
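As a rough check on the question above, Chebyshev's Inequality gives P(|X - 50| >= 3 · 5) <= 1/3^2 = 1/9. The R sketch below computes that bound and, for comparison, the exact binomial tail probability. The variable names are assumptions chosen for this example, and the code is not taken from the original post.

```r
# Setup from the exercise: 100 tosses of a fair coin.
n <- 100
p <- 0.5
mu    <- n * p                   # expected number of heads: 50
sigma <- sqrt(n * p * (1 - p))   # standard deviation: 5
k     <- 3                       # number of standard deviations

# Chebyshev's Inequality: P(|X - mu| >= k * sigma) <= 1 / k^2
chebyshev_bound <- 1 / k^2

# Exact probability under Binomial(100, 1/2): P(X <= 35) + P(X >= 65)
exact_prob <- pbinom(mu - k * sigma, n, p) +
  pbinom(mu + k * sigma - 1, n, p, lower.tail = FALSE)

print(chebyshev_bound)
print(exact_prob)
```

The exact probability is smaller than the Chebyshev bound, as expected, since the inequality holds for any distribution with this mean and standard deviation and is usually not tight.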

626 sym R (36 sym/2 pcs)