Publications by Warner Alexis
DATA 607 - Week Assignment 7
creating XML, HTML and JSON Files I am going to create a book data set that contains 3 books where 2 of the books have 2 authors and one book has only one. We use line to identify the multiple records of the book. For example Line 1 indicates the main author and line 2 the second author. XLM library(XML) library(tidyverse) ## ── Attaching ...
1209 sym R (14302 sym/36 pcs) 1 tbl
DATA 607 - Project 2
National Science Foundation This data set contain research information census from Doctorate recipients by historical major fields. This data requires some data manipulation to transform it from wide to long. What are the fields that have the highest numbers of doctorate recipients in 2022. We only selected all the major fields and excluded any...
5874 sym R (32059 sym/36 pcs) 8 img 1 tbl
DATA 606 - Confidence Levels
If you have access to data on an entire population, say the opinion of every adult in the United States on whether or not they think climate change is affecting their local community, it’s straightforward to answer questions like, “What percent of US adults think climate change is affecting their local community?”. Similarly, if you had d...
11578 sym R (2312 sym/18 pcs) 4 img 1 tbl
DATA 606 - Sampling Distributions
In this lab, you will investigate the ways in which the statistics from a random sample of data can serve as point estimates for population parameters. We’re interested in formulating a sampling distribution of our estimate in order to learn about the properties of the estimate, such as its distribution. Setting a seed: We will take some ran...
12068 sym R (5413 sym/30 pcs) 5 img
DATA 607 - Week 5 assignment
Introduction We were given an data set of Arrival Airport delays between some of the big cities. The data set has a wide structure that needs some manipulation to transform it into long form. We load the csv file then rename the empty column name. We deleted all the empty row and reprocess the data inton a long structure. # read data csv file ...
625 sym 1 img
Data 607 - Data Analysis
Introduction In this project, you’re given a text file with chess tournament results where the information has some structure. Your job is to create an R Markdown file that generates a .CSV file (that could for example be imported into a SQL database) with the following information for all of the players: Player’s Name, Player’s State, To...
1912 sym R (28888 sym/75 pcs)
DATA 606 - Normal Distribution
In this lab, you’ll investigate the probability distribution that is most central to statistics: the normal distribution. If you are confident that your data are nearly normal, that opens the door to many powerful statistical methods. Here we’ll use the graphical tools of R to assess the normality of our data and also learn how to generate ...
10175 sym R (3373 sym/27 pcs) 13 img
DATA 606 - Probability
The Hot Hand Basketball players who make several baskets in succession are described as having a hot hand. Fans and players have long believed in the hot hand phenomenon, which refutes the assumption that each shot is independent of the next. However, a 1985 paper by Gilovich, Vallone, and Tversky collected evidence that contradicted this beli...
11571 sym R (5341 sym/35 pcs) 6 img
DATA 607 - Week 3 assignment
Please deliver links to an R Markdown file (in GitHub and rpubs.com) with solutions to the problems below. You may work in a small group, but please submit separately with names of all group participants in your submission. #1. Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/features/th...
2859 sym 2 tbl
Data 607 Assignment – SQL and R
Data Set I select a set of data sets from Kaggle that cointains 4 tables necessary to do this project. The data sets are from both The Movie Database and IMDB. these files are located in this links https://www.kaggle.com/datasets/rounakbanik/the-movies-dataset Content This dataset consists of the following files: movies_metadata.csv: The main Mo...
5274 sym R (17411 sym/41 pcs) 2 img 1 tbl