Publications by Kory Martin

DATA 624 - Homework #4

01.10.2023

Problem 3.1 Using visualizations, explore the predictor variables to understand their distributions as well as the relationships between predictors data(Glass) str(Glass) ## 'data.frame': 214 obs. of 10 variables: ## $ RI : num 1.52 1.52 1.52 1.52 1.52 ... ## $ Na : num 13.6 13.9 13.5 13.2 13.3 ... ## $ Mg : num 4.49 3.6 3.55 3.69 3...

1920 sym 62 img

DATA 624 - Assignment #3

26.09.2023

Problem 5.1 Produce forecasts for the following series using whichever of NAIVE(y), SNAIVE(y) or RW(y ~ drift()) is more appropriate in each case: Australian Population (global_economy) aus_economy_pop <- global_economy %>% filter(Country == 'Australia') aus_fit_model <- aus_economy_pop %>% model(RW(Population ~ drift())) aus_forecast <- a...

1911 sym 22 img

DATA 624 - Homework #2

17.09.2023

Problem 3.1 Consider the GDP information in global_economy. Plot the GDP per capita for each country over time. # Identify the top 10 countries based on mean per cap GDP over time top_10_countries <- as_tibble(global_economy) %>% mutate(gdp_per_cap = GDP/Population) %>% group_by(Country) %>% summarize(mean_per_cap_gdp = mean(gdp_per_cap)) %>%...

5588 sym Python (11526 sym/62 pcs) 31 img

DATA 624 - Homework #1

11.09.2023

Overview This homework assignment is focused on Time Series and includes Exercises 2.1, 2.2, 2.3, 2.4, 2.5 and 2.8 from the Hyndman text. Setup options(repos = list(CRAN="http://cran.rstudio.com/")) knitr::opts_chunk$set(echo = TRUE) install.packages("USgas") ## ## The downloaded binary packages are in ## /var/folders/4l/182nghl547v3mxtj6p_9dg...

4548 sym R (8349 sym/67 pcs) 32 img

DATA 605 - Final Exam (Part I)

17.05.2023

Setup Probability Density 1: X~Gamma Using R, generate a random variable X that has 10,000 random Gamma pdf values. A Gamma pdf is completely describe by n (a size parameter) and lambda (\(\lambda\) , a shape parameter). Choose any n greater 3 and an expected value \(\lambda\) between 2 and 10 (you choose). set.seed(51023) n = 10000 shape = 5 lam...

3180 sym

DATA 604 - Final Exam (Part II)

17.05.2023

Setup You are to register for Kaggle.com (free) and compete in the House Prices: Advanced Regression Techniques competition. https://www.kaggle.com/c/house-prices-advanced-regression-techniques . I want you to do the following. Problem 1 - Descriptive and Inferential Statistics. Provide univariate descriptive statistics and appropriate plots for ...

5337 sym Python (31107 sym/104 pcs) 66 img

DATA 606 - Final Project

16.05.2023

1. Introduction For this project, we will explore data from the California Department of Education in an effort to determine if California charter schools are better than California public schools. We will attempt to evaluate this question by comparing the mean performance across charter schools and public schools across the following areas: Suspe...

12399 sym Python (20292 sym/67 pcs) 13 img 13 tbl

DATA 607 - Final Project

14.05.2023

1. Introduction a. Overview This project is focused on extending the work completed in Project 3, where we set out to identify the top skills required for a Data Scientist. In that project, we collected over 100+ postings collected across several job boards, and extracted job requirements information from the various postings to generate a corpus ...

10508 sym R (13820 sym/44 pcs) 4 img 2 tbl

DATA 607 - Tidyverse Extended Assignment

29.04.2023

Introduction: For my tidyverse create assignment, I chose a data set containing roster information for all NCAA Women’s Basketball teams. I intend to use readr to read in my data, dplyr to manipulate my data and ggplot2 to display my analysis. Loading/Installing Packages library(ggplot2) library(dplyr) ## ## Attaching package: 'dplyr' ## The fo...

4710 sym R (6793 sym/19 pcs) 1 img 2 tbl

DATA 605 - Week 12 Discussion

21.04.2023

Introduction For this exercise, I used a real estate price dataset obtained from Kaggle. This dataset includes 6 predictor variables, and is focused on trying to predict the house price of a unit area. Assignment: - Using R, build a multiple regression model for data that interests you. - Include in this model at least one quadratic term, one dicho...

7585 sym 8 img