Publications by Melissa Bowman

Data 605 Discussion Week 11

12.04.2023

Discussion Objective: Using R, build a regression model for data that interests you. Conduct residual analysis. Was the linear model appropriate? Why or why not? Dataset: The data set used here is the “babies” data from the OpenIntro package. The data consist of recorded pregnancies between 1960 and 1967 among women in the Kaiser Foundation He...

1189 sym 4 img
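
The residual analysis described in the Week 11 discussion entry can be sketched as follows. This is a minimal illustration on simulated stand-in data, not the author's analysis and not the OpenIntro “babies” data; the variable names `gestation` and `bwt` and all numeric settings are assumptions made for the example.

```r
# Simulated stand-in data (hypothetical values, NOT the "babies" data set).
set.seed(605)
n <- 200
gestation <- rnorm(n, mean = 280, sd = 15)        # assumed predictor
bwt <- 10 + 0.4 * gestation + rnorm(n, sd = 15)   # assumed response

# Fit a simple linear regression model.
fit <- lm(bwt ~ gestation)

# Extract residuals and fitted values for the diagnostic checks.
res <- resid(fit)
fitted_vals <- fitted(fit)

# Residuals vs. fitted values: look for curvature or a funnel shape.
plot(fitted_vals, res, xlab = "Fitted values", ylab = "Residuals")
abline(h = 0, lty = 2)

# Normal Q-Q plot: points close to the line suggest roughly normal residuals.
qqnorm(res)
qqline(res)
```

If the residuals scatter evenly around zero with no visible pattern and the Q-Q points follow the line, a linear model is a reasonable choice for the data at hand.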

Data 605 HW8

19.03.2023

11) A company buys 100 lightbulbs, each of which has an exponential lifetime of 1000 hours. What is the expected time for the first of these bulbs to burn out? (See Exercise 10.) From Exercise 10, the minimum of \(n\) independent exponential random variables, each with mean \(\mu\), is itself exponentially distributed with mean \(\mu/n\). Here \(\mu\) = 1000 hours and \(n\) = 100. mu = 1000 n = 100 ...

2242 sym
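
The lightbulb result in the HW8 entry can be checked numerically. The sketch below is not the author's code: it computes \(\mu/n\) directly and compares it with a simulation, where the seed and the number of replications are arbitrary choices made for the illustration.

```r
mu <- 1000   # mean lifetime of one bulb, in hours
n <- 100     # number of bulbs

# Exact expected time until the first bulb burns out.
expected_first_failure <- mu / n

# Simulation check: draw n exponential lifetimes, keep the minimum, repeat.
set.seed(605)
sim_minima <- replicate(10000, min(rexp(n, rate = 1 / mu)))
simulated_mean <- mean(sim_minima)

print(expected_first_failure)
print(simulated_mean)
```

The simulated mean should land close to the exact value of 10 hours, with the gap shrinking as the number of replications grows.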

DATA 605 ASSIGNMENT 4

19.02.2023

Introduction In many fields such as computer vision, face recognition, and data compression, images can be represented as a matrix of pixel values. These matrices can be used to extract features and perform dimensionality reduction techniques such as Principal Component Analysis (PCA). In this context, Eigen Shoes refers to a set of orthogonal ...

3053 sym 3 img

DATA 605 ASSIGNMENT 3

12.02.2023

Problem set 1 (1) What is the rank of the matrix A? \[ \left[ \begin{matrix} 1 & 2 & 3 & 4 \\ -1 & 0 & 1 & 3\\ 0 & 1 & -2 & 1\\ 5 & 4 & -2 & -3 \end{matrix} \right] \] x <- matrix(c(1, -1, 0, 5, 2, 0, 1, 4, 3, 1, -2, -2, 4, 3, 1, -3), 4, 4) det(x) ## [1] -9 The determinant of this matrix is equal to -9. Since the determinant is a non-z...

5799 sym
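
As a cross-check on the determinant argument in the Assignment 3 entry, the rank can also be read off a QR decomposition. This is a sketch using base R's `qr()`, offered as an alternative check rather than the method used in the assignment; the variable names are chosen for the example.

```r
# Same matrix as in the excerpt, filled column by column.
A <- matrix(c(1, -1, 0, 5,
              2, 0, 1, 4,
              3, 1, -2, -2,
              4, 3, 1, -3), nrow = 4, ncol = 4)

# A non-zero determinant means the 4 x 4 matrix has full rank.
det_A <- det(A)

# qr() reports the numerical rank directly.
rank_A <- qr(A)$rank

print(det_A)
print(rank_A)
```

The QR approach also works for non-square or singular matrices, where the determinant test does not apply or only says that the rank is below full.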

Data 608 HW1

10.02.2023

Principles of Data Visualization and Introduction to ggplot2 I have provided you with data about the 5,000 fastest growing companies in the US, as compiled by Inc. magazine. Let's read this in: inc <- read.csv("https://raw.githubusercontent.com/charleyferrari/CUNY_DATA_608/master/module1/Data/inc5000_data.csv", header= TRUE) And let's preview thi...

2061 sym R (4077 sym/11 pcs) 3 img

Data 6060 Lab 1: Intro to R

04.09.2022

library(tidyverse) library(openintro) Exercise 1 arbuthnot$girls ## [1] 4683 4457 4102 4590 4839 4820 4928 4605 4457 4952 4784 5332 5200 4910 4617 ## [16] 3997 3919 3395 3536 3181 2746 2722 2840 2908 2959 3179 3349 3382 3289 3013 ## [31] 2781 3247 4107 4803 4881 5681 4858 4319 5322 5560 5829 5719 6061 6120 5822 ## [46] 5738 5717 5847 6203 6...

6982 sym 3 img

Data 607 Assignment 1

04.09.2022

In this assignment, a dataset and a subset of it are explored. The data chosen were the 2022 FiveThirtyEight Election Forecast datasets (senate only) on fivethirtyeight.com (link: https://github.com/fivethirtyeight/data/tree/master/election-forecasts-2022). The first part of the assignment is to read the data through its raw code for reproducibility. Here ...

954 sym R (6592 sym/11 pcs)

Assignment 3 in Data 607

18.09.2022

In Assignment 3 for Data 607, 4 problems were given to solve with a focus on regular expressions. The following are the libraries needed to complete the task and the answers to the 4 problems. Load libraries. library(RCurl) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## ...

3250 sym R (3885 sym/44 pcs)

Data 607 Project 1

24.09.2022

In Project 1 for Data 607, a text file is given with chess tournament results. In the assignment, the text file first had to be transformed into organized data that could then be written to a .csv file. Load libraries. library(RCurl) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## ...

1200 sym R (6240 sym/27 pcs)

Data 607 Assignment 4

30.09.2022

For Assignment 4, a .csv file was transformed from a wide-format data frame to a long-format data frame. The data were then analyzed to compare the arrival delays for the two airlines listed. Load libraries. library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ...

1388 sym R (2755 sym/15 pcs)
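
The wide-to-long transformation described in the Assignment 4 entry can be sketched with base R's `reshape()`. The airline names, city columns, and delay counts below are hypothetical placeholders, not the assignment's data, and the assignment itself may have used a different function for this step.

```r
# Hypothetical wide table: one row per airline, one column per city.
wide <- data.frame(
  airline = c("Airline A", "Airline B"),
  city_a = c(10, 20),
  city_b = c(30, 40)
)

# Reshape to long format: one row per airline and city combination.
long <- reshape(
  wide,
  direction = "long",
  varying = c("city_a", "city_b"),
  v.names = "delayed",
  timevar = "city",
  times = c("city_a", "city_b"),
  idvar = "airline"
)
row.names(long) <- NULL

# Compare the two airlines by their mean delay count across cities.
mean_delay <- aggregate(delayed ~ airline, data = long, FUN = mean)

print(long)
print(mean_delay)
```

In long format each observation sits on its own row, which makes grouped summaries such as the per-airline mean a single call.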