Publications by Amber Ferger

DATA 607: Masculinity Tidyverse Assignment

01.12.2019

The Data: What Do Men Think It Means to be a Man? For this assignment, I decided to use the dataset that corresponds to the What Do Men Think It Means To Be A Man? article on FiveThirtyEight.com. The article can be found here: https://fivethirtyeight.com/features/what-do-men-think-it-means-to-be-a-man/ It contains the results of a survey of 1,615...


DATA 606: Homework 9

25.11.2019

Baby weights, Part I. (9.1, p. 350) The Child Health and Development Studies investigate a range of topics. One study considered all pregnancies between 1960 and 1967 among women in the Kaiser Foundation Health Plan in the San Francisco East Bay area. Here, we study the relationship between smoking and weight of the baby. The variable smoke is c...


DATA 607: Assignment 12 - Graph Databases

23.11.2019

Assignment For this assignment, you should take information from a relational database and migrate it to a NoSQL database of your own choosing. For the relational database, you might use the flights database, the tb database, the “data skills” database your team created for Project 3, or another database of your own choosing or creation. For ...


DATA 605 - Week 2 Discussion

05.02.2020

Chapter D, Section DM, Question M16 (pg.271) Given the matrix \[B = \left[\begin{array}{ccc} 4-x & -4 & -4\\ 2 & -2-x & -4\\ 3 & -3 & -4-x \end{array}\right]\] Find all values of \(x\) that are solutions of \(\det(B) = 0\). Solution \[(4-x)\begin{vmatrix}-2-x & -4\\ -3 & -4-x \end{vmatrix} - (-4)\begin{vmatrix}2 & -4\\ 3 & -4-x \end{vmatrix} + (...
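
As a quick numeric check for this kind of problem, the sketch below expands det(B) along the first row and scans a small range of integers for roots. The helper names `det3` and `det_b` and the scan range of -10 to 10 are illustrative assumptions, not part of the original post. Because det(B) is a cubic in x, three distinct roots account for all of its solutions.

```python
def det3(m):
    """Determinant of a 3x3 matrix by cofactor expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def det_b(x):
    """det(B) for the matrix B from Question M16, as a function of x."""
    return det3([
        [4 - x, -4, -4],
        [2, -2 - x, -4],
        [3, -3, -4 - x],
    ])


# The scan range is an assumption chosen for illustration.
# Integer arithmetic keeps the comparison with zero exact.
roots = [x for x in range(-10, 11) if det_b(x) == 0]
print(roots)
```

The scan finds x = -4, 0, and 2, which can be confirmed by substituting each value back into B.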


DATA 605 - Assignment 2, Problem Set 1

09.02.2020

Problem Set 1 Problem 1 Show that \[A^{T}A \neq AA^{T}\] Proof: In general, \(AB \neq BA\). Given a square matrix: \[ A = \left[\begin{array}{cc} a & b \\ c & d \end{array}\right], \quad A^T = \left[\begin{array}{cc} a & c \\ b & d \end{array}\right]\] \[ A^{T}A = \left[\begin{array}{cc} a & c \\ b & d \end{array}\right] \,\left[\begin...


DATA 605 - Week 3 Discussion

12.02.2020

Chapter E, Section EE, Question C19 (pg.298) Question C19: Find the eigenvalues, eigenspaces, algebraic multiplicities and geometric multiplicities for the matrix below. It is possible to do all these computations by hand, and it would be instructive to do so. \[C = \left[\begin{array}{cc} -1 & 2\\ -6 & 6 \end{array}\right]\] Solution Eigen...
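
The hand computation for a 2x2 matrix like C can be cross-checked with a short sketch that solves the characteristic polynomial and verifies one eigenvector per eigenvalue. The helper names `eigenvector` and `times` are hypothetical, and this is only one way to set up the check, not necessarily the approach taken in the post.

```python
import math

# Matrix C from Question C19.
C = [[-1, 2], [-6, 6]]

# For a 2x2 matrix the characteristic polynomial is
# lambda^2 - trace * lambda + det.
trace = C[0][0] + C[1][1]
det = C[0][0] * C[1][1] - C[0][1] * C[1][0]
disc = trace ** 2 - 4 * det  # non-negative here, so the eigenvalues are real

eigenvalues = sorted([(trace - math.sqrt(disc)) / 2, (trace + math.sqrt(disc)) / 2])


def eigenvector(matrix, lam):
    """One eigenvector for lam, read off the first row of (matrix - lam * I).

    Assumes matrix[0][1] is nonzero, which holds for C.
    """
    return (matrix[0][1], lam - matrix[0][0])


def times(matrix, v):
    """Matrix-vector product for a 2x2 matrix."""
    return tuple(row[0] * v[0] + row[1] * v[1] for row in matrix)


for lam in eigenvalues:
    v = eigenvector(C, lam)
    print(lam, v, times(C, v))  # times(C, v) should equal lam * v
```

The two eigenvalues are distinct (2 and 3), so each has algebraic and geometric multiplicity 1, and each eigenspace is the span of the corresponding vector.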


DATA 605 - Week 12 Discussion

13.04.2020

Chapters 1 - 3: Linear Regression Using R Question Using R, build a regression model for data that interests you. Conduct residual analysis. Was the linear model appropriate? Why or why not? Solution I work in health insurance, so I thought it would be interesting to take a look at healthcare costs. Data Specs The data is publicly available o...


DATA 605 - Week 14 Discussion

28.04.2020

Exercise 4.3, Problem 17 (pg. 187) What are the dimensions of the rectangle with the largest area that can be drawn inside the unit circle? Solution We know that the unit circle has a radius of 1 and a diameter of 2. We will use this fact to draw out our problem. We are trying to maximize the area of the rectangle, so we will take the der...
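
A numeric sanity check for this optimization can be sketched as a grid search. It assumes the rectangle is centered at the origin with its corners on the circle at angle t, so the width is 2cos(t) and the height is 2sin(t). The parametrization, the function names, and the step count are illustrative assumptions, and the post itself works the problem with a derivative rather than a search.

```python
import math

# Assumption: the rectangle is centered at the origin with corners at
# (+/-cos(t), +/-sin(t)) on the unit circle, for an angle t in (0, pi/2).
def dimensions(t):
    """Width and height of the inscribed rectangle for corner angle t."""
    return 2 * math.cos(t), 2 * math.sin(t)


def area(t):
    """Area of the inscribed rectangle for corner angle t."""
    width, height = dimensions(t)
    return width * height


# Coarse grid search over t; the step count is an arbitrary illustrative choice.
steps = 10_000
best_t = max((k * (math.pi / 2) / steps for k in range(1, steps)), key=area)
best_width, best_height = dimensions(best_t)
print(best_width, best_height, area(best_t))
```

The search settles on a square with side length close to the square root of 2 and area close to 2, which is what setting the derivative of the area to zero gives analytically.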


DATA 605 - Assignment 15

15.05.2020

Problem 1 Find the equation of the regression line for the given points. Round any final values to the nearest hundredth, if necessary. \[(5.6, 8.8), (6.3, 12.4), (7, 14.8), (7.7, 18.2), (8.4, 20.8)\] Solution The regression line can be modeled as \(y=mx+b\). Given the points above, we can solve for \(m\) and \(b\) by using the following equatio...
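
The fit for these five points can be reproduced with the standard least-squares formulas for slope and intercept. This is a minimal sketch: the variable names are illustrative, and the closed-form expressions used here are an assumption that may differ in form from the equations the post goes on to use.

```python
points = [(5.6, 8.8), (6.3, 12.4), (7, 14.8), (7.7, 18.2), (8.4, 20.8)]

n = len(points)
x_mean = sum(x for x, _ in points) / n
y_mean = sum(y for _, y in points) / n

# Ordinary least squares: slope = S_xy / S_xx, intercept = y_mean - slope * x_mean.
s_xy = sum((x - x_mean) * (y - y_mean) for x, y in points)
s_xx = sum((x - x_mean) ** 2 for x, _ in points)

m = s_xy / s_xx
b = y_mean - m * x_mean

print(f"y = {m:.2f}x + ({b:.2f})")
```

Rounded to the nearest hundredth, this gives a slope of about 4.26 and an intercept of about -14.80.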


DATA 612: Research Discussion 2

22.06.2020

Music Recommendations at Scale with Spark For this discussion item, please watch the following talk and summarize what you found to be the most important or interesting points. The first half will cover some of the mathematical techniques covered in this unit’s reading and the second half some of the data management challenges in an industrial-...
