Publications by Dominika Markowska-Desvallons
606 Lab1
library(tidyverse)
library(openintro)
Exercise 1
arbuthnot$girls
## [1] 4683 4457 4102 4590 4839 4820 4928 4605 4457 4952 4784 5332 5200 4910 4617
## [16] 3997 3919 3395 3536 3181 2746 2722 2840 2908 2959 3179 3349 3382 3289 3013
## [31] 2781 3247 4107 4803 4881 5681 4858 4319 5322 5560 5829 5719 6061 6120 5822
## [46] 5738 5717 5847 6203 6...
4126 sym R (2035 sym/16 pcs) 3 img
HW_Final R
1. Data Exploration: This should include summary statistics, means, medians, quartiles, or any other relevant information about the data set. Please include some conclusions in the R Markdown text.
Questions to ask:
1. What is the median income in Chile?
2. Is there a difference in earnings by age (older -> higher income)?
3. Is there a gap bewt...
2199 sym R (5106 sym/28 pcs) 6 img
Homework Two/ MSDS
1. Use the summary function to gain an overview of the data set. Then display the mean and median for at least two attributes.
beaver <- read.csv(file="/Users/domin/OneDrive/Desktop/beaver.csv", header=TRUE, sep=",")
summary(beaver)
## X ...
669 sym R (3241 sym/21 pcs)
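The summary, mean, and median step in the Homework Two excerpt can be sketched as follows. This is only an illustration: the contents of the local `beaver.csv` are not shown, so the built-in `datasets::beaver1` data frame is used as an assumed stand-in, and the columns `temp` and `time` come from that built-in data set rather than from the original file.

```r
# Assumption: datasets::beaver1 stands in for the local beaver.csv,
# whose actual columns are not shown in the excerpt.
beaver <- datasets::beaver1

# Overview of every column (min, quartiles, median, mean, max).
print(summary(beaver))

# Mean and median for two numeric attributes.
temp_stats <- c(mean = mean(beaver$temp), median = median(beaver$temp))
time_stats <- c(mean = mean(beaver$time), median = median(beaver$time))
print(temp_stats)
print(time_stats)
```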
Homework One
Write a loop that calculates 12-factorial.
x = 1
for( i in 1:12) { x = x * i }
print(x)
## [1] 479001600
Show how to create a numeric vector that contains the sequence from 20 to 50 by 5.
seq(20, 50, by = 5)
## [1] 20 25 30 35 40 45 50
Create the function “quadratic” that takes a trio of input numbers a, b, and c and solve the quadrat...
304 sym R (658 sym/6 pcs)
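The Homework One excerpt is cut off before the “quadratic” function appears. One possible sketch of such a function is shown below. It is not the author's solution: the handling of `a == 0` and the use of complex arithmetic for a negative discriminant are assumptions added for illustration.

```r
# Hypothetical sketch of a quadratic solver for a*x^2 + b*x + c = 0.
quadratic <- function(a, b, c) {
  if (a == 0) {
    stop("'a' must be non-zero for a quadratic equation")
  }
  discriminant <- b^2 - 4 * a * c
  # as.complex() lets sqrt() return complex roots when the discriminant is negative.
  root <- sqrt(as.complex(discriminant))
  # base::c is spelled out because the argument named `c` shadows the usual name.
  base::c((-b + root) / (2 * a), (-b - root) / (2 * a))
}

# Usage: x^2 - 3x + 2 = 0 has the roots 2 and 1.
roots <- quadratic(1, -3, 2)
print(roots)
```

Returning complex values keeps the function total for any non-zero `a`; `Re(roots)` recovers plain numbers when the roots are real.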
Data 606 - Lab 2
library(tidyverse)
library(openintro)
data(nycflights)
The data set nycflights that shows up in your workspace is a data matrix, with each row representing an observation and each column representing a variable. R calls this data format a data frame, which is a term that will be used throughout the labs. For this data set, each observation is a ...
9473 sym R (7990 sym/50 pcs) 9 img
Data 606 - HW 2
Stats scores. (2.33, p. 78) Below are the final exam scores of twenty introductory statistics students.
57, 66, 69, 71, 72, 73, 74, 77, 78, 78, 79, 79, 81, 81, 82, 83, 83, 88, 89, 94
Create a box plot of the distribution of these scores. The five number summary provided below may be useful.
Mix-and-match. (2.10, p. 57) Describe the distributio...
5499 sym R (32 sym/2 pcs) 5 img
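The box plot exercise in the Data 606 HW 2 excerpt can be sketched in base R using the twenty scores listed there. The variable names and plot labels are illustrative choices, and `fivenum()` (Tukey's hinges) is one of several quartile conventions, so other methods may give slightly different quartiles.

```r
# The twenty final exam scores quoted in the exercise.
scores <- c(57, 66, 69, 71, 72, 73, 74, 77, 78, 78,
            79, 79, 81, 81, 82, 83, 83, 88, 89, 94)

# Five number summary: minimum, lower hinge, median, upper hinge, maximum.
five_num <- fivenum(scores)
names(five_num) <- c("min", "lower_hinge", "median", "upper_hinge", "max")
print(five_num)

# Horizontal box plot of the score distribution.
boxplot(scores, horizontal = TRUE,
        xlab = "Final exam score",
        main = "Distribution of final exam scores")
```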
MSDataScience 607 HW1
“Higher Rates Of Hate Crimes Are Tied To Income Inequality”
https://fivethirtyeight.com/features/higher-rates-of-hate-crimes-are-tied-to-income-inequality/
Overview
This article describes hate crime rates in states before and after the election. It argues that states with higher income inequality report more hate crimes compared to o...
822 sym R (4736 sym/8 pcs)
606 HW 1
What does each row of the data matrix represent? How many participants were included in the survey? Indicate whether each variable in the study is numerical or categorical. If numerical, identify as continuous or discrete. If categorical, indicate if the variable is ordinal.
library(RCurl)
df <- read.csv("https://raw.githubusercontent.com/jbrye...
2056 sym R (3045 sym/4 pcs)
Data 607 _ HW2
##List of Movies
movies <- dbSendQuery(mydb, "SELECT * FROM movies")
dbFetch(movies)
##   id      title
## 1  1 Good Girls
## 2  2  Bad Santa
## 3  3   Bad Boys
## 4  4      Smile
## 5  5   Hangover
## 6  6   Notebook
##List of Movies’ RATINGS
ratings <- dbSendQuery(mydb, "SELECT * FROM ratings")
dbFetch(ratings)
## id movie_id user_...
213 sym R (3285 sym/10 pcs)
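The `dbSendQuery()` / `dbFetch()` pattern in the Data 607 HW2 excerpt leaves a result set open until it is cleared. A minimal sketch of the full cycle is shown below. It assumes the DBI and RSQLite packages are installed and uses an in-memory SQLite database as a stand-in, because the original `mydb` connection is not shown. The movie titles are the ones listed in the excerpt.

```r
library(DBI)

# Assumption: an in-memory SQLite database stands in for the original `mydb`.
mydb <- dbConnect(RSQLite::SQLite(), ":memory:")

# Recreate the movies table from the titles shown in the excerpt.
dbWriteTable(mydb, "movies", data.frame(
  id = 1:6,
  title = c("Good Girls", "Bad Santa", "Bad Boys",
            "Smile", "Hangover", "Notebook")
))

# dbSendQuery() returns a result set that should be released after fetching.
res <- dbSendQuery(mydb, "SELECT * FROM movies")
movies <- dbFetch(res)
dbClearResult(res)

# dbGetQuery() sends the query, fetches all rows, and clears the result in one call.
movies_again <- dbGetQuery(mydb, "SELECT * FROM movies")
print(movies_again)

dbDisconnect(mydb)
```

For simple `SELECT` statements, `dbGetQuery()` is usually the more convenient choice; `dbSendQuery()` is mainly useful when rows are fetched in chunks.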
Data 606 - Lab 7
Getting Started
Load packages
In this lab, we will explore and visualize the data using the tidyverse suite of packages, and perform statistical inference using infer. The data can be found in the companion package for OpenIntro resources, openintro. Let’s load the packages.
library(tidyverse)
library(openintro)
library(infer)
library(stats...
8829 sym R (8264 sym/88 pcs) 5 img