Publications by Dominika Markowska-Desvallons
DATA 607 HW 10
Assignment 10 In this assignment, you should start by getting the primary example code from chapter 2 working in an R Markdown document. You should provide a citation to this base code. You’re then asked to extend the code in two ways: work with a different corpus of your choosing, and incorporate at least one additional sentiment lexicon (poss...
1309 sym R (5826 sym/34 pcs) 3 img
data 606 HW8
Nutrition at Starbucks, Part I. (8.22, p. 326) The scatterplot below shows the relationship between the number of calories and amount of carbohydrates (in grams) Starbucks food menu items contain. Since Starbucks only lists the number of calories on the display items, we are interested in predicting the amount of carbs a menu item has based on i...
6859 sym R (1524 sym/11 pcs) 10 img
Data 607 - Discussion 11
Discussion 11 - Recommender Systems The task is to analyze an existing recommender system: perform a Scenario Design analysis; attempt to reverse engineer what you can about the site, from the site interface and any available information that you can find on the Internet or elsewhere; include specific recommendations about how to improve the site...
2903 sym
Project 4
Introduction As a group, we worked with two files containing spam and ham to predict whether a document is spam. Using our “training” documents, our group was able to classify the “test” documents. We communicated via Zoom meetings and collaborated through GitHub. For this project, we started with a spam/ham dataset, the...
3252 sym R (5832 sym/31 pcs)
Data 606 - Hw 9
Baby weights, Part I. (9.1, p. 350) The Child Health and Development Studies investigate a range of topics. One study considered all pregnancies between 1960 and 1967 among women in the Kaiser Foundation Health Plan in the San Francisco East Bay area. Here, we study the relationship between smoking and weight of the baby. The variable smoke is c...
7393 sym R (615 sym/14 pcs) 2 img
Data 605 Hw2
Problem set 2 Matrix factorization is a very important problem. There are supercomputers built just to do matrix factorizations. Every second you are on an airplane, matrices are being factorized. Radars that track flights use a technique called Kalman filtering. At the heart of Kalman Filtering is a Matrix Factorization operation. Kalman Filters...
532 sym R (742 sym/4 pcs)
Document
Assignment 4 Libraries library(doParallel) library(foreach) library(jpeg) library(EBImage) library(kableExtra) library(OpenImageR) Adding Graphics num=17 files=list.files("C:/Users/domin/OneDrive/Desktop/605/jpg",pattern="\\.jpg")[1:num] Reading files + dimensions +plotting by pixel height=1200; width=2500;scale=20 plot_jpeg = function(p...
804 sym R (2127 sym/14 pcs) 5 img
Data 605 Final Project
library(gridExtra) library(RColorBrewer) library(Matrix) library(scales) library(corrplot) library(MASS) library(psych) library(ggplot2) library(matlib) library(dplyr) library(tidyr) library(kableExtra) library(purrr) library(Hmisc) Problem 1 Using R, generate a random variable X that has 10,000 random uniform numbers from 1 to N, w...
3481 sym R (24785 sym/81 pcs) 6 img 2 tbl
Discussion11 Data 605
happiness<-read.csv("https://raw.githubusercontent.com/hrensimin05/Cuny_DataScience/master/2019.csv") #there are 156 observations and 9 variables happy<-data.frame(happiness) head(happy) ## Overall.rank Country.or.region Score GDP.per.capita Social.support ## 1 1 Finland 7.769 1.340 1.587 ## 2 ...
1043 sym R (6440 sym/30 pcs) 3 img
Project1 (Data 624)
Part A – ATM Forecast, ATM624Data.xlsx In part A, I want you to forecast how much cash is taken out of 4 different ATM machines for May 2010. The data is given in a single file. The variable ‘Cash’ is provided in hundreds of dollars; other than that, it is straightforward. I am being somewhat ambiguous on purpose to make this have a little ...
5944 sym R (9709 sym/95 pcs) 22 img