Publications by Trishita Nath
Data 607 Assignment 3
library(stringr) library(tidyverse) ## -- Attaching packages --------------------------------------- tidyverse 1.3.0 -- ## v ggplot2 3.3.3 v purrr 0.3.4 ## v tibble 3.0.6 v dplyr 1.0.4 ## v tidyr 1.1.2 v forcats 0.5.1 ## v readr 1.4.0 ## -- Conflicts ------------------------------------------ tidyverse_conflicts() -- ## ...
Data 607 Project 1
library(stringr) library(DT) Load Data # Data hosted in a txt file on my GitHub raw_chess_data <- read.table(url("https://gist.githubusercontent.com/trishitanath334/1ad12c228de8510e25c2edbcf1a69b89/raw/c66cbd8848d847ae035543c4140f89dd58fade3f/tournamentinfo.txt"), sep = ",") #exclude top row since it does not have relevant data raw_chess_dat...
Week 1 Assignment
Introduction In a SurveyMonkey poll, 1,058 respondents were asked the following questions about their Thanksgiving: DataSet: https://fivethirtyeight.com/features/heres-what-your-part-of-america-eats-on-thanksgiving/ GitHub Link: https://github.com/fivethirtyeight/data/blob/master/thanksgiving-2015/thanksgiving-2015-poll-data.csv #loading required librar...
DATA 605 Assignment 3
1. Problem set 1 What is the rank of the matrix A? #loading required libraries library(pracma) A <-matrix(c(1,-1,0,5,2,0,1,4,3,1,-2,-2,4,3,1,-3),nrow = 4,ncol = 4) A ## [,1] [,2] [,3] [,4] ## [1,] 1 2 3 4 ## [2,] -1 0 1 3 ## [3,] 0 1 -2 1 ## [4,] 5 4 -2 -3 rref(A) ## [,1] [,2] [,3] [,...
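As a cross-check on the row reduction in the excerpt above, the rank can also be read off a QR decomposition in base R. This is a minimal sketch and not necessarily the approach taken in the assignment; the variable name rank_A is introduced here for illustration.

```r
# Same matrix as in the excerpt above (matrix() fills column by column)
A <- matrix(c(1, -1, 0, 5,
              2, 0, 1, 4,
              3, 1, -2, -2,
              4, 3, 1, -3),
            nrow = 4, ncol = 4)

# qr() computes a QR decomposition; its $rank component is the numerical rank
rank_A <- qr(A)$rank
print(rank_A)
```

Eliminating by hand gives a determinant of -9 for this matrix, which is nonzero, so the matrix should have full rank 4.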
DATA 605 Assignment 4
To build and visualize eigenimagery that accounts for 80% of the variability #loading required libraries library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union lib...
Data 607 Assignment 10
Introduction In Text Mining with R, Chapter 2 looks at Sentiment Analysis. In this assignment, you should start by getting the primary example code from chapter 2 working in an R Markdown document. You should provide a citation to this base code. You’re then asked to extend the code in two ways: Work with a different corpus of your choosing, a...
Data 607 Project 3
INTRODUCTION I came across a study conducted by Jeff Hale in 2018 about The Most In-Demand Skills for Data Scientists. More can be found here. I will be analyzing his two datasets to determine “Which are the most valued data science skills?” Data Source I will use the two datasets from Jeff Hale’s study. I have hosted them on my GitHub...
DATA 605 Homework 8
Question 11 (Page 303) A company buys 100 lightbulbs, each of which has an exponential lifetime of 1000 hours. What is the expected time for the first of these bulbs to burn out? Answer From Exercise 10, for n independent random variables that have an exponential density and mean μ, the minimum value M has an exponential density with mean \(\fr...
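The result quoted from Exercise 10 follows from a short survival-function argument. The following is a sketch, assuming the bulb lifetimes \(X_1, \dots, X_n\) are independent; the time variable \(t\) is introduced here for illustration.

\[
P(M > t) = \prod_{i=1}^{n} P(X_i > t) = \left(e^{-t/\mu}\right)^{n} = e^{-nt/\mu}, \qquad t \geq 0
\]

This is the survival function of an exponential distribution with mean \(\mu / n\). With \(n = 100\) and \(\mu = 1000\) hours:

\[
E[M] = \frac{\mu}{n} = \frac{1000}{100} = 10 \text{ hours}
\]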
Data 605 Homework 7
Question 1 Let X1, X2, . . . , Xn be n mutually independent random variables, each of which is uniformly distributed on the integers from 1 to k. Let Y denote the minimum of the Xi’s. Find the distribution of Y. Answer Let each Xi follow a discrete uniform distribution on {1, …, k}: \[ P(X_i = x) = \frac{1}{k}, \quad x = 1, \dots, k \\ P(X_i \leq x) = \sum_{y = 1}^{x}...
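One standard route to the distribution of \(Y\) goes through the tail probability \(P(Y \geq y)\), using the independence of the \(X_i\). This is a sketch of that route and may differ from the derivation in the full answer.

\[
P(Y \geq y) = \prod_{i=1}^{n} P(X_i \geq y) = \left(\frac{k - y + 1}{k}\right)^{n}, \qquad y = 1, \dots, k
\]

\[
P(Y = y) = P(Y \geq y) - P(Y \geq y + 1) = \left(\frac{k - y + 1}{k}\right)^{n} - \left(\frac{k - y}{k}\right)^{n}
\]

As a sanity check, for \(n = 1\) this reduces to \(P(Y = y) = \frac{1}{k}\), the uniform distribution of a single \(X_i\).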
Data 607 Project 2
Introduction Preparing different datasets for downstream analysis work. Since I am a fan of movies, I am going to analyze the movies dataset. I will perform the following steps: deal with missing data, name the columns properly, and perform visual analysis. My focus will be to analyze the gross and budget for Pirates of the Caribbean Films in the dat...