Publications by Puja Roy

UpdatedDATA607Project3

17.03.2024

Load the packages library(tidyverse) library(openintro) library('DBI') library('RMySQL') Introduction For this project, the aim is to obtain data to answer the question, “Which are the most valued data science skills?” We obtained the Data Science Job Salaries dataset from Kaggle: https://www.kaggle.com/datasets/ruchi798/data-science-job-...

27864 sym Python (267425 sym/43 pcs) 9 img

DATA607Week7

10.03.2024

Introduction The objective of this assignment is to select three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Take the information that you’ve selected about these three...

7737 sym

DATA607Project2

04.03.2024

Introduction Load Libraries library(tidyverse) library(openintro) library(readr) library(tidyr) library(dplyr) library(ggplot2) Exercise 1 Insert any text here. url <- "https://raw.githubusercontent.com/pujaroy280/DATA607Project2/main/MTA_Daily_Ridership_Data__Beginning_2020_20240303.csv" MTA_data <- read.csv(url) head(MTA_data) ## ...

19051 sym Python (2313895 sym/68 pcs) 4 img

DATA608Week4Story2

20.02.2024

Introduction For Story#2, I obtained the data for Unemployment Rate (Bureau of Labor Statistics) from the https://www.bls.gov/data/#unemployment website, Consumer Price Index (CPI) (Bureau of Labor Statistics) from the https://www.bls.gov/cpi/data.htm website and downloaded the series data from this link: https://download.bls.gov/pub/time.series...

5876 sym Python (3380 sym/13 pcs) 3 img

DATA607Week4Project1

19.02.2024

Introduction The goal of this project is to create an R Markdown file that generates a .CSV file containing all of the information for all of the chess players. Import the Libraries library(tidyverse) library(openintro) Load the data # Read the text file data <- read_lines("https://raw.githubusercontent.com/pujaroy280/DATA607Project1/main/tou...

6499 sym Python (2370 sym/14 pcs)

DATA607Week3

11.02.2024

Introduction The purpose of this assignment is to work with data manipulation involving strings and by using regex and testing the regular expressions by programming in R. Import Libraries library(tidyverse) library(openintro) Part 1 Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/fe...

9517 sym Python (7037 sym/44 pcs)

DATA607Week2

07.02.2024

Part 3: Transfer data from SQL database to R dataframe # Import the libraries library(RMySQL) library(DBI) # Connect to the database con <- dbConnect(RMySQL::MySQL(), user = "puja.roy11", password = "puja.roy11", host = "cunydata607sql.mysql.database.azure.com", port = 3306,...

191 sym R (493 sym/5 pcs)

DATA607Week1

05.02.2024

Link to Article: https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/ Overview: This data set is about the majors that recent grads from colleges/universities have studied/pursued. It provides information about recent grads including both men and women employment rates, major category, and how many students perform...

2823 sym 1 img