Publications by Puja Roy
UpdatedDATA607Project3
Load the packages library(tidyverse) library(openintro) library('DBI') library('RMySQL') Introduction For this project, the aim is to obtain data to answer the question, “Which are the most valued data science skills?” We obtained the Data Science Job Salaries dataset from Kaggle: https://www.kaggle.com/datasets/ruchi798/data-science-job-...
27864 sym Python (267425 sym/43 pcs) 9 img
DATA607Week7
Introduction The objective of this assignment is to select three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Take the information that you’ve selected about these three...
7737 sym
DATA607Project2
Introduction Load Libraries library(tidyverse) library(openintro) library(readr) library(tidyr) library(dplyr) library(ggplot2) Exercise 1 Insert any text here. url <- "https://raw.githubusercontent.com/pujaroy280/DATA607Project2/main/MTA_Daily_Ridership_Data__Beginning_2020_20240303.csv" MTA_data <- read.csv(url) head(MTA_data) ## ...
19051 sym Python (2313895 sym/68 pcs) 4 img
DATA608Week4Story2
Introduction For Story#2, I obtained the data for Unemployment Rate (Bureau of Labor Statistics) from the https://www.bls.gov/data/#unemployment website, Consumer Price Index (CPI) (Bureau of Labor Statistics) from the https://www.bls.gov/cpi/data.htm website and downloaded the series data from this link: https://download.bls.gov/pub/time.series...
5876 sym Python (3380 sym/13 pcs) 3 img
DATA607Week4Project1
Introduction The goal of this project is to create an R Markdown file that generates a .CSV file containing all of the information for all of the chess players. Import the Libraries library(tidyverse) library(openintro) Load the data # Read the text file data <- read_lines("https://raw.githubusercontent.com/pujaroy280/DATA607Project1/main/tou...
6499 sym Python (2370 sym/14 pcs)
DATA607Week3
Introduction The purpose of this assignment is to work with data manipulation involving strings and by using regex and testing the regular expressions by programming in R. Import Libraries library(tidyverse) library(openintro) Part 1 Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset [https://fivethirtyeight.com/fe...
9517 sym Python (7037 sym/44 pcs)
DATA607Week2
Part 3: Transfer data from SQL database to R dataframe # Import the libraries library(RMySQL) library(DBI) # Connect to the database con <- dbConnect(RMySQL::MySQL(), user = "puja.roy11", password = "puja.roy11", host = "cunydata607sql.mysql.database.azure.com", port = 3306,...
191 sym R (493 sym/5 pcs)
DATA607Week1
Link to Article: https://fivethirtyeight.com/features/the-economic-guide-to-picking-a-college-major/ Overview: This data set is about the majors that recent grads from colleges/universities have studied/pursued. It provides information about recent grads including both men and women employment rates, major category, and how many students perform...
2823 sym 1 img