Publications by Alexander Simon
DATA607 Project 3
Introduction We selected the Data Science Skills option for Project 3. Here, we describe how we identified and collected the data, built the database, and ran the analyses to answer the question, “Which are the most valued data science skills?” We divided this question into two parts, hard (technical) skills and soft skills, and analyzed each separately...
18305 sym R (21232 sym/68 pcs) 21 img 3 tbl
DATA607 Assignment 7
0. Packages I used the jsonlite package to work with JSON files and the httr2, rvest, xml2, and xmltools packages to work with XML and HTML files. I also used the kableExtra package to create tables. If needed, you can install them using the commands below. install.packages("httr2") install.packages("jsonlite") install.packages("kableExtra") instal...
2550 sym Python (4153 sym/31 pcs) 3 tbl
DATA607 Project 2 - Dataset 3 of 3
0. Packages I used the kableExtra package to create tables and the sf and nycgeo packages to create maps. If needed, you can install them using the commands below. install.packages("kableExtra") install.packages("sf") remotes::install_github("mfherman/nycgeo") 1. Introduction Long-term exposure to air pollutants can negatively impact health and in...
14016 sym Python (9386 sym/46 pcs) 9 img 11 tbl
DATA607 Project 2 - Dataset 2 of 3
0. Packages I used the kableExtra package to create tables and the hexbin package to perform hexagonal binning (explained in section 4.2.2. Correlations). If needed, you can install them using the commands below. install.packages("kableExtra") install.packages("hexbin") 1. Introduction FIFA21 is a football simulation video game. Here, I analyze Lu...
7532 sym Python (9367 sym/36 pcs) 7 img 6 tbl
DATA607 Project 2 - Dataset 1 of 3
0. Packages I used the kableExtra package to create tables. If needed, you can install it using the command below. install.packages("kableExtra") 1. Introduction For this analysis, I tidied and analyzed the dataset that I chose for Discussion 5. The dataset contains information about the cost of publishing open-access journal articles about medica...
7430 sym Python (18823 sym/31 pcs) 3 img 9 tbl
Assignment 5
Preface I experimented with the kableExtra package to display the output of code blocks more nicely than the default, which limits the number of columns shown. If not already installed on your computer, you can run the command below. install.packages("kableExtra") Introduction This assignment aims to explain how analyses of individual groups of da...
5013 sym Python (4840 sym/15 pcs) 5 img 7 tbl
Project 1 Extra Credit
Introduction The Elo rating system is a statistical method for rating and comparing the relative skill levels of players in games such as chess. The difference in Elo ratings between two players can be used to predict the outcome of a game (i.e., a win or loss). The rating difference also determines the number of points each player gains from a win or los...
7069 sym Python (5253 sym/19 pcs) 2 img
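As a rough illustration of how a rating difference maps to a predicted outcome, the sketch below uses the standard Elo expected-score formula, E = 1 / (1 + 10^((opponent rating - player rating) / 400)), together with the usual update rule. The function names `elo_expected` and `elo_update` and the K-factor of 32 are assumptions made for this example; they are not taken from the project itself.

```r
# Expected score (between 0 and 1) of a player against an opponent,
# using the standard Elo formula.
elo_expected <- function(rating, opp_rating) {
  1 / (1 + 10^((opp_rating - rating) / 400))
}

# Rating after one game. score is 1 for a win, 0.5 for a draw, 0 for a loss.
# k = 32 is an assumed K-factor; real rating systems use various values.
elo_update <- function(rating, opp_rating, score, k = 32) {
  rating + k * (score - elo_expected(rating, opp_rating))
}

# A 200-point favorite is expected to score about 0.76 per game.
elo_expected(1600, 1400)

# The favorite gains fewer points from a win than the underdog would.
elo_update(1600, 1400, score = 1)
elo_update(1400, 1600, score = 1)
```

Two equally rated players each have an expected score of 0.5, so with this K-factor the winner gains 16 points.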
DATA607 Project 1
Introduction The aim of this project is to use a text file with chess tournament results to create a summary of each player’s name, state, total points, pre-rating, and the average pre-rating of their opponents in games that resulted in a win, loss, or draw. Data Source I downloaded the tournament info file (tournamentinfo.txt) from Blackboard and ...
3953 sym Python (2207 sym/14 pcs)
DATA607 Assignment 3
1. Using the 173 majors listed in fivethirtyeight.com’s College Majors dataset, provide code that identifies the majors that contain either “DATA” or “STATISTICS”. I downloaded majors-list.csv from https://github.com/fivethirtyeight/data/tree/master/college-majors and saved it to my GitHub repository. This file contains college majors in t...
4468 sym
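One way to approach the question in this assignment is a single regular expression with alternation. The sketch below is a minimal example, not the author's code: it runs on a small stand-in data frame rather than the downloaded file, and the column name `Major` and the three sample rows are assumptions chosen for illustration.

```r
# Stand-in for the majors list; the real file would be read with read.csv().
# The column name "Major" and these rows are assumptions for this example.
majors <- data.frame(
  Major = c("COMPUTER PROGRAMMING AND DATA PROCESSING",
            "STATISTICS AND DECISION SCIENCE",
            "GENERAL BUSINESS"),
  stringsAsFactors = FALSE
)

# grepl() returns TRUE for each major whose name matches either word.
is_match <- grepl("DATA|STATISTICS", majors$Major)

# Keep only the matching major names.
matches <- majors$Major[is_match]
print(matches)
```

The pattern is case sensitive, which suits a list stored in upper case; passing `ignore.case = TRUE` to `grepl()` would relax that.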
DATA607 Assignment 1
Introduction This assignment is based on an article called “How ‘Qi’ and ‘Za’ Changed Scrabble” by Oliver Roeder that was posted on fivethirtyeight.com in April 2017. The article discusses how the introduction of 2 high-value words, “qi” and “za”, to the Scrabble word game in March 2006 may have increased players’ scores, part...
4772 sym Python (2695 sym/12 pcs) 1 img