Publications by Dirk Hartog
Wk7: Working with XML, HTML, and JSON
library(tidyverse) library(jsonlite) library(XML) library(xml2) library(stringr) library(rvest) #Pick three of your favorite books on one of your favorite subjects. At least one of the books should have more than one author. For each book, include the title, authors, and two or three other attributes that you find interesting. Step 1: Create three...
767 sym R (2262 sym/4 pcs)
Wk6: Project 2_Movies
The first step was to read in the csv file from guthub. movieurl <- "https://raw.githubusercontent.com/D-hartog/DATA607/main/PROJECT2/movies_untidy.csv" movies_tv <- read_csv(movieurl) ## Rows: 9999 Columns: 9 ## ── Column specification ────────────────────────────────────�...
3766 sym Python (9182 sym/25 pcs) 2 img
Wk6: Project 2_World
library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.3 ✔ tibble 3.2.1 ## ✔ lubridate 1.9.2 ✔ tidyr 1.3.0 ## ...
1233 sym R (8753 sym/24 pcs) 3 img
Wk6: Project 2_Spotify
This data set contains a comprehensive list of the most famous songs of 2023 as listed on Spotify. The data set offers a wealth of features beyond what is typically available in similar data sets. It provides insights into each song’s attributes, popularity, and presence on various music platforms. For my analysis I thought it would be interestin...
1817 sym R (4093 sym/11 pcs) 3 img
Wk5: Tidying and Transforming Data
library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidyverse) ## ── Attaching core tidyverse packages ─────────────────�...
857 sym R (4447 sym/19 pcs) 2 img
Wk4: Project 1
In this project, you’re given a text file with chess tournament results where the information has some structure. Your job is to create an R Markdown file that generates a .CSV file (that could for example be imported into a SQL database). library(tidyverse) url <- "https://raw.githubusercontent.com/D-hartog/DATA607/main/PROJECT1/chess.txt" ches...
982 sym R (2276 sym/12 pcs)
Wk3: Character Manipulation and Data Processing
## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.3 ✔ tibble 3.2.1 ## ✔ lubridate 1.9.2 ✔ tidyr 1.3.0 ## ✔ purrr 1.0.1...
1916 sym
Wk1: Loading Data Into a Data Frame
OVERVIEW The article and the subsequent data that I choose to work with is titled “Congress Today Is Older Than It’s Ever Been: OK, boomer? More like boomer, OK!” publised on the FiveThirtyEight.com. The articl can be found here The article describes some basic statistics, identifies and visualizes trends in the ages of the memebers of The Ho...
2497 sym R (2469 sym/11 pcs) 2 img
R Bridge homework wk 3
“Illustration of the globe with trendlines” The data set that I am looking at is the growth regression data as provided by Durlauf & Johnson (1995). There are 10 variables that are captured for each country (observation). The 10 variables are: oil: factor. Is the country an oil-producing country? inter: factor. Does the country have better qu...
6005 sym R (9076 sym/21 pcs) 8 img
R-programming Bridge Course W2 Homework
Read in a .csv file from github using the raw data link BONUS – place the original .csv in a github file and have R read from the link. urlfile <- 'https://raw.githubusercontent.com/D-hartog/csv_file/main/mcu_films.csv' mcu <- read.csv(urlfile) head(mcu) Cleaned up data frame by Dropping the column labeled “X” that carried in index labels C...
1534 sym