Publications by Mingzhuo Tian
Final Projects
Introduction to Housing Market Analysis with the Ames Housing Dataset Photo by Tom Thain on Unsplash: https://storage.googleapis.com/kaggle-media/competitions/House%20Prices/kaggle_5407_media_housesbanner.png The goal of this project is to visualize a few statistical relationships in Dean De Cock’s Ames Housing for use in data science educa...
10882 sym R (8378 sym/27 pcs) 3 img
Project 2: World Educational Data
Introduction to Global Education Analysis This analysis is based on the “Global Education” dataset, which is rich in educational measures from all over the world. It includes important factors including enrollment, out-of-school, and literacy rates in addition to demographic and economic statistics like birth and unemployment rates. Priorit...
6873 sym R (12420 sym/27 pcs) 3 img
Project 2: World Educational Data
Introduction to Global Education Analysis This analysis is based on the “Global Education” dataset, which is rich in educational measures from all over the world. It includes important factors including enrollment, out-of-school, and literacy rates in addition to demographic and economic statistics like birth and unemployment rates. Priorit...
6859 sym R (12420 sym/27 pcs) 3 img
CDC 500 Cities/PLACES GIS Data Analysis
knitr::include_graphics("https://www.cdc.gov/places/about/500-cities-2016-2019/images/500-cities-logo.png") The 500 cities datasets from the Centers for Disease Control and Prevention (CDC), It describes how to load and handle a dataset that includes health indicators from various American cities. The dataset contains variables such as populati...
2891 sym R (15350 sym/26 pcs) 2 img
CDC 500 Cities/PLACES GIS Data Analysis
knitr::include_graphics("https://www.cdc.gov/places/about/500-cities-2016-2019/images/500-cities-logo.png") The 500 cities datasets from the Centers for Disease Control and Prevention (CDC), It describes how to load and handle a dataset that includes health indicators from various American cities. The dataset contains variables such as populati...
3694 sym R (49284 sym/33 pcs) 3 img
Web Scraping
Loading the rvest package library('rvest') Loading the tidyverse package library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4....
1644 sym R (22851 sym/107 pcs) 1 img
Web Scraping in R (using rvest)
Loading the rvest package library('rvest') Loading the tidyverse package library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4....
1644 sym R (22783 sym/107 pcs) 1 img
Week 8
# load required packages library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.3 ✔ tibble 3.2.1 ## ✔ lubridate 1.9...
963 sym R (6961 sym/20 pcs) 2 img
Wages by Education in the USA (1973-2022)
wages_data <- readr::read_csv("data/wages_by_education.csv") ## Rows: 50 Columns: 61 ## ── Column specification ──────────────────────────────────────────────────────── ## Delimiter: "," ## dbl (61): year, less_than_hs, high_school...
9547 sym R (7305 sym/16 pcs) 4 img
Wages by Education in the USA (1973-2022)
wages_data <- readr::read_csv("data/wages_by_education.csv") ## Rows: 50 Columns: 61 ## ── Column specification ──────────────────────────────────────────────────────── ## Delimiter: "," ## dbl (61): year, less_than_hs, high_school...
9533 sym R (7305 sym/16 pcs) 4 img