Publications by Mingzhuo Tian

Final Projects

13.12.2023

Introduction to Housing Market Analysis with the Ames Housing Dataset Photo by Tom Thain on Unsplash: https://storage.googleapis.com/kaggle-media/competitions/House%20Prices/kaggle_5407_media_housesbanner.png The goal of this project is to visualize a few statistical relationships in Dean De Cock’s Ames Housing for use in data science educa...

10882 sym R (8378 sym/27 pcs) 3 img

Project 2: World Educational Data

15.11.2023

Introduction to Global Education Analysis This analysis is based on the “Global Education” dataset, which is rich in educational measures from all over the world. It includes important factors including enrollment, out-of-school, and literacy rates in addition to demographic and economic statistics like birth and unemployment rates. Priorit...

6873 sym R (12420 sym/27 pcs) 3 img

Project 2: World Educational Data

15.11.2023

Introduction to Global Education Analysis This analysis is based on the “Global Education” dataset, which is rich in educational measures from all over the world. It includes important factors including enrollment, out-of-school, and literacy rates in addition to demographic and economic statistics like birth and unemployment rates. Priorit...

6859 sym R (12420 sym/27 pcs) 3 img

CDC 500 Cities/PLACES GIS Data Analysis

08.11.2023

knitr::include_graphics("https://www.cdc.gov/places/about/500-cities-2016-2019/images/500-cities-logo.png") The 500 cities datasets from the Centers for Disease Control and Prevention (CDC), It describes how to load and handle a dataset that includes health indicators from various American cities. The dataset contains variables such as populati...

2891 sym R (15350 sym/26 pcs) 2 img

CDC 500 Cities/PLACES GIS Data Analysis

08.11.2023

knitr::include_graphics("https://www.cdc.gov/places/about/500-cities-2016-2019/images/500-cities-logo.png") The 500 cities datasets from the Centers for Disease Control and Prevention (CDC), It describes how to load and handle a dataset that includes health indicators from various American cities. The dataset contains variables such as populati...

3694 sym R (49284 sym/33 pcs) 3 img

Web Scraping

01.11.2023

Loading the rvest package library('rvest') Loading the tidyverse package library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4....

1644 sym R (22851 sym/107 pcs) 1 img

Web Scraping in R (using rvest)

01.11.2023

Loading the rvest package library('rvest') Loading the tidyverse package library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4....

1644 sym R (22783 sym/107 pcs) 1 img

Week 8

25.10.2023

# load required packages library(tidyverse) ## ── Attaching core tidyverse packages ──────────────────────── tidyverse 2.0.0 ── ## ✔ dplyr 1.1.2 ✔ readr 2.1.4 ## ✔ forcats 1.0.0 ✔ stringr 1.5.0 ## ✔ ggplot2 3.4.3 ✔ tibble 3.2.1 ## ✔ lubridate 1.9...

963 sym R (6961 sym/20 pcs) 2 img

Wages by Education in the USA (1973-2022)

18.10.2023

wages_data <- readr::read_csv("data/wages_by_education.csv") ## Rows: 50 Columns: 61 ## ── Column specification ──────────────────────────────────────────────────────── ## Delimiter: "," ## dbl (61): year, less_than_hs, high_school...

9547 sym R (7305 sym/16 pcs) 4 img

Wages by Education in the USA (1973-2022)

18.10.2023

wages_data <- readr::read_csv("data/wages_by_education.csv") ## Rows: 50 Columns: 61 ## ── Column specification ──────────────────────────────────────────────────────── ## Delimiter: "," ## dbl (61): year, less_than_hs, high_school...

9533 sym R (7305 sym/16 pcs) 4 img