Publications by Wing Siang Chang (s3918329)

5.1.1

30.09.2021

Normalisation techniques. There are three main types of normalisation techniques. Centring and scaling: centring (also known as mean-centring) subtracts the variable's average from every observation. Let \(y\) denote the variable on its original scale and \(\bar{y}\) its average. The centred variable \(y'\) is defined as: \[ y' = y - ...

4889 sym R (4363 sym/17 pcs)
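The centring step described in the 5.1.1 preview can be illustrated with a minimal base R sketch. The vector `y` and its values are made up for illustration and do not come from the publication; the sketch only assumes that centring means subtracting the mean, as the preview states.

```r
# Hypothetical example values (not taken from the publication).
y <- c(4, 8, 15, 16, 23, 42)

# Centring: subtract the variable's average from every observation.
y_bar <- mean(y)
y_centred <- y - y_bar

# After centring, the mean of the variable is zero.
mean(y_centred)

# base R's scale() gives the same result when only centring is requested.
y_scaled <- as.numeric(scale(y, center = TRUE, scale = FALSE))
all.equal(y_centred, y_scaled)
```

Setting `scale = FALSE` matters here: with the default `scale = TRUE`, `scale()` would also divide by the standard deviation, which is scaling rather than centring alone.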

5.1.2

30.09.2021

...

7 sym 2 img

5.2.1

30.09.2021

Equal-width (distance) binning. In equal-width binning, the variable's range is divided into \(n\) intervals of equal size. If \(y_{max}\) and \(y_{min}\) are the maximum and minimum values of the variable, the width of each interval is: \[ w = \frac{y_{max} - y_{min}}{n} \] Thus, you need to define the number of intervals, \(n\), prior ...

1486 sym R (1077 sym/7 pcs) 3 img
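The width formula in the 5.2.1 preview can be tried out with a short base R sketch. The values in `y` and the choice `n <- 4` are assumptions made for illustration; the publication may well use a dedicated binning function instead of `cut()`.

```r
# Hypothetical example values (not taken from the publication).
y <- c(2, 5, 7, 11, 14, 18, 22)
n <- 4                               # number of intervals, chosen in advance

# Interval width: w = (y_max - y_min) / n
w <- (max(y) - min(y)) / n

# n + 1 cut points running from the minimum to the maximum
breaks <- min(y) + w * (0:n)

# include.lowest = TRUE keeps the minimum value inside the first interval
bins <- cut(y, breaks = breaks, include.lowest = TRUE)
table(bins)
```

Every interval has the same width, but the counts per interval usually differ, which is the main trade-off of equal-width binning compared with equal-frequency binning.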

2.5.1

01.10.2021

Practice with the tidyr package Use table1, table2, table3, table4a, table4b and table5 datasets which are under {tidyr} package. As you can see, each dataset shows the same values of four variables, country, year, population, and cases, but each dataset organises the values in a different way. Click on the tables below to use them for completing...

4130 sym R (6441 sym/32 pcs)

2.5.1 Pipe Operator

01.10.2021

The pipe operator, %>%. One of the most frequently used operators is the pipe, %>%, which passes the object on its left as the first argument of the function on its right. The dataset therefore does not have to be repeated inside each call, and only the remaining arguments need to be written. It originates from the {magrittr} package but is also made available by many {tidyverse} packages, including {dplyr} and {tidyr}. CTRL + SHIFT + ...

1925 sym R (1343 sym/5 pcs)
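A small sketch of what the pipe described in the "2.5.1 Pipe Operator" preview does. It assumes the {magrittr} package is installed; the vector `x` is made up for illustration.

```r
library(magrittr)

# Hypothetical example values (not taken from the publication).
x <- c(1, 4, 9, 16)

# Nested calls read inside-out.
nested <- round(mean(sqrt(x)), 1)

# The same computation with the pipe reads left to right: each result
# becomes the first argument of the next function.
piped <- x %>% sqrt() %>% mean() %>% round(1)

identical(nested, piped)
```

Only the extra argument of `round()` (the number of digits) has to be written in the piped version, because the data argument is supplied by the pipe.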

DW 6.3.1

06.10.2021

Convert strings to dates
Step 1: Read the dataset:
candy <- read.csv("data/candy_production.csv")
# We're using base R to read the CSV because we want it to be read as a character
# string, so we can learn to manually change it to date format. If we used
# readr::read_csv(), it would automatically be read in date format:
candy2 <- readr::read_...

2178 sym R (3134 sym/27 pcs)
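The conversion that the DW 6.3.1 preview is building towards can be sketched in base R. The data frame `candy_demo`, its column names, and its values are hypothetical stand-ins, since the preview does not show the structure of the real CSV; the publication itself may use a different function for the conversion.

```r
# Hypothetical stand-in for a CSV whose date column was read as character.
# Column names and values are assumptions, not the real dataset.
candy_demo <- data.frame(
  date_chr = c("2021-01-01", "2021-02-01", "2021-03-01"),
  value = c(10, 20, 30),
  stringsAsFactors = FALSE
)

class(candy_demo$date_chr)

# Convert the character column to Date, stating the format explicitly.
candy_demo$date <- as.Date(candy_demo$date_chr, format = "%Y-%m-%d")

class(candy_demo$date)

# Date arithmetic now works: days between the first two observations.
days_between <- as.numeric(candy_demo$date[2] - candy_demo$date[1])
```

Stating `format` explicitly is a defensive choice: if the strings do not match it, `as.Date()` returns `NA` instead of silently guessing.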

A3 Plots

09.10.2021

...

7 sym 2 img

A3 Draft 1

10.10.2021

title: 'Data Wrangling Assessment Task 3: Dataset challenge'
author: "Wing Siang Chang, s3918329"
subtitle: CPI and All Ordinaries Index (Australia)
output:
  html_document:
    df_print: paged
  html_notebook: default
Feel free to DELETE the instructional text provided in the template. If you have any questions regarding the assignment instructi...

3228 sym R (20938 sym/167 pcs) 5 img

A3Draft10-10-21v2

11.10.2021

Required packages
library(readxl)
## Warning: package 'readxl' was built under R version 4.1.1
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
##     filter, lag
## The following objects are masked from 'package:base':
##
##     intersect, setdiff, setequal, union
Data
# CPI a...

943 sym R (19676 sym/160 pcs) 5 img

A3Draft11-10-21v2

12.10.2021

Voiceover Presentation URL: https://www.loom.com/share/035ada5bf8a74f32af56ba874f828a98
Required packages
library(readxl)
## Warning: package 'readxl' was built under R version 4.1.1
library(dplyr)
##
## Attaching package: 'dplyr'
## The following objects are masked from 'package:stats':
##
##     filter, lag
## The following objects are m...

1033 sym R (20080 sym/161 pcs) 6 img