Publications by James Liu

rpub_test

02.11.2022

R Markdown This is an R Markdown document. Markdown is a simple formatting syntax for authoring HTML, PDF, and MS Word documents. For more details on using R Markdown see http://rmarkdown.rstudio.com. When you click the Knit button a document will be generated that includes both content as well as the output of any embedded R code chunks within t...

591 sym 1 img

RPUBs 11/10

10.11.2022

This Portfolio will walk you through using PCA for what I call pseudo-cluster analysis. You will also examine the PCA scores generated by PCA, how they correlate with themselves, and how they correlate to the original data features. This will help illustrate how the vectors in a biplot relate to the original data and the layout of the points with...

8857 sym R (9965 sym/37 pcs) 16 img

Rpubs sleep

10.11.2022

Preliminaries Download the vegan package Only do this once, then comment out of the script. You may have already done this this for a previous assignment. ##install.packages("vegan") Load the libraries library(ggplot2) library(vegan) ## Loading required package: permute ## Loading required package: lattice ## This is vegan 2.6-4 Load the mslee...

2190 sym R (2298 sym/21 pcs) 5 img

Software Checkpoint 11/28

28.11.2022

This code checkpoint will make sure that you can load .vcf files on your computer. You will also review vocab and concepts related to .vcf files and SNPs. You will need to load an analyze a .vcf file on the final exam. Make notes on all of this material and include it on your notes sheet. Learning objectives This material will appear on the fina...

5485 sym R (3871 sym/24 pcs)

DocumentSoftware Checkpoint - Setting a working directory and loading data Start Assignment

29.11.2022

Learning objectives All of this material will appear on the exam. Take notes on the workflow, functions, and concepts. Main objectives By the end of this lesson you will know how to.. set a working directory in RStudio confirm the location of the working directory with getwd() confirm a file is present with and list.files(pattern = ...) load ty...

4461 sym Python (3483 sym/20 pcs) 2 img

Portfolio: Working with SNP files 1 - loading and transposing VCF files Start Assignment

02.12.2022

NOTE - before you begin, make sure your WORKING DIRECTORY is set to the location of the .vcf file being used. Learning objectives All of this material will appear on the exam. Take notes on the workflow, functions, and concepts. Set a working directory and confirm a file is present with getwd() and list.files(pattern = ...) Know what it means t...

4791 sym R (26668 sym/40 pcs)

Portfolio: Working with VCF files 2: Removing invariant columns Start Assignment

02.12.2022

Learning objectives This lesson introduces the concept of invariant columns and why they should be removed. It also provides a function to remove them. All of this material will appear on the exam. Take notes on the workflow, functions, and concepts. Main objectives By the end of this lesson you will Understand what can lead to a column of SNP...

7494 sym R (6488 sym/58 pcs)

In-class Exercise: Studying Natural Selection with SNPs Start Assignment

06.12.2022

Packages Install missing package ##install.packages("kgp") Load packages library(vcfR) ## ## ***** *** vcfR *** ***** ## This is vcfR 1.13.0 ## browseVignettes('vcfR') # Documentation ## citation('vcfR') # Citation ## ***** ***** ***** ***** library(kgp) library(ggplot2) library(ggpubr) The pa...

809 sym R (2958 sym/23 pcs) 2 img

Portfolio: Working with VCF files 3: Removing samples with many NAs Start Assignment

07.12.2022

Learning objectives Review the problem of missing data in SNP datasets Introduce the concept of researcher degrees of freedom Review how to locate NAs in R using is.na() and which() Outline the use of for() loops in R to carry out repetitive tasks. Review the use of regular expressions to clean text data. Introduction The data we are using to p...

11098 sym R (10031 sym/94 pcs) 3 img

Portfolio: Working with VCF files 4: Imputation of missing data Start Assignment

07.12.2022

Learning objectives All of this material will appear on the exam. Take notes on the workflow, functions, and concepts. Main objectives By the end of this lesson you will know how to … Identify all of the missing values in a column of a dataframe or vector Replaces all the NAs in a column with a new value, such as the mean. Know how a for() lo...

7433 sym 2 img