Publications by Nathan Brouwer

Accessing Data from Google Docs

16.11.2021

The goal of this exercise is to make you familiar with how to download data from Google Sheets and to briefly review some key concepts R functions and coding concepts. We’ll do the following things download a list of RefSeq accessions from a Google sheet remove the NAs using na.omit() select out all but one isoform using duplicated() Packages...

867 sym R (2919 sym/27 pcs)

FAQ: Adding a Correlation Coefficient to a Scatterplot Using ggpubr

17.12.2021

QUESTION: How do I add a correlation coefficient to my scatterplot when I use ggpubr/ggscatter? To do this, we simply add a small argument into the function. Data We’ll use the “palmerpenguins” packages (https://allisonhorst.github.io/palmerpenguins/) to address this question. You’ll need to install the package with install.packages(“p...

1306 sym R (2091 sym/16 pcs) 2 img

Global Alignment

17.11.2021

Global proteins aligments in R By: Avril Coghlan. Adapted, edited and expanded: Nathan Brouwer under the Creative Commons 3.0 Attribution License (CC BY 3.0). Preliminaries library(compbio4all) library(Biostrings) Download sequences As we did in the previous lesson on dotplots, we’ll look at two sequences. # Download ## sequence 1: APOL1 Hu...

10869 sym R (8417 sym/66 pcs)

Replicating Pie Graphs

03.11.2021

Introduction The data shown in the following pie graphs is from the paper “Genomics is Failing on Diversity” written by Alice B. Popejoy and Stephanie M. Fullerton. They were investigating the proportions of different ethnicities and their representation in Genome-Wide Association Studies, where in 2009, 96% of data came from individuals of E...

1187 sym R (1042 sym/3 pcs) 1 img

Investigating Shroom

26.10.2021

Add the necessary code to make this script function. Preliminaries Load packages library(seqinr) library(rentrez) library(compbio4all) library(Biostrings) ## Loading required package: BiocGenerics ## Loading required package: parallel ## ## Attaching package: 'BiocGenerics' ## The following objects are masked from 'package:parallel': ## ...

1196 sym R (6381 sym/40 pcs) 5 img

Dotplots in R

26.10.2021

Sequence dotplots in R By: Avril Coghlan. Adapted, edited and expanded: Nathan Brouwer under the Creative Commons 3.0 Attribution License (CC BY 3.0). NOTE: I’ve added some new material that is rather terse and lacks explication. Good sources of more info: https://omicstutorials.com/interpreting-dot-plot-bioinformatics-with-an-example/ http://r...

4418 sym R (1907 sym/17 pcs) 8 img

APOL1 Gene Fasta File in R

17.10.2021

This is a modification of “DNA Sequence Statistics” from Avril Coghlan’s A little book of R for bioinformatics.. Most of the text and code was originally written by Dr. Coghlan and distributed under the Creative Commons 3.0 license. NOTE: There is some redundancy in this current draft that needs to be eliminated. Functions library() help...

12904 sym R (2137 sym/40 pcs)

Working Directory Test

14.10.2021

Default working directory Open up this .Rmd file in RStudio. In the code chunk below type getwd() and run it. “wd” means “working directory”, or where R will currently save files if you tell it to save anything. # Get the current working directory getwd() ## [1] "C:/Users/fropo/Downloads" Copy and paste the current wd in the code chunk b...

1796 sym R (222 sym/4 pcs)

Bioinformatics Workflow

29.09.2021

Introduction In biology, phylogenies are used to create a phylogenetic tree that can then be further analyzed. Vocab Multiple Sequence Alignment (MSA) phylogenetic tree fasta file pairwise alignment shroom gene packages cat distance matrix percent identity (PID) list Key functions rentrez::entrez_fetch Biostrings::pairwiseAlignment devtools::in...

4675 sym R (15685 sym/60 pcs) 1 img

Working Directory Test

15.10.2021

Default working directory Open up this .Rmd file in RStudio. In the code chunk below type getwd() and run it. “wd” means “working directory”, or where R will currently save files if you tell it to save anything. # Get the current working directory getwd() ## [1] "C:/Users/fropo/University of Pittsburgh/compbio2021 - ktd16" Copy and paste...

1708 sym R (323 sym/4 pcs)