Publications by Maia Lim

Final Portfolio: Maia Lim

18.12.2021

Introduction This code compiles summary information about the gene EGLN1 (Egl-9 Family Hypoxia Inducible Factor 1). Links: RefSeq page: https://www.ncbi.nlm.nih.gov/gene/54583 Homologene page: https://www.ncbi.nlm.nih.gov/homologene/56936 UniProt page: https://www.uniprot.org/uniprot/F6P6J7 PDB page: NA To compare the evolutionary relationship be...

3525 sym R (33248 sym/144 pcs) 6 img 4 tbl

Portfolio: Predicting AA Characteristics

10.12.2021

Key vocab proteinogenic amino acids regression model / line of best fit pI confidence intervals (CI) confidence ellipse correlation coefficient Selenocysteine and Pyrrolysine re-coding stop codons y = m*x + b slope intercept Key functions / packages ggpubr pander lm() coef() cor() round() Predict pI for an Selenocysteine and Pyrrolysine Amino...

5664 sym R (4858 sym/37 pcs) 1 img 4 tbl

Review Portfolio (downloading, cleaning, and aligning data)

15.11.2021

The goal of this exercise is to make you familiar with how to download data from Google Sheets and to briefly review some key concepts R functions and coding concepts. We’ll do the following things -DOWLOAD A LIST OF REFSEQ ACCESSION NUMBERS FROM GOOGLE SHEETS -USE NA.OMIT() TO REMOVE THE NAS -USE DUPLICATED())M TO SELECT OUT ALL ISOFORMS EXCEP...

4484 sym R (17512 sym/122 pcs) 1 img

Google Sheets Access Test

15.11.2021

The goal of this exercise is to make you familiar with how to download data from Google Sheets and to briefly review some key concepts R functions and coding concepts. We’ll do the following things download a list of RefSeq accessions from a Google sheet remove the NAs using na.omit() select out all but one isoform using duplicated() Packages...

865 sym R (2756 sym/25 pcs)

GWAS Pie Graphs

02.11.2021

Introduction The following graphs uses information collected by Alice B. Popejoy and Stephanie M. Fullerton from the genome-wide association studies (GWAS) Catalog. The GWAS Catalog is produced by the US National Human Genome Research Institute in partnership with the European Bioinformatics Institute where every week, curators are notified of an...

653 sym R (1741 sym/4 pcs) 3 img

Introduction to DotPlots

26.10.2021

Sequence dotplots in R By: Avril Coghlan. Adapted, edited and expanded: Nathan Brouwer under the Creative Commons 3.0 Attribution License (CC BY 3.0). NOTE: I’ve added some new material that is rather terse and lacks explication. Good sources of more info: https://omicstutorials.com/interpreting-dot-plot-bioinformatics-with-an-example/ http://r...

4332 sym R (1587 sym/13 pcs) 8 img

Adjusting Dotplot Settings

26.10.2021

Add the necessary code to make this script function. Preliminaries Load packages library(seqinr) library(rentrez) library(compbio4all) library(Biostrings) ## Loading required package: BiocGenerics ## Loading required package: parallel ## ## Attaching package: 'BiocGenerics' ## The following objects are masked from 'package:parallel': ## ## ...

1196 sym R (6349 sym/40 pcs) 5 img

Maia Lim: Downloading and Saving FASTA File

16.10.2021

This is a modification of “DNA Sequence Statistics” from Avril Coghlan’s A little book of R for bioinformatics.. Most of the text and code was originally written by Dr. Coghlan and distributed under the Creative Commons 3.0 license. NOTE: There is some redundancy in this current draft that needs to be eliminated. Functions library() help...

12806 sym R (2126 sym/40 pcs)

Knit to HTML

16.10.2021

Default working directory Open up this .Rmd file in RStudio. In the code chunk below type getwd() and run it. “wd” means “working directory”, or where R will currently save files if you tell it to save anything. # Get the current working directory getwd() ## [1] "/Users/maialim/University of Pittsburgh/compbio2021 - mrl106" Copy and paste...

1708 sym R (314 sym/4 pcs)

Document

14.10.2021

Default working directory Open up this .Rmd file in RStudio. In the code chunk below type getwd() and run it. “wd” means “working directory”, or where R will currently save files if you tell it to save anything. # Get the current working directory (where R saves things by default) getwd() ## [1] "/Users/maialim/Downloads" Copy and paste t...

1708 sym R (328 sym/4 pcs)