Publications by Anna Podolskaya

VizQuiz_3

12.12.2020

Original graph: https://eduhseru.sharepoint.com/sites/AdvancedDataAnalysis/Shared%20Documents/General/Viz%20Quiz%203/viz_2waves.png The first thing, that is not nice about this two graphs (actually, it combines two points in one) - if it was supposed to be two graphs for comparison - it is nearly to impossible compare it correctly. First - differ...

2282 sym R (2010 sym/5 pcs) 1 img

Scraping - University Ratings from the Web

12.12.2020

Wiki table scrape library(httr) wiki <- GET("https://en.wikipedia.org/wiki/Times_Higher_Education_World_University_Rankings") wiki ## Response [https://en.wikipedia.org/wiki/Times_Higher_Education_World_University_Rankings] ## Date: 2020-12-12 10:20 ## Status: 200 ## Content-Type: text/html; charset=UTF-8 ## Size: 274 kB ## <!DOC...

9693 sym R (9152 sym/45 pcs) 2 img

Portfolio_DataAnalysis_2020

12.12.2020

All can be found here: https://rpubs.com/Nuta Scrape the data from Web : https://rpubs.com/Nuta/scraping Coding Reflection Paper : https://rpubs.com/Nuta/descr_stat Logistic Regression ft. dealing w/ missing data : https://rpubs.com/Nuta/log_reg Cluster Analysis : https://rpubs.com/Nuta/claster_an Principle Component Analysis : https://rpubs.com...

560 sym

VizQuiz_1

12.12.2020

Task: * Look at the picture and evaluate whether it is the correct type of graph for the data provided. * ‘Anything wrong?’ If you see an element that could be corrected or improved, name it. * Which graph would fit better? Suggest any improvements and describe them here. * Show it! Use the data and the graph at hand to produce a better grap...

1988 sym R (250 sym/1 pcs) 1 img

PCA_principle component analysis

12.12.2020

Data description In the data of The Times Higher Education World University Rankings 2021 we have information about 1448 universities from 92 countries (top-6 are US, Japan, UK, China, India, Brazil and the number of observed universities there very from 174 to 51). From the data we now how universirsities’ scores for their teaching, researches...

4263 sym R (11245 sym/39 pcs) 7 img

VizQuiz_2

12.12.2020

https://eduhseru.sharepoint.com/sites/AdvancedDataAnalysis/Shared%20Documents/General/St.P._stats.jpg Anything wrong? colors Firstly, if we are talking about positive and negative events it is good to use contrasting coloring for the chart. In this case: red for the number of new infected and green/blue for thу recovered. the type of graph I...

1950 sym R (264 sym/1 pcs) 5 img 1 tbl

Binary Logistic Regression ft. missing data

12.12.2020

Project Description The outcome variable is Attending lawful demonstrations in the past 12 months (the question is “Now I’d like you to look at this card. I’m going to read out some different forms of political action that people can take, and I’d like you to tell me, for each one, whether you have actually done any of these things, wheth...

8172 sym R (68406 sym/54 pcs) 15 img 1 tbl

Bayesian re-thinking

12.12.2020

Quick intro to my last year’s research Hey! As I have sometimes mentioned - my last (and, actually, this) year course paper is devoted to exploratory analysis of how parents choose schools: starting from the very first differences in bahavior of families with different socio-economic status, leading to understanding of various strategies of cho...

5974 sym R (2945 sym/11 pcs) 1 img 1 tbl

Cluster Analysis - University data

12.12.2020

Preparations Load the packages for data wrangling (tidyverse, stringr, purrr) and jsonlite for web scraping a JSON: library(tidyverse) library(stringr) library(purrr) require("V8") require("jsonlite") library(V8) library(jsonlite) library(psych) library(kableExtra) library(ggplot2) Extract the page with 2021 ratings as json file: college...

4272 sym R (7797 sym/38 pcs) 12 img

Doing Descriptive Statistics in R

12.12.2020

library(dplyr) Project Description Today I am going to compare different tools and packages in R for presenting statistical descriptions of some data. I will evaluate in in terms of ease of use and how well does it present necessary for me data. Introduction to the data For this project I have decided to let you know a bit of my thesis paper. T...

5205 sym R (13623 sym/30 pcs) 4 img 2 tbl