Publications by Laura Burenkov

Data 605 Final

20.05.2024

Overview You are to register for Kaggle.com (free) and compete in the House Prices: Advanced Regression Techniques competition. https://www.kaggle.com/c/house-prices-advanced-regression-techniques . I want you to do the following. Pick one of the quanititative independent variables from the training data set (train.csv) , and define that variabl...

34661 sym Python (98577 sym/152 pcs) 45 img

Blog 5

14.05.2024

Blog Entry 5: Predictive Analytics in Travel Planning Title: Predictive Analytics in Travel Planning Overview In today’s fast-paced world, efficient travel planning is essential for individuals, businesses, and transportation authorities alike. Predictive analytics, particularly using machine learning models like Random Forest, can revolution...

4997 sym Python (1654 sym/7 pcs) 1 img

Blog 1

13.05.2024

Blog Entry 1: Introduction to Descriptive Statistics with Iris Dataset Title: Exploring the Iris Dataset: A Statistical Journey Overview In this blog entry, we delve into the fascinating world of statistics through the lens of the famous Iris dataset. The Iris dataset is a classic in the field of statistics and data science, comprising measure...

5089 sym 2 img

Blog 2

13.05.2024

Blog Entry 2: Understanding Probability Distributions Title: Understanding Probability Distributions Overview In this blog entry, we delve into the exploration and analysis of automotive data using the mtcars dataset available in R. This dataset comprises measurements of various attributes for 32 different car models from the early 1970s, inclu...

6165 sym 7 img

Blog 3

13.05.2024

Blog Entry 3: Analyzing Car Performance Data with Statistical Methods Title: Analyzing Car Performance Data with Statistical Methods Overview In this blog entry, we get into the world of automotive engineering using the “cars” dataset available in R. Cars are integral parts of our daily lives, and understanding their performance characteris...

4630 sym 2 img

Blog 4

13.05.2024

Blog Entry 4: Logistic regression with Iris dataset Title: Logistic regression with Iris dataset Overview In this blog post, we explored the application of logistic regression using the famous iris dataset. Logistic regression is a powerful statistical method used for binary classification tasks, where the goal is to predict the probability of ...

6600 sym Python (4112 sym/17 pcs) 1 img

Story 7

13.05.2024

Overview The US Geological Survey publishes a list of Strategic Minerals ( https://www.usgs.gov/news/national-news-release/us-geological-survey-releases-2022-list-critical-minerals ). Having a secure supply of these minerals is essential to our security and economic prosperity. However many of these minerals are sourced from outside of the US. T...

23832 sym Python (36042 sym/34 pcs) 8 img 2 tbl

621final

09.05.2024

library(dplyr) library(tidyr) library(ggplot2) ## Warning: package 'ggplot2' was built under R version 4.3.3 library(flexdashboard) ## Warning: package 'flexdashboard' was built under R version 4.3.3 library(tidyverse) library(gapminder) ## Warning: package 'gapminder' was built under R version 4.3.3 library(ggthemes) ## Warning: package 'ggthe...

10418 sym R (14061 sym/44 pcs) 4 img 1 tbl

Data 605 A15

09.05.2024

Exercise 1 Find the equation of the regression line for the given points. Round any final values to the nearest hundredth, if necessary. ( 5.6, 8.8 ), ( 6.3, 12.4 ), ( 7, 14.8 ), ( 7.7, 18.2 ), ( 8.4, 20.8 ) To find the equation of the regression line for the given points, we’ll use linear regression. Let’s denote the given points as \((x_i,...

10974 sym

Discussion 15

07.05.2024

Exercise 23 For the function \(f(x, y, z) = \frac{x}{x + 2y - 4z}\), the domain and range are as follows: Domain: The function is defined for all real numbers \(x\), \(y\), and \(z\) such that the denominator \(x + 2y - 4z\) is not equal to zero. Thus, the domain can be expressed as: \[ \text{Domain} = \{(x, y, z) \in \mathbb{R}^3 \mid x + 2y -...

6539 sym