Publications by Group_Project: Venkata Naga Vamsidhar reddy karasani(vkara4), Anila Cheekati(vchee3), Venkata sai ram tirunagari(Vtiru5) , Pradeep kumar Naidu(Pnaid2), Simhadri Ramanjaneyulu(rsimh3), Subhalaxmi Rout(srout2)

Assignment 2 Hands on with KNN and Naïve Bayes Classifiers

27.02.2023

Problem 1: Applying k-Nearest Neighbors to predict income For this assignment, we will be using the census adult dataset from UCI ML repository. The Adult dataset was extracted by Barry Becker from the 1994 US Census Database. Each row in the dataset has de-identified dempgraphic information of an individual worker and their income. The income is a...

13452 sym Python (45357 sym/127 pcs) 17 img

Assignment 2

24.02.2023

Problem 1: Applying k-Nearest Neighbors to predict income For this assignment, we will be using the census adult dataset from UCI ML repository. The Adult dataset was extracted by Barry Becker from the 1994 US Census Database. Each row in the dataset has de-identified dempgraphic information of an individual worker and their income. The income is a...

5881 sym Python (16003 sym/53 pcs) 4 img

ML Assignment 1

19.02.2023

Problem 1- Exploring Vegas Hotel Dataset 1). Explore the overall structure of the dataset using str(). When you read the data,set StringAsFactors=True to let R convert String variables to factors. # load the vegas dataset into a dataframe # header set to true to indicate first row is column names # string as factors = true to convert string varia...

16433 sym R (21840 sym/121 pcs) 28 img

DATA 607 Final Project

11.05.2020

Final Project Research Objective Data Sources: Necessary R Packages: Gather Data Daily Presidential White House Briefings Stock Market Data Trump Approval Ratings Cleaning Data Clean WH Briefing Data Clean Approval Ratings Data Analysis White House Briefing & Sentiment Analysis Trump approval ratings analysis Stock Market Performance Twitter ...

10264 sym R (35822 sym/93 pcs) 15 img

DATA 607 Final Project

06.05.2020

Data source 2 Twitter API In this analysis, we use the twitter account of Donald Trump. All Twitter posts collected from Donald Trump’s twitter account realDonaldTrump post for corona pandemic. We will request for 10,000 tweets related to #COVID-19 and #realDonaldTrump from March 15 to April 30 in 2020 for analysis. We use the data science sof...

5270 sym R (20411 sym/37 pcs) 12 img

DATA 606 LAB 9

03.05.2020

Grading the professor Many college courses conclude by giving students the opportunity to evaluate the course and the instructor anonymously. However, the use of these student evaluations as an indicator of course quality and teaching effectiveness is often criticized because these measures may reflect the influence of non-teaching related charac...

13336 sym R (7563 sym/37 pcs) 15 img 1 tbl

DATA 606 Final Project

04.05.2020

DATA 606 Final Project Subhalaxmi Rout 4/29/2020 Introduction The Company ABC has very poor employee satisfaction and retention. Company has decided to look into the request of paying their employees for overtime hours. The information available for the sample employees includes currently available information such as satisfaction, number of pro...

5881 sym R (30743 sym/56 pcs) 11 img

DATA 606 Homework 9

02.05.2020

Baby weights, Part I. (9.1, p. 350) The Child Health and Development Studies investigate a range of topics. One study considered all pregnancies between 1960 and 1967 among women in the Kaiser Foundation Health Plan in the San Francisco East Bay area. Here, we study the relationship between smoking and weight of the baby. The variable smoke is c...

7996 sym R (840 sym/11 pcs) 2 img

Project 4 Document Classification

26.04.2020

Project 4: Document Classification Subhalaxmi Rout 2020-04-26 Introduction For this project, I will be classifying emails using the ham and spam dataset. I will use Naive Bayes Algorithm to create a model. After creating the model, I will predict the model and check the accuracy of the model. Data Loading I need to change the data into a usa...

1780 sym R (6197 sym/12 pcs) 2 img

Tidyverse Extend

19.04.2020

The assignment is to present a use case for the tidyverse packages and sdemonstrate how to use one or more of the capabilities TidyVerse package with your selected dataset 1) Libraries and Data Load needed libraries # The easiest way to get all libraries is to load the whole tidyverse but we will load just the packages we need #library(tidyvers...

3908 sym R (6956 sym/31 pcs) 3 img 1 tbl