Publications by Lisa Szydziak
HW1 V2 622 Lszydziak
Assignment Visit the following website and explore the range of sizes of this dataset (from 100 to 5 million records). https://eforexcel.com/wp/downloads-18-sample-csv-files-data-sets-for-testing-sales/ Based on your computer’s capabilities (memory, CPU), select 2 files you can handle (recommended one small, one large) Review the structure and ...
4198 sym R (101197 sym/77 pcs) 2 img
622 tree and random forest
#HOMEWORK #2 Based on the latest topics presented, bring a dataset of your choice and create a Decision Tree where you can solve a classification problem and predict the outcome of a particular feature or detail of the data used. Switch variables to generate 2 decision trees and compare the results. Create a random forest for regression and analy...
2816 sym R (9154 sym/54 pcs) 8 img
622 Tree and Random Forest
#HOMEWORK #2 Based on the latest topics presented, bring a dataset of your choice and create a Decision Tree where you can solve a classification problem and predict the outcome of a particular feature or detail of the data used. Switch variables to generate 2 decision trees and compare the results. Create a random forest for regression and analy...
4025 sym R (9154 sym/54 pcs) 8 img
HW4 622
Assignment Homework #4 You get to decide which dataset you want to work on. The data set must be different from the ones used in previous homeworks You can work on a problem from your job, or something you are interested in. You may also obtain a dataset from sites such as Kaggle, Data.Gov, Census Bureau, USGS or other open data portals. Select o...
10733 sym R (33559 sym/45 pcs) 9 img 1 tbl
HW4 622 LSzydziak
Assignment Homework #4 You get to decide which dataset you want to work on. The data set must be different from the ones used in previous homeworks You can work on a problem from your job, or something you are interested in. You may also obtain a dataset from sites such as Kaggle, Data.Gov, Census Bureau, USGS or other open data portals. Select o...
10865 sym R (33559 sym/45 pcs) 9 img 1 tbl
HW3 622 LSzydziak 041022
HOMEWORK #3 Perform an analysis of the dataset used in Homework #2 using the SVM algorithm. Compare the results with the results from previous homework. Based on articles https://www.hindawi.com/journals/complexity/2021/5550344/ https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8137961/ Search for academic content (at least 3 articles) that compare th...
6243 sym R (8420 sym/36 pcs) 1 img