Publications by Joyce

Course Project JHU_DS_10-2b

17.10.2024

A word prediction app. Coursera capstone project. Joyce Clemente. 2024-10-17 (v.1); 2024-10-17 (last update). Objective and method. Objective: Use 3 corpora (tweet, blog, news) to create a word prediction model used in a Shiny app. Number of unique ngrams (n_size) or collocates (c_size) used in the models. ngram n_size c c_size ugram ...

1830 sym 3 img 2 tbl

Placeholder. Course Project JHU_DS_10-2b

14.10.2024

A word prediction app. Coursera capstone project. Joyce Clemente. 2024-10-12 (v.1); 2024-10-13 (last update). Objective and method. Objective: Use 3 corpora (tweet, blog, news) to create a word prediction model used in a Shiny app....

238 sym

Course Project JHU_DS_10-2a

13.10.2024

A word prediction app. Coursera capstone project. Joyce Clemente. 2024-10-12 (v.1); 2024-10-12 (last update). Objective and method. Objective: Use 3 corpora (tweet, blog, news) to create a word prediction model used in a Shiny app. Number of unique ngrams (n_size) or collocates (c_size) used in the models. ngram n_size c c_size ugram 39...

1662 sym 3 img 2 tbl

Course Project JHU_DS_10-1

07.06.2024

I. Summary. Three corpora, tweet (160 MB), blog (201 MB), and news (197 MB), were explored in preparation for creating a text prediction model. The corpora were explored in three stages: prior to cleaning (full corpus and 10% sub-sample), after reshaping lines to sentences (10% sub-sample), and after cleaning (10% sub-sample). The 10% sub-samples...

15288 sym R (2211 sym/11 pcs) 6 img

Course Project 9-4

08.05.2024

It’s one kg bananas. What could it cost? Comparison of banana prices in Canadian provinces. Joyce Clemente. 2024-05-01. Background. Motivation: Why bananas? Agriculture Canada. Statistical overview of the Canadian fruit industry, 2022. 21% share in 2022 (fruits after losses), average <2 CAD per kg in 2023 (apples: >4.5 CAD per kg). About ...

962 sym Python (599 sym/2 pcs)

Course Project 9-2

08.05.2024

Large Canadian Population Centres (>100,000 people in 2021). April 18, 2024. Click on markers for additional population information. Total population 2021 = 36,991,981. References: Search the Canadian Geographical Names Database (CGNDB). Accessed April 18, 2024. - https://geonames.nrcan.gc.ca/search-place-names/search Statistics Canada. Tabl...

538 sym

Course Project 9-3

08.05.2024

April 24, 2024. Canadian population distribution (2021). LargeUrban = 100k, Medium = 11k - 100k, Small = 82 - 29k people; not_gr is the difference between province or territory vs. (LargeUrban + Medium + Small) community counts; click box to magnify. Provinces and Territories: Alta: Alberta; BC: British Columbia; Man: Manitoba; NB: New Brunswick; NL: Newfoun...

783 sym

Classifying dumbbell lifts using random forest

25.04.2024

Note: This is an edited version of a project created in partial fulfillment of the Coursera “Practical Machine Learning” course by Johns Hopkins University. The project was first submitted through GitHub on April 6, 2024. I. Summary The objective was to use activity sensors to classify the activity of six users into one of five dumbbell li...

10836 sym R (758 sym/5 pcs) 4 img 1 tbl

Classifying dumbbell lifts using random forest

15.04.2024

Note: This is an edited version of a project created in partial fulfillment of the Coursera “Practical Machine Learning” course by Johns Hopkins University. The project was first submitted through GitHub on April 6, 2024. I. Summary The objective was to use activity sensors to classify the activity of six users into one of five dumbbell li...

10871 sym R (980 sym/5 pcs) 4 img 1 tbl

Population Health and Economic Effects of Storm Events (1996-2011)

04.02.2024

N.B. This document was created to fulfill a requirement in the non-credit online Coursera course "Reproducible Research" by Johns Hopkins University. The analysis is incomplete. It is not to be taken as scientific opinion. I. Analysis Summary. The original data collected in the U.S. between 1950 and 2011, which contained 902297 observations and ...

23320 sym R (91912 sym/249 pcs) 3 img 8 tbl