Publications by Thanh Dat

Topics in Academic Reading

08.02.2022

1 Scrape data with rvest 1.1 All tests available in 4study 1.2 Scrape Cambridge 10 to 15 1.3 Passages to a data frame 2 Statistics with tidyverse & tidytext 2.1 Sentences 2.2 Words 3 Clean documents with tidytext & qdap 3.1 Top frequent words 3.2 Remove stopwords 3.3 Clean words 4 Topic modeling with quanteda & s...

11211 sym R (13360 sym/41 pcs) 4 img 4 tbl

String Manipulation with stringr in R

01.02.2022

1 String basics 1.1 Welcome! 1.2 Turning numbers into strings 1.3 Putting strings together 2 Introduction to stringr 2.1 Introducing stringr 2.2 Hunting for matches 2.3 Splitting strings 2.4 Replacing matches in strings 3 Pattern matching with regex 3.1 Regular expressions 3.2 More regular expressions 3.3 ...

64354 sym R (54001 sym/378 pcs) 2 img

Introduction to Text Analysis in R

30.01.2022

1 Wrangling Text 1.1 Text as data 1.2 Counting categorical data 1.3 Tokenizing and cleaning 2 Visualizing Text 2.1 Plotting word counts 2.2 Improving word count plots 2.3 Faceting word count plots 2.4 Plotting word clouds 3 Sentiment Analysis 3.1 Sentiment dictionaries 3.2 Appending dictionaries 3.3 Improv...

19294 sym R (21272 sym/89 pcs) 12 img

Text Mining with Bag-of-Words in R

31.01.2022

1 Text mining with bag of words 1.1 What is text mining? 1.2 Getting started 1.3 Clean and process text 1.4 The TDM & DTM 2 Word clouds & interesting visuals 2.1 Common text mining visuals 2.2 Intro to word clouds 2.3 Other clouds & word networks 3 Adding to your tm skills 3.1 Simple word clustering 3.2 Ge...

62758 sym R (68577 sym/327 pcs) 24 img

Reading Cambridge

06.02.2022

IELTS Reading 1 IELTS Cambridge 11 test 1 passage 1 2 IELTS Cambridge 11 test 1 passage 2 3 IELTS Cambridge 11 test 1 passage 3 4 IELTS Cambridge 11 test 2 passage 1 5 IELTS Cambridge 11 test 2 passage 2 6 IELTS Cambridge 11 test 2 passage 3 7 IELTS Cambridge 11 test 3 passage 1 8 IELTS Cambridge 11 test 3 passage 2 9 IELTS Cambridge 11 test 3 p...

324704 sym

Machine Learning with Tree-Based Models in Python

03.04.2022

1 Classification and Regression Trees 1.1 Decision tree for classification 1.2 Classification tree Learning 1.3 Decision tree for regression 2 The Bias-Variance Tradeoff 2.1 Generalization Error 2.2 Diagnose bias and variance problems 2.3 Ensemble Learning 3 Bagging and Random Forests 3.1 Ba...

33984 sym R (19591 sym/158 pcs) 4 img

Supervised Learning with scikit-learn

01.04.2022

1 Classification 1.1 Supervised learning 1.2 The classification challenge 1.3 Measuring model performance 2 Regression 2.1 Introduction to regression 2.2 The basics of linear regression 2.3 Cross-validation 2.4 Regularized regression 3 Fine-tuning your model 3.1 How good is your model? 3.2 Logistic regr...

59077 sym R (51024 sym/275 pcs) 9 img

Machine Learning with caret in R

25.03.2022

1 Regression models: fitting them and evaluating their performance 1.1 Welcome to the Toolbox 1.2 Out-of-sample error measures 1.3 Cross-validation 2 Classification models: fitting them and evaluating their performance 2.1 Logistic regression on sonar 2.2 Confusion matrix 2.3 Class probabilities and prediction...

55571 sym R (280282 sym/157 pcs) 5 img

Market Basket Analysis in R

02.03.2022

1 Introduction to Market Basket Analysis 1.1 Market basket introduction 1.2 Item combinations 1.3 What is market basket analysis 2 Metrics & Techniques 2.1 Transactional Data 2.2 Metrics 2.3 The apriori algorithm 2.4 “If this then that” 3 Visualization in Market Basket Analysis 3.1 Let’s see what’s in the...

40104 sym R (88032 sym/204 pcs) 32 img

Going Down to South Park: A Text Analysis

01.03.2022

1 Import and explore data 2 Sentiments, swear words, and stemming 3 Summarize data by episode 4 South Park overall sentiment 5 South Park episode popularity 6 Are naughty episodes more popular? 7 Comparing profanity of two characters 8 Is Eric Cartman the naughtiest character? 9 Let’s answer some quest...

11399 sym R (6877 sym/24 pcs) 5 img 1 tbl