Publications by Thanh Dat
Topics in Academic Reading
Contents: 1 Scrape data with rvest; 1.1 All tests available in 4study; 1.2 Scrape Cambridge 10 to 15; 1.3 Passages to a data frame; 2 Statistics with tidyverse & tidytext; 2.1 Sentences; 2.2 Words; 3 Clean documents with tidytext & qdap; 3.1 Top frequent words; 3.2 Remove stopwords; 3.3 Clean words; 4 Topic modeling with quanteda & s...
11211 sym R (13360 sym/41 pcs) 4 img 4 tbl
String Manipulation with stringr in R
Contents: 1 String basics; 1.1 Welcome!; 1.2 Turning numbers into strings; 1.3 Putting strings together; 2 Introduction to stringr; 2.1 Introducing stringr; 2.2 Hunting for matches; 2.3 Splitting strings; 2.4 Replacing matches in strings; 3 Pattern matching with regex; 3.1 Regular expressions; 3.2 More regular expressions; 3.3 ...
64354 sym R (54001 sym/378 pcs) 2 img
Introduction to Text Analysis in R
Contents: 1 Wrangling Text; 1.1 Text as data; 1.2 Counting categorical data; 1.3 Tokenizing and cleaning; 2 Visualizing Text; 2.1 Plotting word counts; 2.2 Improving word count plots; 2.3 Faceting word count plots; 2.4 Plotting word clouds; 3 Sentiment Analysis; 3.1 Sentiment dictionaries; 3.2 Appending dictionaries; 3.3 Improv...
19294 sym R (21272 sym/89 pcs) 12 img
Text Mining with Bag-of-Words in R
Contents: 1 Text mining with bag of words; 1.1 What is text mining?; 1.2 Getting started; 1.3 Clean and process text; 1.4 The TDM & DTM; 2 Word clouds & interesting visuals; 2.1 Common text mining visuals; 2.2 Intro to word clouds; 2.3 Other clouds & word networks; 3 Adding to your tm skills; 3.1 Simple word clustering; 3.2 Ge...
62758 sym R (68577 sym/327 pcs) 24 img
Reading Cambridge
IELTS Reading. Contents: 1 IELTS Cambridge 11 test 1 passage 1; 2 IELTS Cambridge 11 test 1 passage 2; 3 IELTS Cambridge 11 test 1 passage 3; 4 IELTS Cambridge 11 test 2 passage 1; 5 IELTS Cambridge 11 test 2 passage 2; 6 IELTS Cambridge 11 test 2 passage 3; 7 IELTS Cambridge 11 test 3 passage 1; 8 IELTS Cambridge 11 test 3 passage 2; 9 IELTS Cambridge 11 test 3 p...
324704 sym
Machine Learning with Tree-Based Models in Python
Contents: 1 Classification and Regression Trees; 1.1 Decision tree for classification; 1.2 Classification tree Learning; 1.3 Decision tree for regression; 2 The Bias-Variance Tradeoff; 2.1 Generalization Error; 2.2 Diagnose bias and variance problems; 2.3 Ensemble Learning; 3 Bagging and Random Forests; 3.1 Ba...
33984 sym R (19591 sym/158 pcs) 4 img
Supervised Learning with scikit-learn
Contents: 1 Classification; 1.1 Supervised learning; 1.2 The classification challenge; 1.3 Measuring model performance; 2 Regression; 2.1 Introduction to regression; 2.2 The basics of linear regression; 2.3 Cross-validation; 2.4 Regularized regression; 3 Fine-tuning your model; 3.1 How good is your model?; 3.2 Logistic regr...
59077 sym R (51024 sym/275 pcs) 9 img
Machine Learning with caret in R
Contents: 1 Regression models: fitting them and evaluating their performance; 1.1 Welcome to the Toolbox; 1.2 Out-of-sample error measures; 1.3 Cross-validation; 2 Classification models: fitting them and evaluating their performance; 2.1 Logistic regression on sonar; 2.2 Confusion matrix; 2.3 Class probabilities and prediction...
55571 sym R (280282 sym/157 pcs) 5 img
Market Basket Analysis in R
Contents: 1 Introduction to Market Basket Analysis; 1.1 Market basket introduction; 1.2 Item combinations; 1.3 What is market basket analysis; 2 Metrics & Techniques; 2.1 Transactional Data; 2.2 Metrics; 2.3 The apriori algorithm; 2.4 “If this then that”; 3 Visualization in Market Basket Analysis; 3.1 Let’s see what’s in the...
40104 sym R (88032 sym/204 pcs) 32 img
Going Down to South Park: A Text Analysis
Contents: 1 Import and explore data; 2 Sentiments, swear words, and stemming; 3 Summarize data by episode; 4 South Park overall sentiment; 5 South Park episode popularity; 6 Are naughty episodes more popular?; 7 Comparing profanity of two characters; 8 Is Eric Cartman the naughtiest character?; 9 Let’s answer some quest...
11399 sym R (6877 sym/24 pcs) 5 img 1 tbl