Publications by Phuong Linh
Mixed Data Clustering: k-prototype
1 Loading Library library(tidyverse) library(clustMixType) 2 Loading theme my_theme <- function(base_size = 10, base_family = "sans"){ theme_minimal(base_size = base_size, base_family = base_family) + theme( axis.text = element_text(size = 10), axis.text.x = element_text(angle = 0, vjust = 0.5, hjust = 0.5), axis.title = ...
723 sym R (87435 sym/35 pcs) 11 img
Mixed Data Clustering: clustMixType
1 Loading Library library(tidyverse) library(clustMixType) 2 Data iris %>% glimpse() ## Rows: 150 ## Columns: 5 ## $ Sepal.Length <dbl> 5.1, 4.9, 4.7, 4.6, 5.0, 5.4, 4.6, 5.0, 4.4, 4.9, 5.4, 4.… ## $ Sepal.Width <dbl> 3.5, 3.0, 3.2, 3.1, 3.6, 3.9, 3.4, 3.4, 2.9, 3.1, 3.7, 3.… ## $ Petal.Length <dbl> 1.4, 1.4, 1.3, 1.5, 1.4, 1.7, 1.4, 1.5, 1...
294 sym R (51422 sym/26 pcs) 7 img
Brand Health Check
BRAND HEALTH CHECK - 2019 Brand-Awareness Row Chart 1. Brand TOM (unpromted) Chart 2. Brand TOM (promted) Row Chart 3. Dịch vụ đang sử dụng thường xuyên nhất Chart 4. Lần đầu tiên biết đến dịch vụ Row Chart 5. Biết đến dịch vụ từ Kênh Row Chart 6. Mức độ tiếp xúc với dịch vụ (ng...
1098 sym 32 img
data wrangling_active user & promotion reaction
1 Library loading library(tidyverse) library(readxl) 2 Data wrangling 2.1 Past behavior Q1 <- df1 %>% ggplot(aes(x= Question_1, group = NhomKH)) + geom_bar(aes(y = ..prop.., fill = factor(..x..)), stat="count") + geom_text(aes( label = scales::percent(..prop..), y= ..prop.. ), stat= "count", vjust = -.0) + labs(...
147 sym R (2093 sym/5 pcs) 4 img
data wrangling_user clustering
1 Library Loading library(readxl) library(tidyverse) library("readxl") library(knitr) library(data.table) library(dplyr) library(ggplot2) library(tidyr) library(writexl) library(pastecs) library(fpc) library(magrittr) data <- read_excel("/Users/admin/Documents/Linh-R Studio/Irisgo/IRISGO_data.xlsx") 2 Data Wrangling 2.1 Change format from long ...
268 sym R (2063 sym/7 pcs) 1 img 2 tbl
data wrangling_app usage behavior (1)
1 Library Loading library(tidyverse) library("readxl") library("writexl") library(tidyr) library(dplyr) library(lubridate) 2 Data Wrangling 2.1 Remove duplicated data Active <- Aug30 Active <- Active %>% group_by(UserID,ActiveDate) %>% summarise(totalusage = sum(`Usage Time (minute)`)) Aug30_a <- Aug30 %>% select(UserID,RedeemedAt_text) Aug3...
186 sym R (4396 sym/7 pcs) 3 tbl
data wrangling_transaction data
1 Library Loading library(readxl) library(tidyverse) library(readxl) library(knitr) library(data.table) library(dplyr) library(ggplot2) library(tidyr) library(writexl) library(pastecs) library(fpc) library(magrittr) 2 Data Uploading data <- read_excel("Transaction_data.xlsx", sheet = "raw") data$userId <- NULL 3 Data Wrangling 3.1 Remove missi...
391 sym R (1957 sym/29 pcs)
Customer Purchase Analysis
1 Library Loading library(tidyverse) ## -- Attaching packages -------------------------------------------------- tidyverse 1.3.0 -- ## v ggplot2 3.2.1 v purrr 0.3.3 ## v tibble 2.1.3 v dplyr 0.8.4 ## v tidyr 1.0.2 v stringr 1.4.0 ## v readr 1.3.1 v forcats 0.4.0 ## -- Conflicts --------------------------------------------...
237 sym R (13672 sym/69 pcs) 9 img
Correlation & Cronbach's alpha Analysis
1 Input Data knitr::opts_chunk$set(echo = TRUE) setwd("/Users/admin/Documents/Linh-R Studio/ESS - HR/ESS") library("readxl") my_data <- read_excel("ESS-Nopass.xlsx") my_data <- my_data[ -c(1) ] my_data <- na.omit(my_data) test <- my_data 2 Correlation for a specific variable (means of each item) my_data$DH <- rowMeans(my_data[,1:3]) my_data$...
511 sym R (39226 sym/172 pcs) 4 img
Travel Demand Analysis & Prediction
1 Data Processing Data <- bind_rows(Data2013,Data2014,Data2015,Data2016,Data2017,Data2018,Data2019,Data2020_JanFeb,Data2020_Mar,Data2020_Apr,Data2020_May,Data2020_Jun) 2 Descriptive Analysis 2.1 Route: SGN - HAN & HAN - SGN SGN_HAN <- Data %>% filter(Route %in% c("HAN-SGN", "SGN-HAN")) # Number of flight over time df <- SGN_HAN %>% g...
189 sym R (6564 sym/25 pcs) 13 img