Publications by Phuong Linh

Mixed Data Clustering: k-prototype

27.03.2022

1 Loading Library library(tidyverse) library(clustMixType) 2 Loading theme my_theme <- function(base_size = 10, base_family = "sans"){ theme_minimal(base_size = base_size, base_family = base_family) + theme( axis.text = element_text(size = 10), axis.text.x = element_text(angle = 0, vjust = 0.5, hjust = 0.5), axis.title = ...

723 sym R (87435 sym/35 pcs) 11 img

Mixed Data Clustering: clustMixType

27.03.2022

1 Loading Library library(tidyverse) library(clustMixType) 2 Data iris %>% glimpse() ## Rows: 150 ## Columns: 5 ## $ Sepal.Length <dbl> 5.1, 4.9, 4.7, 4.6, 5.0, 5.4, 4.6, 5.0, 4.4, 4.9, 5.4, 4.… ## $ Sepal.Width <dbl> 3.5, 3.0, 3.2, 3.1, 3.6, 3.9, 3.4, 3.4, 2.9, 3.1, 3.7, 3.… ## $ Petal.Length <dbl> 1.4, 1.4, 1.3, 1.5, 1.4, 1.7, 1.4, 1.5, 1...

294 sym R (51422 sym/26 pcs) 7 img

Brand Health Check

17.03.2022

BRAND HEALTH CHECK - 2019 Brand-Awareness Row Chart 1. Brand TOM (unpromted) Chart 2. Brand TOM (promted) Row Chart 3. Dịch vụ đang sử dụng thường xuyên nhất Chart 4. Lần đầu tiên biết đến dịch vụ Row Chart 5. Biết đến dịch vụ từ Kênh Row Chart 6. Mức độ tiếp xúc với dịch vụ (ng...

1098 sym 32 img

data wrangling_active user & promotion reaction

17.03.2022

1 Library loading library(tidyverse) library(readxl) 2 Data wrangling 2.1 Past behavior Q1 <- df1 %>% ggplot(aes(x= Question_1, group = NhomKH)) + geom_bar(aes(y = ..prop.., fill = factor(..x..)), stat="count") + geom_text(aes( label = scales::percent(..prop..), y= ..prop.. ), stat= "count", vjust = -.0) + labs(...

147 sym R (2093 sym/5 pcs) 4 img

data wrangling_user clustering

16.03.2022

1 Library Loading library(readxl) library(tidyverse) library("readxl") library(knitr) library(data.table) library(dplyr) library(ggplot2) library(tidyr) library(writexl) library(pastecs) library(fpc) library(magrittr) data <- read_excel("/Users/admin/Documents/Linh-R Studio/Irisgo/IRISGO_data.xlsx") 2 Data Wrangling 2.1 Change format from long ...

268 sym R (2063 sym/7 pcs) 1 img 2 tbl

data wrangling_app usage behavior (1)

16.03.2022

1 Library Loading library(tidyverse) library("readxl") library("writexl") library(tidyr) library(dplyr) library(lubridate) 2 Data Wrangling 2.1 Remove duplicated data Active <- Aug30 Active <- Active %>% group_by(UserID,ActiveDate) %>% summarise(totalusage = sum(`Usage Time (minute)`)) Aug30_a <- Aug30 %>% select(UserID,RedeemedAt_text) Aug3...

186 sym R (4396 sym/7 pcs) 3 tbl

data wrangling_transaction data

16.03.2022

1 Library Loading library(readxl) library(tidyverse) library(readxl) library(knitr) library(data.table) library(dplyr) library(ggplot2) library(tidyr) library(writexl) library(pastecs) library(fpc) library(magrittr) 2 Data Uploading data <- read_excel("Transaction_data.xlsx", sheet = "raw") data$userId <- NULL 3 Data Wrangling 3.1 Remove missi...

391 sym R (1957 sym/29 pcs)

Customer Purchase Analysis

07.03.2022

1 Library Loading library(tidyverse) ## -- Attaching packages -------------------------------------------------- tidyverse 1.3.0 -- ## v ggplot2 3.2.1 v purrr 0.3.3 ## v tibble 2.1.3 v dplyr 0.8.4 ## v tidyr 1.0.2 v stringr 1.4.0 ## v readr 1.3.1 v forcats 0.4.0 ## -- Conflicts --------------------------------------------...

237 sym R (13672 sym/69 pcs) 9 img

Correlation & Cronbach's alpha Analysis

07.03.2022

1 Input Data knitr::opts_chunk$set(echo = TRUE) setwd("/Users/admin/Documents/Linh-R Studio/ESS - HR/ESS") library("readxl") my_data <- read_excel("ESS-Nopass.xlsx") my_data <- my_data[ -c(1) ] my_data <- na.omit(my_data) test <- my_data 2 Correlation for a specific variable (means of each item) my_data$DH <- rowMeans(my_data[,1:3]) my_data$...

511 sym R (39226 sym/172 pcs) 4 img

Travel Demand Analysis & Prediction

07.03.2022

1 Data Processing Data <- bind_rows(Data2013,Data2014,Data2015,Data2016,Data2017,Data2018,Data2019,Data2020_JanFeb,Data2020_Mar,Data2020_Apr,Data2020_May,Data2020_Jun) 2 Descriptive Analysis 2.1 Route: SGN - HAN & HAN - SGN SGN_HAN <- Data %>% filter(Route %in% c("HAN-SGN", "SGN-HAN")) # Number of flight over time df <- SGN_HAN %>% g...

189 sym R (6564 sym/25 pcs) 13 img