# Load necessary libraries
library(dplyr)
library(tidyr)
library(ggplot2)
# Data for large universities
large_universities_data <-...
STM
Author: Giuseppe A. Veltri
STM in R
Structural Topic Modeling (STM) is a method for analyzing textual data that allows you to estimate topics while accounting for document-level covariates. The stm package in R is widely used for conducting STM analysis. Here’s an example of how to perform STM using the stm package in R: The first step is...
Decision Trees
Author: Giuseppe A. Veltri
Decision Trees in R
A decision tree is a supervised machine learning model that classifies cases or predicts an outcome based on the answers to a sequence of questions. Supervised learning means that the model is trained and tested on a set of data that contains the desired...
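As a minimal sketch of the idea (not the post's own example, which is cut off above), the following fits a classification tree and checks it on held-out data. The rpart package, the built-in iris data, and the 70/30 split are all assumptions chosen for illustration.

```r
# Illustrative sketch: classification tree on the built-in iris data.
# Assumption: rpart (shipped with standard R installations) is used here;
# the original post may rely on a different package.
library(rpart)

set.seed(123)  # make the random train/test split reproducible

# Hold out 30% of the rows for testing
n <- nrow(iris)
train_idx <- sample(seq_len(n), size = round(0.7 * n))
train <- iris[train_idx, ]
test  <- iris[-train_idx, ]

# Each internal node of the fitted tree is a yes/no question about one predictor
fit <- rpart(Species ~ ., data = train, method = "class")
print(fit)

# Predict the class of unseen flowers and compare with the known labels
pred <- predict(fit, newdata = test, type = "class")
confusion <- table(predicted = pred, actual = test$Species)
accuracy <- sum(diag(confusion)) / sum(confusion)
print(confusion)
print(accuracy)
```

Printing the fitted object lists the splitting questions in order, which is usually the quickest way to see how the tree reaches a prediction.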
Table of contents
Principal Component Analysis or PCA
The curse of dimensionality
Variance and Covariance
Intuition behind the PCA
In plain language
Eigenvectors and eigenvalues
PCA using R
Descriptive statistics
Correlation matrix
PCA
Variances of the principal components
Graph of individuals and variables
Variables factor map: The correlation ...
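The link between the "Eigenvectors and eigenvalues", "Correlation matrix" and "Variances of the principal components" entries can be sketched in a few lines of base R. This is an illustration only: the built-in USArrests data stand in for whatever dataset the post uses, and standardizing the variables is an assumption.

```r
# Illustrative sketch: PCA as an eigendecomposition of the correlation matrix.
# Assumption: the built-in USArrests data stand in for the post's own dataset.
X <- USArrests

# Correlation matrix of the variables (the covariance of the standardized data)
R <- cor(X)

# Eigenvectors give the directions of the components, eigenvalues their variances
eig <- eigen(R)

# prcomp() on centered and scaled data should recover the same quantities
pca <- prcomp(X, center = TRUE, scale. = TRUE)

# Variances of the principal components and the share of variance each explains
variances <- pca$sdev^2
prop_var  <- variances / sum(variances)
print(round(cbind(eigenvalue = eig$values,
                  variance   = variances,
                  proportion = prop_var), 3))

# Loadings agree with the eigenvectors up to an arbitrary sign flip per component
sign_free_diff <- abs(pca$rotation) - abs(eig$vectors)
print(round(sign_free_diff, 6))
```

The comparison uses absolute values because the sign of each component is arbitrary: flipping a whole eigenvector leaves the explained variance unchanged.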
LCA
Author: Giuseppe A. Veltri
Starting point
Observed indicators are caused by an unobserved, or latent, variable of interest.
Study the patterns of interrelationships among the observed indicators to understand and characterise the underlying latent variable.
Used factor analysis – continuous latent variables (generally continuous observed i...
RCA
Author: Giuseppe A. Veltri
RCA (Relational Class Analysis) and CCA (Correlational Class Analysis)
RCA and CCA are graph-partitioning methods based on the assumption that individuals are related to one another to the extent that they construct meaning in a similar way. Both methods are specifically designed to address the challenge of findin...
Association Rules
Author: Giuseppe A. Veltri
Association rules
Association analysis identifies relations or correlations between observations and/or between variables in our datasets. These relationships are then expressed as a collection of “association rules”. It is a core technique of data mining and is very useful for mining very large transact...
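To make the notion of a rule concrete, here is a small base R sketch that scores a single rule by hand. The five baskets and the rule {bread} => {butter} are invented for illustration, and support, confidence and lift are the standard measures for such rules rather than something taken from the excerpt above. For real transaction data a dedicated package such as arules would normally be used instead.

```r
# Illustrative sketch: measures of one association rule on toy market-basket data.
# Assumption: the baskets and the rule {bread} => {butter} are made up for illustration.
baskets <- list(
  c("bread", "butter", "milk"),
  c("bread", "butter"),
  c("bread", "jam"),
  c("milk", "butter"),
  c("bread", "butter", "jam")
)

# Share of baskets that contain every item in `items`
support <- function(items, transactions) {
  mean(vapply(transactions, function(t) all(items %in% t), logical(1)))
}

lhs <- "bread"
rhs <- "butter"

supp_rule  <- support(c(lhs, rhs), baskets)       # P(bread and butter)
confidence <- supp_rule / support(lhs, baskets)   # P(butter | bread)
lift       <- confidence / support(rhs, baskets)  # confidence relative to P(butter)

print(c(support = supp_rule, confidence = confidence, lift = lift))
```

In this toy data the rule holds in 3 of 5 baskets (support 0.6) with confidence 0.75. The lift of about 0.94 is slightly below 1, so buying bread does not make butter more likely than it already is overall.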
#######Text analysis Exercise###########
library(tm)
#library(topicmodels)
library(gutenbergr)
library(dplyr)...
Data and R packages
This is an R Markdown document.
library(party)...
