Publications by giuseppev
Censis analysis
# Load necessary libraries library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidyr) library(ggplot2) # Data for large universities large_universities_data <-...
15 sym R (6000 sym/8 pcs) 4 img
CSS STM
STM Author Giuseppe A. Veltri STM in R Structural Topic Modeling (STM) is a method for analyzing textual data that allows you to estimate topics while accounting for document-level covariates. The stm package in R is widely used for conducting STM analysis. Here’s an example of how to perform STM using the stm package in R: The first step is...
21182 sym Python (74910 sym/28 pcs) 6 img
CSS DT
Decision Trees Author Giuseppe A. Veltri Decision Trees in R A decision tree is a type of supervised machine learning model used to categorize or make predictions based on how a previous set of questions was answered. The model is a form of supervised learning, meaning that the model is trained and tested on a set of data that contains the desired...
3767 sym Python (14321 sym/90 pcs) 4 img 1 tbl
CSS-PCA
Table of contents Principal Component Analysis or PCA The curse of dimensionality Variance and Covariance Intuition behind the PCA In plain language Eigenvectors and eigenvalues PCA using R Descriptive statistics Correlation matrix PCA Variances of the principal components Graph of individuals and variables Variables factor map: The correlation ...
7939 sym R (8397 sym/37 pcs) 17 img
CSS LCA
LCA Author Giuseppe A. Veltri Starting point Observed indicators are caused by an unobserved, or latent, variable of interest. Study the patterns of interrelationships among the observed indicators to understand and characterise the underlying latent variable Used factor analysis – continuous latent variables (generally continuous observed i...
6278 sym 2 img 1 tbl
CSS-RCA
RCA Author Giuseppe A. Veltri RCA (Relational Class Analysis) and CCA (Correlational Class Analysis) RCA and CCA are graph partitioning methods based on the assumption that individuals are related to one another to the extent to which they construct meaning in a similar way. Both methods are specifically designed to address the challenge of findin...
4798 sym Python (6197 sym/22 pcs) 5 img
CSS Association rules
Association Rules Author Giuseppe A. Veltri Association rules Association analysis identifies relations or correlations between observations and/or between variables in our datasets. These relationships are then expressed as a collection of “association rules”. It is a core technique of data mining. It is very useful for mining very large transact...
3323 sym Python (8236 sym/28 pcs) 2 img 1 tbl
Dottorato Text mining 1
#######Text analysis Exercise########### library(tm) ## Loading required package: NLP #library(topicmodels) library(gutenbergr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, se...
13 sym R (9205 sym/55 pcs) 8 img
AI test
Data and R packages This is an R Markdown document. library(party) ## Loading required package: grid ## Loading required package: mvtnorm ## Loading required package: modeltools ## Loading required package: stats4 ## Loading required package: strucchange ## Loading required package: zoo ## ## Attaching package: 'zoo' ## The following objects are...
377 sym R (7372 sym/21 pcs) 5 img