Publications by giuseppev

Censis analysis

29.07.2024

# Load necessary libraries library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, setequal, union library(tidyr) library(ggplot2) # Data for large universities large_universities_data <-...

15 sym R (6000 sym/8 pcs) 4 img

CSS STM

03.05.2023

STM Author Giuseppe A. Veltri STM in R Structural Topic Modeling (STM) is a method for analyzing textual data that allows you to estimate topics while accounting for document-level covariates. The stm package in R is widely used for conducting STM analysis. Here’s an example of how to perform STM using the stm package in R: The first step is...

21182 sym Python (74910 sym/28 pcs) 6 img

CSS DT

27.04.2023

Decision Trees Author Giuseppe A. Veltri Decision Trees in R A decision tree is a type of supervised machine learning used to categorize or make predictions based on how a previous set of questions were answered. The model is a form of supervised learning, meaning that the model is trained and tested on a set of data that contains the desired...

3767 sym Python (14321 sym/90 pcs) 4 img 1 tbl

CSS-PCA

13.04.2023

Table of contents Principal Component Analysis or PCA The curse of dimensionality Variance and Covariance Intuition behind the PCA In plain language Eigenvectors and eigenvalues PCA using R Descriptive statistics Correlation matrix PCA Variances of the principal components Graph of individus and variables Variables factor map : The correlation ...

7939 sym R (8397 sym/37 pcs) 17 img

CSS LCA

13.04.2023

LCA Author Giuseppe A. Veltri Starting point Observed indicators are caused by an unobserved, or latent, variable of interest. Study the patterns of interrelationships among the observed indicators to understand and characterise the underlying latent variable Used factor analysis – continuous latent variables (generally continuous observed i...

6278 sym 2 img 1 tbl

CSS-RCA

13.04.2023

RCA Author Giuseppe A. Veltri RCA (Relational Class Analysis) and CCA (Correlational Class Analysis) RCA and CCA are graph partitioning methods based on the assumption that individuals are related one another to the extent to which they construct meaning in a similar way. Both methods are specifically designed to address the challenge of findin...

4798 sym Python (6197 sym/22 pcs) 5 img

CSS Association rules

18.04.2023

Association Rules Author Giuseppe A. Veltri Association rules Association analysis identifies relations or correlations between observations and/or between variables in our datasets.These relationships are then expressed as a collection of “association rules”. Is a core technique of data mining. Is very useful for mining very large transact...

3323 sym Python (8236 sym/28 pcs) 2 img 1 tbl

Dottorato Text mining 1

13.02.2020

#######Text analysis Exercise########### library(tm) ## Loading required package: NLP #library(topicmodels) library(gutenbergr) library(dplyr) ## ## Attaching package: 'dplyr' ## The following objects are masked from 'package:stats': ## ## filter, lag ## The following objects are masked from 'package:base': ## ## intersect, setdiff, se...

13 sym R (9205 sym/55 pcs) 8 img

AI test

15.07.2020

Data and R packages This is an R Markdown document. library(party) ## Loading required package: grid ## Loading required package: mvtnorm ## Loading required package: modeltools ## Loading required package: stats4 ## Loading required package: strucchange ## Loading required package: zoo ## ## Attaching package: 'zoo' ## The following objects are...

377 sym R (7372 sym/21 pcs) 5 img