Publications by George

Multidim IRT - Spanish Production

18.12.2020

Multi-dimensional IRT Models Here we attempt to determine whether the latent space of the data is multidimensional via exploratory multi-factor models. We fit exploratory 2- through 8-factor 2PL models, and then compare them. Below is a table showing sequential model comparisons from the ordinary 2PL up to the 8-factor exploratory model. AIC cont...

8772 sym R (506 sym/2 pcs) 8 img

CDI-CAT Validation

17.12.2020

Load the things Data were collected from 250 participants, 47 of whom were excluded based on our preregistered criteria (mismatched demographic data, low birthweight, or developmental delays). An additional 0 participants met our strict exclusion criterion of disagreement on at least 75% of their CAT vs. full CDI responses, but several other pa...

1942 sym R (3757 sym/16 pcs) 6 img

Curiobaby Drop Exp 2 Analysis

21.11.2020

## Loading required package: here ## here() starts at /Users/gkacherg/Documents/GitHub/curiobaby_drop ## Loading required package: tidyverse ## ── Attaching packages ─────────────────────────────────────── tidyverse 1.3.0 ── ## ✓ ggplot2 3.3.2 ✓ purrr ...

1561 sym R (4610 sym/21 pcs) 3 img

Input Distributions

05.11.2020

Data What do the overall adult word count (AWC) per minute look like for different children? Is there greater consistency within- than between-children? d’Apice, Latham, & von Stumm (2019) found that daily home language input varied as much within- as between families (intraclass correlation = .47), but this was computed on the daily totals per...

5805 sym R (6724 sym/17 pcs) 8 img

Wordbank IRT Analysis

17.11.2020

Let’s take a step back from implementing bespoke Stan code for the full Standard Model, with real, unscaled units for input rate, word frequency and difficulty. This was troublesome due to convergence issues in our preferred model which seems to imply issues with identifiability (see also discussion in Buerkner, 2017: unless you have fixed scal...

4302 sym R (11200 sym/22 pcs) 7 img

Curiodrop model analysis

31.01.2021

Preprocessing Load human data And define helper functions ## `summarise()` has grouped output by 'relation', 'drop'. You can override using the `.groups` argument. ## `summarise()` has grouped output by 'relation', 'drop'. You can override using the `.groups` argument. Fit models to all trials (MSE objective) Find best-fitting betas per feature...

3474 sym R (211 sym/1 pcs) 6 img 6 tbl

Peekbank Time Window Analysis

27.01.2021

Motivation Peelle and Van Engen (2020) style multiverse analysis considering possible time windows with logistic growth curve models in a dataset with words of varying frequency, stimuli with varying levels of noise, and with young or old adults. For our analysis, we will restrict ourselves to familiar words, and will model age effects. # get loc...

1496 sym R (4539 sym/11 pcs) 2 img

DLL Word List Evaluation, v2

27.01.2021

Goals The goals are 1) to create a word list that is informative about both English and Spanish vocabulary size and 2) to ensure that there are sufficient doublets to estimate lexical overlap. On an IRT view, we can’t perfectly assess 2 (at least not without better bilingual CDI data), but we can assess criterion 1 - that is, we can look at whe...

10693 sym R (15103 sym/21 pcs) 10 img 4 tbl

Exploratory Multidim IRT - English

13.01.2021

Multi-dimensional IRT Models Here we attempt to determine whether the latent space of the data is multidimensional via exploratory multi-factor models. We fit exploratory 2- through 8-factor 2PL models, and then compare them. ## `summarise()` regrouping output by 'definition', 'lexical_class' (override with `.groups` argument) Below is a table sh...

5088 sym R (430 sym/2 pcs) 8 img 7 tbl

DLL Word List Evaluation, v3

05.02.2021

Goals The goals are 1) to create a word list that is informative about both English and Spanish vocabulary size and 2) to ensure that there are sufficient doublets to estimate lexical overlap. On an IRT view, we can’t perfectly assess 2 (at least not without better bilingual CDI data), but we can assess criterion 1 - that is, we can look at whe...

10876 sym R (7260 sym/5 pcs) 10 img 5 tbl