tidylda: Latent Dirichlet Allocation Using 'tidyverse' Conventions

Implements an algorithm for Latent Dirichlet Allocation (LDA), Blei et al. (2003) <https://www.jmlr.org/papers/volume3/blei03a/blei03a.pdf>, using style conventions from the 'tidyverse', Wickham et al. (2019) <doi:10.21105/joss.01686>, and 'tidymodels', Kuhn et al. <https://tidymodels.github.io/model-implementation-principles/>. Fitting is done via collapsed Gibbs sampling. Also implements several novel features for LDA, such as guided models and transfer learning, based on ongoing and as yet unpublished research.
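
To illustrate what "collapsed Gibbs sampling" means for LDA, here is a minimal sketch in plain Python. It is not the package's code (tidylda is written in R and C++), and it simplifies by using only the final sampler state instead of averaging over post-burn-in iterations. The function name `collapsed_gibbs_lda`, its parameters, the default values, and the toy corpus are all assumptions made for illustration. The names `alpha` and `eta` denote the Dirichlet priors on document-topic and topic-word distributions, and `theta` and `beta` denote the resulting estimates.

```python
import random


def collapsed_gibbs_lda(docs, n_topics, vocab_size,
                        alpha=0.1, eta=0.05, iterations=200, seed=0):
    """Toy collapsed Gibbs sampler for LDA (illustrative only).

    docs: list of documents, each a list of integer word ids
          in the range [0, vocab_size).
    Returns (theta, beta): per-document topic proportions and
    per-topic word distributions, computed from the final counts.
    """
    rng = random.Random(seed)

    # Count tables that summarize the current topic assignments.
    doc_topic = [[0] * n_topics for _ in docs]
    topic_word = [[0] * vocab_size for _ in range(n_topics)]
    topic_total = [0] * n_topics

    # Random initial topic assignment for every token.
    assignments = []
    for d, doc in enumerate(docs):
        z_d = []
        for w in doc:
            z = rng.randrange(n_topics)
            z_d.append(z)
            doc_topic[d][z] += 1
            topic_word[z][w] += 1
            topic_total[z] += 1
        assignments.append(z_d)

    for _ in range(iterations):
        for d, doc in enumerate(docs):
            for i, w in enumerate(doc):
                # Remove the token's current assignment from the counts.
                z = assignments[d][i]
                doc_topic[d][z] -= 1
                topic_word[z][w] -= 1
                topic_total[z] -= 1

                # Conditional probability of each topic given all other
                # assignments (theta and beta are integrated out).
                weights = [
                    (doc_topic[d][k] + alpha)
                    * (topic_word[k][w] + eta)
                    / (topic_total[k] + vocab_size * eta)
                    for k in range(n_topics)
                ]

                # Sample a new topic and add the token back.
                z = rng.choices(range(n_topics), weights=weights)[0]
                assignments[d][i] = z
                doc_topic[d][z] += 1
                topic_word[z][w] += 1
                topic_total[z] += 1

    # Smoothed estimates from the final counts.
    theta = [
        [(c + alpha) / (len(doc) + n_topics * alpha) for c in row]
        for row, doc in zip(doc_topic, docs)
    ]
    beta = [
        [(c + eta) / (topic_total[k] + vocab_size * eta)
         for c in topic_word[k]]
        for k in range(n_topics)
    ]
    return theta, beta


# Hypothetical toy corpus: 4 documents over a 4-word vocabulary.
docs = [[0, 0, 1, 1], [0, 1, 0], [2, 3, 3, 2], [3, 2, 2]]
theta, beta = collapsed_gibbs_lda(docs, n_topics=2, vocab_size=4)
```

Each row of `theta` and each row of `beta` is a probability distribution. The sampler is "collapsed" because it never draws these quantities directly; it resamples only the token-level topic assignments and recovers the distributions from the counts afterwards.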

Version: 0.0.5
Depends: R (≥ 3.5.0)
Imports: dplyr, generics, gtools, Matrix, methods, mvrsquared (≥ 0.1.0), Rcpp (≥ 1.0.2), rlang, stats, stringr, tibble, tidyr, tidytext
LinkingTo: Rcpp, RcppArmadillo, RcppProgress, RcppThread
Suggests: ggplot2, knitr, parallel, quanteda, testthat, tm, slam, spelling, covr, rmarkdown
Published: 2024-04-22
DOI: 10.32614/CRAN.package.tidylda
Author: Tommy Jones [aut, cre], Brendan Knapp [ctb], Barum Park [ctb]
Maintainer: Tommy Jones <jones.thos.w at gmail.com>
BugReports: https://github.com/TommyJones/tidylda/issues
License: MIT + file LICENSE
URL: https://github.com/TommyJones/tidylda/
NeedsCompilation: yes
Language: en-US
Materials: README, NEWS
CRAN checks: tidylda results [issues need fixing before 2025-11-15]

Documentation:

Reference manual: tidylda.html, tidylda.pdf
Vignettes: Probabilistic Coherence (source, R code), Transfer Learning with LDA (tLDA) (source, R code), Introduction to tidylda (source, R code)

Downloads:

Package source: tidylda_0.0.5.tar.gz
Windows binaries: r-devel: tidylda_0.0.5.zip, r-release: tidylda_0.0.5.zip, r-oldrel: tidylda_0.0.5.zip
macOS binaries: r-release (arm64): tidylda_0.0.5.tgz, r-oldrel (arm64): tidylda_0.0.5.tgz, r-release (x86_64): tidylda_0.0.5.tgz, r-oldrel (x86_64): tidylda_0.0.5.tgz
Old sources: tidylda archive

Linking:

Please use the canonical form https://CRAN.R-project.org/package=tidylda to link to this page.