Package: textrecipes 1.0.7.9000
textrecipes: Extra 'Recipes' for Text Processing
Converting text to numerical features requires purpose-built procedures, which are implemented as steps that follow the conventions of the 'recipes' package. These steps allow for tokenization, filtering, counting (tf and tf-idf) and feature hashing.
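As a rough illustration of how these steps chain together, the sketch below tokenizes a text column, keeps only the most frequent tokens, and converts them to tf-idf features. The data frame `reviews`, its column `text`, and the `max_tokens = 5` setting are made up for this example; only `step_tokenize()`, `step_tokenfilter()` and `step_tfidf()` come from the package's export list.

```r
library(recipes)
library(textrecipes)

# Hypothetical toy data: one character column named `text` (an assumption).
reviews <- data.frame(
  text = c(
    "The quick brown fox",
    "jumps over the lazy dog",
    "the dog barks"
  ),
  stringsAsFactors = FALSE
)

# Build the recipe: tokenize, keep the most frequent tokens, compute tf-idf.
rec <- recipe(~ text, data = reviews) %>%
  step_tokenize(text) %>%
  step_tokenfilter(text, max_tokens = 5) %>%  # illustrative cutoff
  step_tfidf(text)

# Estimate the steps on the training data, then return the processed data.
prepped <- prep(rec)
baked <- bake(prepped, new_data = NULL)

print(baked)
```

Each retained token should end up as one numeric column, so the result can be passed to a model that expects numeric predictors.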
Authors:
textrecipes_1.0.7.9000.tar.gz
textrecipes_1.0.7.9000.zip (r-4.5), textrecipes_1.0.7.9000.zip (r-4.4), textrecipes_1.0.7.9000.zip (r-4.3)
textrecipes_1.0.7.9000.tgz (r-4.4-x86_64), textrecipes_1.0.7.9000.tgz (r-4.4-arm64), textrecipes_1.0.7.9000.tgz (r-4.3-x86_64), textrecipes_1.0.7.9000.tgz (r-4.3-arm64)
textrecipes_1.0.7.9000.tar.gz (r-4.5-noble), textrecipes_1.0.7.9000.tar.gz (r-4.4-noble)
textrecipes_1.0.7.9000.tgz (r-4.4-emscripten), textrecipes_1.0.7.9000.tgz (r-4.3-emscripten)
textrecipes.pdf | textrecipes.html
textrecipes/json (API)
NEWS
# Install 'textrecipes' in R:
install.packages('textrecipes', repos = c('https://tidymodels.r-universe.dev', 'https://cloud.r-project.org'))
Bug tracker: https://github.com/tidymodels/textrecipes/issues
Pkgdown site: https://textrecipes.tidymodels.org
- emoji_samples - Sample sentences with emojis
Last updated 12 days ago from: 6c8f45171d. Checks: 3 OK, 6 NOTE. Indexed: yes.
Target | Result | Latest binary |
---|---|---|
Doc / Vignettes | OK | Jan 24 2025 |
R-4.5-win-x86_64 | OK | Jan 24 2025 |
R-4.5-linux-x86_64 | OK | Jan 24 2025 |
R-4.4-win-x86_64 | NOTE | Jan 24 2025 |
R-4.4-mac-x86_64 | NOTE | Jan 24 2025 |
R-4.4-mac-aarch64 | NOTE | Jan 24 2025 |
R-4.3-win-x86_64 | NOTE | Jan 24 2025 |
R-4.3-mac-x86_64 | NOTE | Jan 24 2025 |
R-4.3-mac-aarch64 | NOTE | Jan 24 2025 |
Exports: %>%, all_tokenized, all_tokenized_predictors, count_functions, ngram, required_pkgs, show_tokens, step_clean_levels, step_clean_names, step_dummy_hash, step_lda, step_lemma, step_ngram, step_pos_filter, step_sequence_onehot, step_stem, step_stopwords, step_text_normalization, step_textfeature, step_texthash, step_tf, step_tfidf, step_tokenfilter, step_tokenize, step_tokenize_bpe, step_tokenize_sentencepiece, step_tokenize_wordpiece, step_tokenmerge, step_untokenize, step_word_embeddings, tidy, tokenlist, tunable
Dependencies: class, cli, clock, codetools, cpp11, data.table, diagram, digest, dplyr, fansi, future, future.apply, generics, globals, glue, gower, hardhat, ipred, KernSmooth, lattice, lava, lifecycle, listenv, lubridate, magrittr, MASS, Matrix, nnet, numDeriv, parallelly, pillar, pkgconfig, prodlim, progressr, purrr, R6, Rcpp, recipes, rlang, rpart, shape, SnowballC, sparsevctrs, SQUAREM, stringi, stringr, survival, tibble, tidyr, tidyselect, timechange, timeDate, tokenizers, tzdb, utf8, vctrs, withr
Cookbook - Using more complex recipes involving text
Rendered from cookbook---using-more-complex-recipes-involving-text.Rmd using knitr::rmarkdown on Jan 24 2025.
Last update: 2024-11-09
Started: 2018-11-04
Under the hood - tokenlist
Rendered from tokenlist.Rmd using knitr::rmarkdown on Jan 24 2025.
Last update: 2024-04-01
Started: 2020-04-08
Working with n-grams
Rendered from Working-with-n-grams.Rmd using knitr::rmarkdown on Jan 24 2025.
Last update: 2024-04-01
Started: 2020-04-08