No articles match
Getting started with tidyclust2 months ago
Introduction | The tidyclust workflow | K-means example | 1. Create a specification | 2. Fit to data | 3. Extract results | 4. Evaluate | Hierarchical clustering example | Tidymodels integration | Next steps
Working with Tuning Parameters3 months ago
Tuning Parameters | Parameter Objects | Numeric Parameters | Discrete Parameters | Creating Novel Parameters | Unknown Values | Parameter Sets | Parameter Grids
Parallel tree evaluation in databases4 months ago
The problem | The solution | Batched summation | Example | Supported models | Output behavior | When to use | Good candidates for separate_trees = TRUE | When to stick with the default | Tradeoffs | Benchmarking recommendation
Supported Models and recipes steps4 months ago
Supported models | Recipes steps | tailor adjustments
catboost models4 months ago
tidypredict_ functions | Supported objectives | Regression objectives (identity transform) | Binary classification (sigmoid transform) | Multiclass classification | Binary classification example | Multiclass classification example | Categorical features | With parsnip/bonsai (recommended) | With raw CatBoost | Parse model spec | Limitations
Cubist models4 months ago
tidypredict_ functions | Parse model spec | Limitations
Float precision at split boundaries4 months ago
The issue | Which models are affected? | Example | What tidypredict does | Pros and cons | Recommendations
LightGBM models4 months ago
tidypredict_ functions | Supported objectives | Regression objectives (identity transform) | Regression objectives (exp transform) | Binary classification (sigmoid transform) | Multiclass classification | Binary classification example | Categorical features | parsnip | Parse model spec | Limitations
XGBoost models4 months ago
tidypredict_ functions | parsnip | Parse model spec
How tidypredict generates tree formulas4 months ago
Nested vs flat case_when | Flat case_when (old approach) | Nested case_when (current approach) | Why nested is better | Parsed model versions
Decision trees, using rpart4 months ago
How it works | Under the hood | Classification | parsnip | Categorical predictors | Surrogate splits
Metric types6 months ago
Example | Metrics
Understanding lime7 months ago
How lime explains stuff | How to permute an observation | Calculating similarities with permutations | Selecting the features to use | Fitting a model to the permuted and feature-reduced data | An example - Tabular Data | An example - text data | Interactive text model explanations | Session Info
Available axe methods7 months ago
butcher7 months ago
Introduction to parsnip7 months ago
Motivation | Placeholders for Parameters | Specifying Arguments | Process
Random Forest7 months ago
How it works | Under the hood | parsnip
Random Forest, using Ranger7 months ago
How it works | Under the hood | parsnip
glmnet models8 months ago
tidypredict_ functions | parsnip | Parse model spec
Linear Regression8 months ago
Highlights & Limitations | How it works | Prediction intervals | Under the hood | How it performs | parsnip
Workflow Stages11 months ago
Pre-processing | Model Fitting | Post-processing
Scoring via random forests11 months ago
Score class objects | A scoring example — random forest | Hyperparameter tuning | Seamless argument support | A scoring example — conditional random forest | An scoring example — oblique random forest | Available objects and engines
Introduction to filtro11 months ago
A scoring example | Filtering and ranking | A filtering exmple for score singular | A filtering example for scores plural | Available score objects and filter methods
Tidy t-Tests with infer1 years ago
Introduction | 1-Sample t-Test | 2-Sample t-Test
Get Started1 years ago
Generalized estimator equations (GEE) | Generalized least squares | Linear mixed effects via lme | Models using lmer, glmer, and stan_glmer | Using tidymodels workflows | Other tips
Equivocal zones1 years ago
Where does probably fit in?1 years ago
Introduction | Example
Full infer Pipeline Examples1 years ago
Introduction | Hypothesis tests | One numerical variable (mean) | One numerical variable (standardized mean $t$) | One numerical variable (median) | One numerical variable (paired) | One categorical (one proportion) | One categorical variable (standardized proportion $z$) | Two categorical (2 level) variables | Two categorical (2 level) variables (z) | One categorical (>2 level) - GoF | Two categorical (>2 level): Chi-squared test of independence | One numerical variable, one categorical (2 levels) (diff in means) | One numerical variable, one categorical (2 levels) (t) | One numerical variable, one categorical (2 levels) (diff in medians) | One numerical, one categorical (>2 levels) - ANOVA | Two numerical vars - SLR | Two numerical vars - correlation | Two numerical vars - SLR (t) | Multiple explanatory variables | Confidence intervals | One numerical (one mean) | One numerical (one mean - standardized) | Two categorical variables (diff in proportions) | Two categorical variables (z) | Two numerical vars - t
Getting to Know infer1 years ago
Introduction | specify(): Specifying Response (and Explanatory) Variables | hypothesize(): Declaring the Null Hypothesis | generate(): Generating the Null Distribution | calculate(): Calculating Summary Statistics | Other Utilities | Theoretical Methods | Multiple regression | Conclusion
Tidy ANOVA (Analysis of Variance) with infer1 years ago
Tidy Chi-Squared Tests with infer1 years ago
Introduction | Test of Independence | Goodness of Fit
Tidy inference for paired data1 years ago
Introduction
Using tags1 years ago
Available methods1 years ago
broom and dplyr1 years ago
Introduction to broom1 years ago
broom: let's tidy up a bit | Tidying functions | Other Examples | Generalized linear and non-linear models | Hypothesis testing | Conventions | All functions | tidy functions | augment functions | glance functions
Tidy bootstrapping1 years ago
Evaluating different predictor sets1 years ago
Classification Models With stacks1 years ago
Defining candidate ensemble members | Putting together a stack
Getting Started With stacks1 years ago
Define candidate ensemble members | Putting together a stack | Fit the stack
Introduction to bonsai1 years ago
Grouping behavior in yardstick1 years ago
Group-awareness | Groupwise metrics
Multiclass averaging1 years ago
Introduction | Macro averaging | Micro averaging | Specialized multiclass implementations
Handling categorical predictors1 years ago
Creating Dummy Variables | Interactions with Dummy Variables | Warning! | Getting All of the Indicator Variables | Novel Levels | Other Steps Related to Dummy Variables
Introduction to recipes1 years ago
An Example | An Initial Recipe | Preprocessing Steps | Checks
On skipping steps1 years ago
Example: Class Imbalance Sampling and Skipping Steps | Other Examples | How To Skip Steps | Be Careful!
Roles in recipes1 years ago
The Formula Method | The Non-Formula Interface | Role Inheritance
Selecting variables1 years ago
Cookbook - Using more complex recipes involving text1 years ago
Counting select words | Removing words in addition to the stop words list | Letter distributions | TF-IDF of ngrams of stemmed tokens
Under the hood - tokenlist1 years ago
tokens attribute | lemma and pos attributes
Working with n-grams1 years ago
Only using step_tokenize() | Using step_tokenize() and step_ngram()
Common Resampling Patterns1 years ago
Random Resampling | Initial Splits | V-Fold Cross-Validation | Monte-Carlo Cross-Validation | Bootstrap Resampling | Validation Set | Stratified Resampling | Grouped Resampling | Time-Based Resampling
Introduction to rsample1 years ago
Terminology | rset Objects Contain Many Resamples | Individual Resamples are rsplit Objects
Working with resampling sets1 years ago
Introduction | Model Assessment | Using the Bootstrap to Make Comparisons | Bootstrap Estimates of Model Coefficients | Keeping Tidy
Creating Modeling Packages With hardhat1 years ago
Introduction | What's Our Model? | Model Fitting | Model Constructor | Model Fitting Implementation | Model Fitting Bridge | User Facing Fitting Function | Adding an Intercept Option | Model Prediction | Prediction Implementation | Prediction Bridge | User Facing Prediction Function | Final Testing
Forging data for predictions1 years ago
Introduction | Connection with mold() | Outcomes | Validation | Column existence | Column types | Lossless conversion | Recipes and forge() | A note on recipes
Molding data for modeling1 years ago
Introduction | A First Example | blueprints | Formulas | Intercepts | Dummy variables | Multivariate outcomes | XY | Vector outcomes | Recipe
Writing new tidier methods1 years ago
Linear regression models3 years ago
Intro | Example setup | Model inside the database | Categorical variables | Multiple linear regression | Interactions | Full example
Database write-back3 years ago
Example setup | Model preparation | Scenario 1 - Update scores | Scenario 2- Append new scores
Generalized Linear Regression3 years ago
Highlights & Limitations | How it works | Under the hood | How it performs
MARS models via the earth package3 years ago
tidypredict_ functions | GLM models | parsnip | Parse model spec
K Means models3 years ago
Intro | Example setup | Running Kmeans clustering | Under the hood | Safeguards for long running jobs
Adding models to butcher3 years ago
Recap
Using spatial resamples for analysis3 years ago
Introduction to agua4 years ago
Introduction | Fitting models with the 'h2o' engine
Create a regression spec - version 24 years ago
Create a tree spec - version 24 years ago
Non-R Models4 years ago
python example | Read in R | tidypredict | broom
Save and re-load models4 years ago
Parse model | Saving the model | Re-load the model | broom
Introduction to tidyposterior4 years ago
Ordering of steps4 years ago
Transforming a variable | Handling levels in categorical data | Dummy variables | Recommended preprocessing outline
Using corrr6 years ago
Why a correlation data frame? | API | Other Resources
Using corrr with databases6 years ago
An example | sparklyr | Limitations
kmeans with dplyr and broom6 years ago
Tidying k-means clustering