1  Introduction

1.1 Tidymodels

  • One of the earlier initiatives to create a framwork for ML tasks in R was the caret package, led by Max Kuhn, which unified many modeling tools and provided support for preprocessing, resampling, and parameter tuning. Caretwas an early and widely-used framework that provided tools for preprocessing, resampling, and cross-validation.

  • Building on this foundation, Kuhn partnered with Hadley Wickham, the creator of the tidyverse, to introduce the tidymodels ecosystem in 2020: a modern, modular collection of R packages that applies tidyverse principles to make machine learning workflows more intuitive, readable, and consistent.

Some of the core packages under `tidymodels` framework https://www.tidymodels.org
core package function
provides infrastructure for efficient data splitting and resampling
parsnip is a tidy, unified interface to models that can be used to try a range of models without getting bogged down in the syntactical minutiae of the underlying packages
recipes is a tidy interface to data pre-processing tools for feature engineering
workflows bundle your pre-processing, modeling, and post-processing together
tune helps you optimize the hyperparameters of your model and pre-processing steps
yardstick measures the effectiveness of models using performance metrics

References