Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
19 changes: 19 additions & 0 deletions TMwR.bib
Original file line number Diff line number Diff line change
Expand Up @@ -147,4 +147,23 @@ @book{bolstad2004
author={Bolstad, B},
year={2004},
publisher={University of California, Berkeley}
}


@article{Durrleman1989,
author = {Durrleman, S and Simon, R},
title = {Flexible regression models with cubic splines},
journal = {Statistics in Medicine},
volume = {8},
number = {5},
pages = {551-561},
year = {1989}
}


@book{kuhn20202,
title={Feature engineering and selection: A practical approach for predictive models},
author={Kuhn, M and Johnson, K},
year={2020},
publisher={CRC Press}
}
2 changes: 1 addition & 1 deletion _book/a-model-workflow.html
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@
<meta name="author" content="Max Kuhn" />


<meta name="date" content="2019-12-16" />
<meta name="date" content="2019-12-20" />

<meta name="viewport" content="width=device-width, initial-scale=1" />
<meta name="apple-mobile-web-app-capable" content="yes" />
Expand Down
2 changes: 1 addition & 1 deletion _book/a-tale-of-two-models.html
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@
<meta name="author" content="Max Kuhn" />


<meta name="date" content="2019-12-16" />
<meta name="date" content="2019-12-20" />

<meta name="viewport" content="width=device-width, initial-scale=1" />
<meta name="apple-mobile-web-app-capable" content="yes" />
Expand Down
2 changes: 1 addition & 1 deletion _book/a-tale-of-two-models.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@



# A tale of two models
# A tale of two models {#two-models}

(tentative title)

Expand Down
2 changes: 1 addition & 1 deletion _book/a-tidyverse-primer.html
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@
<meta name="author" content="Max Kuhn" />


<meta name="date" content="2019-12-16" />
<meta name="date" content="2019-12-20" />

<meta name="viewport" content="width=device-width, initial-scale=1" />
<meta name="apple-mobile-web-app-capable" content="yes" />
Expand Down
177 changes: 177 additions & 0 deletions _book/data-spending.html
Original file line number Diff line number Diff line change
@@ -0,0 +1,177 @@
<!DOCTYPE html>
<html xmlns="http://www.w3.org/1999/xhtml" lang="" xml:lang="">
<head>

<meta charset="utf-8" />
<meta http-equiv="X-UA-Compatible" content="IE=edge" />
<title>4 Spending our data | Tidy Modeling with R</title>
<meta name="description" content="Modeling of data is integral to science, business, politics, and many other aspects of our lives. The goals of this book are to: introduce neophytes to models and the tidyverse, demonstrate the tidymodels packages, and to outline good practices for the phases of the modeling process." />
<meta name="generator" content="bookdown 0.14 and GitBook 2.6.7" />

<meta property="og:title" content="4 Spending our data | Tidy Modeling with R" />
<meta property="og:type" content="book" />


<meta property="og:description" content="Modeling of data is integral to science, business, politics, and many other aspects of our lives. The goals of this book are to: introduce neophytes to models and the tidyverse, demonstrate the tidymodels packages, and to outline good practices for the phases of the modeling process." />
<meta name="github-repo" content="topepo/TMwR" />

<meta name="twitter:card" content="summary" />
<meta name="twitter:title" content="4 Spending our data | Tidy Modeling with R" />
<meta name="twitter:site" content="@topepos" />
<meta name="twitter:description" content="Modeling of data is integral to science, business, politics, and many other aspects of our lives. The goals of this book are to: introduce neophytes to models and the tidyverse, demonstrate the tidymodels packages, and to outline good practices for the phases of the modeling process." />


<meta name="author" content="Max Kuhn" />


<meta name="date" content="2019-12-21" />

<meta name="viewport" content="width=device-width, initial-scale=1" />
<meta name="apple-mobile-web-app-capable" content="yes" />
<meta name="apple-mobile-web-app-status-bar-style" content="black" />


<link rel="prev" href="two-models.html"/>
<link rel="next" href="model-metrics.html"/>
<script src="libs/jquery-2.2.3/jquery.min.js"></script>
<link href="libs/gitbook-2.6.7/css/style.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-table.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-bookdown.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-highlight.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-search.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-fontsettings.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-clipboard.css" rel="stylesheet" />












</head>

<body>



<div class="book without-animation with-summary font-size-2 font-family-1" data-basepath=".">

<div class="book-summary">
<nav role="navigation">

<ul class="summary">
<li><strong><a href="./">Tidy Modeling with R</a></strong></li>

<li class="divider"></li>
<li class="chapter" data-level="" data-path="index.html"><a href="index.html"><i class="fa fa-check"></i>Hello World</a></li>
<li class="chapter" data-level="1" data-path="introduction.html"><a href="introduction.html"><i class="fa fa-check"></i><b>1</b> Introduction</a><ul>
<li class="chapter" data-level="1.1" data-path="introduction.html"><a href="introduction.html#types-of-models"><i class="fa fa-check"></i><b>1.1</b> Types of models</a></li>
<li class="chapter" data-level="1.2" data-path="introduction.html"><a href="introduction.html#some-terminology"><i class="fa fa-check"></i><b>1.2</b> Some terminology</a></li>
<li class="chapter" data-level="1.3" data-path="introduction.html"><a href="introduction.html#model-phases"><i class="fa fa-check"></i><b>1.3</b> How does modeling fit into the data analysis/scientific process?</a></li>
<li class="chapter" data-level="1.4" data-path="introduction.html"><a href="introduction.html#begin-model-end"><i class="fa fa-check"></i><b>1.4</b> Where does the model begin and end?</a></li>
<li class="chapter" data-level="1.5" data-path="introduction.html"><a href="introduction.html#outline-of-future-chapters"><i class="fa fa-check"></i><b>1.5</b> Outline of future chapters</a></li>
</ul></li>
<li class="chapter" data-level="2" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html"><i class="fa fa-check"></i><b>2</b> A tidyverse primer</a><ul>
<li class="chapter" data-level="2.1" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#principles"><i class="fa fa-check"></i><b>2.1</b> Principles</a></li>
<li class="chapter" data-level="2.2" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#code"><i class="fa fa-check"></i><b>2.2</b> Code</a></li>
<li class="chapter" data-level="2.3" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#modeling-via-base-r"><i class="fa fa-check"></i><b>2.3</b> Modeling via base R</a></li>
<li class="chapter" data-level="2.4" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#why-tidiness-is-important-for-modeling"><i class="fa fa-check"></i><b>2.4</b> Why tidiness is important for modeling</a></li>
<li class="chapter" data-level="2.5" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#some-additional-tidy-principals-for-modeling."><i class="fa fa-check"></i><b>2.5</b> Some additional tidy principals for modeling.</a></li>
</ul></li>
<li class="chapter" data-level="3" data-path="two-models.html"><a href="two-models.html"><i class="fa fa-check"></i><b>3</b> A tale of two models</a></li>
<li class="chapter" data-level="4" data-path="data-spending.html"><a href="data-spending.html"><i class="fa fa-check"></i><b>4</b> Spending our data</a></li>
<li class="chapter" data-level="5" data-path="model-metrics.html"><a href="model-metrics.html"><i class="fa fa-check"></i><b>5</b> How good is our model?</a></li>
<li class="chapter" data-level="6" data-path="feature-engineering.html"><a href="feature-engineering.html"><i class="fa fa-check"></i><b>6</b> Feature engineering</a></li>
<li class="chapter" data-level="7" data-path="workflows.html"><a href="workflows.html"><i class="fa fa-check"></i><b>7</b> A model workflow</a></li>
<li class="chapter" data-level="8" data-path="resampling.html"><a href="resampling.html"><i class="fa fa-check"></i><b>8</b> Resampling for evaluating performance</a></li>
</ul>

</nav>
</div>

<div class="book-body">
<div class="body-inner">
<div class="book-header" role="navigation">
<h1>
<i class="fa fa-circle-o-notch fa-spin"></i><a href="./">Tidy Modeling with R</a>
</h1>
</div>

<div class="page-wrapper" tabindex="-1" role="main">
<div class="page-inner">

<section class="normal" id="section-">
<div id="data-spending" class="section level1">
<h1><span class="header-section-number">4</span> Spending our data</h1>
<p>General data splitting</p>
<p>Re-emphasize roles or different data sets and good/bad ways of doing things.</p>
<p>Validation sets.</p>
<p>What we do differently with a lot of data.</p>
<p>Allude to resampling.</p>

</div>
</section>

</div>
</div>
</div>
<a href="two-models.html" class="navigation navigation-prev " aria-label="Previous page"><i class="fa fa-angle-left"></i></a>
<a href="model-metrics.html" class="navigation navigation-next " aria-label="Next page"><i class="fa fa-angle-right"></i></a>
</div>
</div>
<script src="libs/gitbook-2.6.7/js/app.min.js"></script>
<script src="libs/gitbook-2.6.7/js/lunr.js"></script>
<script src="libs/gitbook-2.6.7/js/clipboard.min.js"></script>
<script src="libs/gitbook-2.6.7/js/plugin-search.js"></script>
<script src="libs/gitbook-2.6.7/js/plugin-sharing.js"></script>
<script src="libs/gitbook-2.6.7/js/plugin-fontsettings.js"></script>
<script src="libs/gitbook-2.6.7/js/plugin-bookdown.js"></script>
<script src="libs/gitbook-2.6.7/js/jquery.highlight.js"></script>
<script src="libs/gitbook-2.6.7/js/plugin-clipboard.js"></script>
<script>
gitbook.require(["gitbook"], function(gitbook) {
gitbook.start({
"sharing": false,
"fontsettings": {
"theme": "white",
"family": "sans",
"size": 2
},
"edit": {
"link": "https://github.com/topepo/TMwR-temp/edit/master/data-spending.Rmd",
"text": "Edit"
},
"history": {
"link": null,
"text": null
},
"download": null,
"toc": {
"collapse": "section"
}
});
});
</script>

<!-- dynamically load mathjax for compatibility with self-contained -->
<script>
(function () {
var script = document.createElement("script");
script.type = "text/javascript";
var src = "true";
if (src === "" || src === "true") src = "https://mathjax.rstudio.com/latest/MathJax.js?config=TeX-MML-AM_CHTML";
if (location.protocol !== "file:")
if (/^https?:/.test(src))
src = src.replace(/^https?:/, '');
script.src = src;
document.getElementsByTagName("head")[0].appendChild(script);
})();
</script>
</body>

</html>
2 changes: 1 addition & 1 deletion _book/data-spending.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@



# Spending our data
# Spending our data {#data-spending}

General data splitting

Expand Down
49 changes: 32 additions & 17 deletions _book/feature-engineering.html
Original file line number Diff line number Diff line change
Expand Up @@ -24,15 +24,15 @@
<meta name="author" content="Max Kuhn" />


<meta name="date" content="2019-12-16" />
<meta name="date" content="2019-12-21" />

<meta name="viewport" content="width=device-width, initial-scale=1" />
<meta name="apple-mobile-web-app-capable" content="yes" />
<meta name="apple-mobile-web-app-status-bar-style" content="black" />


<link rel="prev" href="how-good-is-our-model.html"/>
<link rel="next" href="a-model-workflow.html"/>
<link rel="prev" href="model-metrics.html"/>
<link rel="next" href="workflows.html"/>
<script src="libs/jquery-2.2.3/jquery.min.js"></script>
<link href="libs/gitbook-2.6.7/css/style.css" rel="stylesheet" />
<link href="libs/gitbook-2.6.7/css/plugin-table.css" rel="stylesheet" />
Expand Down Expand Up @@ -72,22 +72,23 @@
<li class="chapter" data-level="1" data-path="introduction.html"><a href="introduction.html"><i class="fa fa-check"></i><b>1</b> Introduction</a><ul>
<li class="chapter" data-level="1.1" data-path="introduction.html"><a href="introduction.html#types-of-models"><i class="fa fa-check"></i><b>1.1</b> Types of models</a></li>
<li class="chapter" data-level="1.2" data-path="introduction.html"><a href="introduction.html#some-terminology"><i class="fa fa-check"></i><b>1.2</b> Some terminology</a></li>
<li class="chapter" data-level="1.3" data-path="introduction.html"><a href="introduction.html#where-does-modeling-fit-into-the-data-analysisscientific-process"><i class="fa fa-check"></i><b>1.3</b> Where does modeling fit into the data analysis/scientific process?</a></li>
<li class="chapter" data-level="1.4" data-path="introduction.html"><a href="introduction.html#modeling-is-a-process-not-a-single-activity"><i class="fa fa-check"></i><b>1.4</b> Modeling is a <em>process</em>, not a single activity</a></li>
<li class="chapter" data-level="1.3" data-path="introduction.html"><a href="introduction.html#model-phases"><i class="fa fa-check"></i><b>1.3</b> How does modeling fit into the data analysis/scientific process?</a></li>
<li class="chapter" data-level="1.4" data-path="introduction.html"><a href="introduction.html#begin-model-end"><i class="fa fa-check"></i><b>1.4</b> Where does the model begin and end?</a></li>
<li class="chapter" data-level="1.5" data-path="introduction.html"><a href="introduction.html#outline-of-future-chapters"><i class="fa fa-check"></i><b>1.5</b> Outline of future chapters</a></li>
</ul></li>
<li class="chapter" data-level="2" data-path="a-tidyverse-primer.html"><a href="a-tidyverse-primer.html"><i class="fa fa-check"></i><b>2</b> A tidyverse primer</a><ul>
<li class="chapter" data-level="2.1" data-path="a-tidyverse-primer.html"><a href="a-tidyverse-primer.html#principles"><i class="fa fa-check"></i><b>2.1</b> Principles</a></li>
<li class="chapter" data-level="2.2" data-path="a-tidyverse-primer.html"><a href="a-tidyverse-primer.html#code"><i class="fa fa-check"></i><b>2.2</b> Code</a></li>
<li class="chapter" data-level="2.3" data-path="a-tidyverse-primer.html"><a href="a-tidyverse-primer.html#why-tidiness-is-important-for-modeling"><i class="fa fa-check"></i><b>2.3</b> Why tidiness is important for modeling</a></li>
<li class="chapter" data-level="2.4" data-path="a-tidyverse-primer.html"><a href="a-tidyverse-primer.html#some-additional-tidy-principals-for-modeling."><i class="fa fa-check"></i><b>2.4</b> Some additional tidy principals for modeling.</a></li>
<li class="chapter" data-level="2" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html"><i class="fa fa-check"></i><b>2</b> A tidyverse primer</a><ul>
<li class="chapter" data-level="2.1" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#principles"><i class="fa fa-check"></i><b>2.1</b> Principles</a></li>
<li class="chapter" data-level="2.2" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#code"><i class="fa fa-check"></i><b>2.2</b> Code</a></li>
<li class="chapter" data-level="2.3" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#modeling-via-base-r"><i class="fa fa-check"></i><b>2.3</b> Modeling via base R</a></li>
<li class="chapter" data-level="2.4" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#why-tidiness-is-important-for-modeling"><i class="fa fa-check"></i><b>2.4</b> Why tidiness is important for modeling</a></li>
<li class="chapter" data-level="2.5" data-path="tidyverse-primer.html"><a href="tidyverse-primer.html#some-additional-tidy-principals-for-modeling."><i class="fa fa-check"></i><b>2.5</b> Some additional tidy principals for modeling.</a></li>
</ul></li>
<li class="chapter" data-level="3" data-path="a-tale-of-two-models.html"><a href="a-tale-of-two-models.html"><i class="fa fa-check"></i><b>3</b> A tale of two models</a></li>
<li class="chapter" data-level="4" data-path="spending-our-data.html"><a href="spending-our-data.html"><i class="fa fa-check"></i><b>4</b> Spending our data</a></li>
<li class="chapter" data-level="5" data-path="how-good-is-our-model.html"><a href="how-good-is-our-model.html"><i class="fa fa-check"></i><b>5</b> How good is our model?</a></li>
<li class="chapter" data-level="3" data-path="two-models.html"><a href="two-models.html"><i class="fa fa-check"></i><b>3</b> A tale of two models</a></li>
<li class="chapter" data-level="4" data-path="data-spending.html"><a href="data-spending.html"><i class="fa fa-check"></i><b>4</b> Spending our data</a></li>
<li class="chapter" data-level="5" data-path="model-metrics.html"><a href="model-metrics.html"><i class="fa fa-check"></i><b>5</b> How good is our model?</a></li>
<li class="chapter" data-level="6" data-path="feature-engineering.html"><a href="feature-engineering.html"><i class="fa fa-check"></i><b>6</b> Feature engineering</a></li>
<li class="chapter" data-level="7" data-path="a-model-workflow.html"><a href="a-model-workflow.html"><i class="fa fa-check"></i><b>7</b> A model workflow</a></li>
<li class="chapter" data-level="8" data-path="resampling-for-evaluating-performance.html"><a href="resampling-for-evaluating-performance.html"><i class="fa fa-check"></i><b>8</b> Resampling for evaluating performance</a></li>
<li class="chapter" data-level="7" data-path="workflows.html"><a href="workflows.html"><i class="fa fa-check"></i><b>7</b> A model workflow</a></li>
<li class="chapter" data-level="8" data-path="resampling.html"><a href="resampling.html"><i class="fa fa-check"></i><b>8</b> Resampling for evaluating performance</a></li>
</ul>

</nav>
Expand Down Expand Up @@ -118,8 +119,8 @@ <h1><span class="header-section-number">6</span> Feature engineering</h1>
</div>
</div>
</div>
<a href="how-good-is-our-model.html" class="navigation navigation-prev " aria-label="Previous page"><i class="fa fa-angle-left"></i></a>
<a href="a-model-workflow.html" class="navigation navigation-next " aria-label="Next page"><i class="fa fa-angle-right"></i></a>
<a href="model-metrics.html" class="navigation navigation-prev " aria-label="Previous page"><i class="fa fa-angle-left"></i></a>
<a href="workflows.html" class="navigation navigation-next " aria-label="Next page"><i class="fa fa-angle-right"></i></a>
</div>
</div>
<script src="libs/gitbook-2.6.7/js/app.min.js"></script>
Expand Down Expand Up @@ -156,6 +157,20 @@ <h1><span class="header-section-number">6</span> Feature engineering</h1>
});
</script>

<!-- dynamically load mathjax for compatibility with self-contained -->
<script>
(function () {
var script = document.createElement("script");
script.type = "text/javascript";
var src = "true";
if (src === "" || src === "true") src = "https://mathjax.rstudio.com/latest/MathJax.js?config=TeX-MML-AM_CHTML";
if (location.protocol !== "file:")
if (/^https?:/.test(src))
src = src.replace(/^https?:/, '');
script.src = src;
document.getElementsByTagName("head")[0].appendChild(script);
})();
</script>
</body>

</html>
2 changes: 1 addition & 1 deletion _book/how-good-is-our-model.html
Original file line number Diff line number Diff line change
Expand Up @@ -24,7 +24,7 @@
<meta name="author" content="Max Kuhn" />


<meta name="date" content="2019-12-16" />
<meta name="date" content="2019-12-20" />

<meta name="viewport" content="width=device-width, initial-scale=1" />
<meta name="apple-mobile-web-app-capable" content="yes" />
Expand Down
2 changes: 1 addition & 1 deletion _book/how-good-is-our-model.md
Original file line number Diff line number Diff line change
@@ -1,7 +1,7 @@



# How good is our model?
# How good is our model? {#model-metrics}

(or how well does our model work? Superman does good; a model can work well)

Expand Down
Loading