<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:atom="http://www.w3.org/2005/Atom" 
      xmlns:media="http://search.yahoo.com/mrss/" 
      xmlns:content="http://purl.org/rss/1.0/modules/content/" 
      xmlns:dc="http://purl.org/dc/elements/1.1/" 
      version="2.0">
<channel>
<title>Simon P. Couch</title>
<link>https://simonpcouch.com/blog/</link>
<atom:link href="https://simonpcouch.com/blog/index.xml" rel="self" type="application/rss+xml"/>
<description>A data science blog</description>
<generator>quarto-1.9.36</generator>
<lastBuildDate>Thu, 16 Apr 2026 05:00:00 GMT</lastBuildDate>
<item>
  <title>LLMs running on my laptop can drive coding agents now</title>
  <link>https://simonpcouch.com/blog/2026-04-16-local-agents-2/</link>
  <description><![CDATA[ In December, I wrote a post called “<a href="https://www.simonpcouch.com/blog/2025-12-04-local-agents/">Local models are not there (yet)</a>.” It concluded like so: ]]></description>
  <guid>https://simonpcouch.com/blog/2026-04-16-local-agents-2/</guid>
  <pubDate>Thu, 16 Apr 2026 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2026-04-16-local-agents-2/featured.png" medium="image" type="image/png" height="144" width="144"/>
</item>
<item>
  <title>How Posit AI’s Next Edit Suggestions work</title>
  <link>https://simonpcouch.com/blog/2026-03-06-nes/</link>
  <description><![CDATA[ I just loaded some data from the <a href="https://simonpcouch.github.io/forested/">forested package</a> into my R environment. It has a bunch of measurements of forest attributes across Washington State: ]]></description>
  <guid>https://simonpcouch.com/blog/2026-03-06-nes/</guid>
  <pubDate>Fri, 06 Mar 2026 06:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2026-03-06-nes/featured.png" medium="image" type="image/png" height="141" width="144"/>
</item>
<item>
  <title>Introducing Posit AI</title>
  <link>https://simonpcouch.com/blog/2026-03-05-posit-ai/</link>
  <description><![CDATA[ Today we released <a href="https://posit.co/products/ai">Posit AI</a>, an AI service for data scientists. This was a huge effort that spanned many teams over several months, and I’m really excited to have it out in the world. It’s <em>really good.</em> ]]></description>
  <guid>https://simonpcouch.com/blog/2026-03-05-posit-ai/</guid>
  <pubDate>Thu, 05 Mar 2026 06:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2026-03-05-posit-ai/featured.png" medium="image" type="image/png" height="146" width="144"/>
</item>
<item>
  <title>Electricity use of AI coding agents</title>
  <link>https://simonpcouch.com/blog/2026-01-20-cc-impact/</link>
  <description><![CDATA[ Throughout 2025, we got better estimates of electricity and water use of AI chatbots. There are all sorts of posts I could cite on this topic, but a favorite is <a href="https://www.sustainabilitybynumbers.com/p/ai-footprint-august-2025">this blog post</a> from Our World in Data’s Hannah Ritchie. On the electricity front: ]]></description>
  <guid>https://simonpcouch.com/blog/2026-01-20-cc-impact/</guid>
  <pubDate>Tue, 20 Jan 2026 06:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2026-01-20-cc-impact/featured.png" medium="image" type="image/png" height="144" width="144"/>
</item>
<item>
  <title>chores 0.3.0 and local LLMs</title>
  <link>https://simonpcouch.com/blog/2025-12-10-chores-0-3-0/</link>
  <description><![CDATA[ The tl;dr: ]]></description>
  <guid>https://simonpcouch.com/blog/2025-12-10-chores-0-3-0/</guid>
  <pubDate>Wed, 10 Dec 2025 06:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-12-10-chores-0-3-0/featured.png" medium="image" type="image/png" height="144" width="144"/>
</item>
<item>
  <title>Local models are not there (yet)</title>
  <link>https://simonpcouch.com/blog/2025-12-04-local-agents/</link>
  <description><![CDATA[ I understand the appeal of local models. Using coding agents like Claude Code or Codex, it’s not difficult to rack up a hundred dollars of usage in the course of a work week. Besides the price, if you’re working with sensitive IP or confidential data, you need to really believe that providers like Anthropic and OpenAI can be trusted with your data. And then, there’s evil billionaires. What if you could run models that were nearly as good on your own laptop? ]]></description>
  <guid>https://simonpcouch.com/blog/2025-12-04-local-agents/</guid>
  <pubDate>Thu, 04 Dec 2025 06:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-12-04-local-agents/featured.png" medium="image" type="image/png" height="144" width="144"/>
</item>
<item>
  <title>Analyzing my music listening data with Databot</title>
  <link>https://simonpcouch.com/blog/2025-12-03-wrapped-databot/</link>
  <description><![CDATA[ It’s Spotify Wrapped season, which means that everyone I follow on instagram is posting screenshots on their stories and I need to export my iTunes Library metadata as an .xml file and analyze it with the tidyverse. (If you’re new here, I do a little <code>group_by() %&gt;% summarize()</code> on my own music listening data <a href="https://www.simonpcouch.com/blog/2023-11-30-listening-2023/">each</a> <a href="https://www.simonpcouch.com/blog/2022-12-01-listening-2022/">year</a>.) ]]></description>
  <guid>https://simonpcouch.com/blog/2025-12-03-wrapped-databot/</guid>
  <pubDate>Wed, 03 Dec 2025 06:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-12-03-wrapped-databot/featured.png" medium="image" type="image/png" height="144" width="144"/>
</item>
<item>
  <title>When plotting, LLMs see what they expect to see</title>
  <link>https://simonpcouch.com/blog/2025-11-26-bluffbench/</link>
  <description><![CDATA[ <em>This post is a cross-post of a <a href="https://posit.co/blog/introducing-bluffbench/">post</a> on the Posit Blog, co-written with Sara Altman.</em> ]]></description>
  <guid>https://simonpcouch.com/blog/2025-11-26-bluffbench/</guid>
  <pubDate>Wed, 26 Nov 2025 06:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/assets/blank.png" medium="image" type="image/png" height="1" width="1"/>
</item>
<item>
  <title>side::kick(), a coding agent for RStudio</title>
  <link>https://simonpcouch.com/blog/2025-11-11-sidekick/</link>
  <description><![CDATA[ I’m excited to share <a href="https://simonpcouch.github.io/side/"><code>side::kick()</code></a>, an experimental coding agent for RStudio users, built entirely in R. It can interact with your files, talk to your active R session, and run code. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-11-11-sidekick/</guid>
  <pubDate>Tue, 11 Nov 2025 06:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-11-11-sidekick/featured.png" medium="image" type="image/png" height="145" width="144"/>
</item>
<item>
  <title>I’m… writing a newsletter?</title>
  <link>https://simonpcouch.com/blog/2025-10-08-newsletter/</link>
  <description><![CDATA[ Between Positron Assistant, Databot, ellmer, chatlas, and their offshoots, there’s been a lot of LLM-related news coming out of Posit in 2025. Many folks across the organization felt that it was hard to keep up with new developments in the space, both internally and in the wider world. With this in mind, my colleague <a href="https://www.linkedin.com/in/sarakaltman/">Sara Altman</a> and I started working on an internal newsletter in June; in each biweekly issue, we’d give the most concise possible synopsis of movements within the company as well as the news from the wider space that we were paying the closest attention to. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-10-08-newsletter/</guid>
  <pubDate>Wed, 08 Oct 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-10-08-newsletter/featured.png" medium="image" type="image/png" height="141" width="144"/>
</item>
<item>
  <title>I was wrong about tidymodels and LLMs</title>
  <link>https://simonpcouch.com/blog/2025-08-26-predictive/</link>
  <description><![CDATA[ One of my most visceral memories of my first interactions with LLMs was asking that first release of ChatGPT in late 2022 to write code to fit a linear regression with tidymodels. The model hallucinated a function <code>tidymodels::install_tidymodels()</code> again and again. That function does not exist. If it did, there’d be some serious chicken and egg happening. This thing was goofy. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-08-26-predictive/</guid>
  <pubDate>Tue, 26 Aug 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-08-26-predictive/featured.png" medium="image" type="image/png" height="143" width="144"/>
</item>
<item>
  <title>R and the Model Context Protocol</title>
  <link>https://simonpcouch.com/blog/2025-08-14-mcptools/</link>
  <description><![CDATA[ This is a <a href="https://www.tidyverse.org/blog/2025/07/mcptools-0-1-0/">cross-post</a> from tidyverse.org. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-08-14-mcptools/</guid>
  <pubDate>Thu, 14 Aug 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-08-14-mcptools/featured.png" medium="image" type="image/png" height="143" width="144"/>
</item>
<item>
  <title>How I’m using Claude Code to write R code</title>
  <link>https://simonpcouch.com/blog/2025-07-17-claude-code-2/</link>
  <description><![CDATA[ A couple months ago, I <a href="https://www.simonpcouch.com/blog/2025-03-26-claude-code/">wrote a bit</a> about how I was using Claude Code to help me write R code. At the time, I mostly just shared my impressions of working with the tool and some prompting tips. In the month or two after I wrote the post, my usage waned; I was mostly back to using LLMs only for shorter, more narrowly-scoped tasks. A few weeks ago, though, we put together some tooling that has helped me get much more out of the tool and thus made me interested in using it more often again. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-07-17-claude-code-2/</guid>
  <pubDate>Thu, 17 Jul 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-07-17-claude-code-2/featured.png" medium="image" type="image/png" height="146" width="144"/>
</item>
<item>
  <title>Kimi K2 and R Coding</title>
  <link>https://simonpcouch.com/blog/2025-07-14-kimi-k2/</link>
  <description><![CDATA[ It was a hoot and a half of a weekend in the LLM world. A company I hadn’t heard of called Moonshot AI released a model called <a href="https://moonshotai.github.io/Kimi-K2/">Kimi K2</a>. From 30,000 feet: ]]></description>
  <guid>https://simonpcouch.com/blog/2025-07-14-kimi-k2/</guid>
  <pubDate>Mon, 14 Jul 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-07-14-kimi-k2/featured.png" medium="image" type="image/png" height="142" width="144"/>
</item>
<item>
  <title>Claude 4 and R Coding</title>
  <link>https://simonpcouch.com/blog/2025-05-27-claude-4/</link>
  <description><![CDATA[ <a href="https://www.anthropic.com/news/claude-4">Claude 4</a> dropped on Thursday! Given that Claude 3.7 Sonnet is my daily driver LLM for R coding, I’ve been excited to poke at it. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-05-27-claude-4/</guid>
  <pubDate>Tue, 27 May 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-05-27-claude-4/featured.png" medium="image" type="image/png" height="125" width="144"/>
</item>
<item>
  <title>Evaluating Gemini 2.5 Flash on R coding tasks</title>
  <link>https://simonpcouch.com/blog/2025-05-21-gemini-2-5-flash/</link>
  <description><![CDATA[ Google’s preview of their Gemini 2.5 Pro model has <a href="https://www.simonpcouch.com/blog/2025-05-07-gemini-2-5-pro-new/">really made a splash</a>. The model has become many folks’ daily driver, and I’ve started to see “What about Gemini?” in the comments of each of these blog posts if they don’t explicitly call out the model series in the title. Yesterday, Google announced an update of the preview for Gemini 2.5 Flash, a smaller and cheaper version of 2.5 Pro. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-05-21-gemini-2-5-flash/</guid>
  <pubDate>Wed, 21 May 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-05-21-gemini-2-5-flash/featured.png" medium="image" type="image/png" height="144" width="144"/>
</item>
<item>
  <title>Evaluating the new Gemini 2.5 Pro update on R coding</title>
  <link>https://simonpcouch.com/blog/2025-05-07-gemini-2-5-pro-new/</link>
  <description><![CDATA[ The title line of <a href="https://developers.googleblog.com/en/gemini-2-5-pro-io-improved-coding-performance/">Google’s release post</a> on the newest Gemini 2.5 Pro release is “even better coding performance.” Reading this, I was curious whether we’d see a notable increase in performance compared to the last generation on R coding tasks; in <a href="https://www.simonpcouch.com/blog/2025-04-01-gemini-2-5-pro/">an earlier post</a>, I saw that the March release of Gemini 2.5 Pro was a contender with Claude 3.7 Sonnet on <em>An R Eval</em>, a dataset of challenging R coding problems. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-05-07-gemini-2-5-pro-new/</guid>
  <pubDate>Wed, 07 May 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-05-07-gemini-2-5-pro-new/featured.png" medium="image" type="image/png" height="140" width="144"/>
</item>
<item>
  <title>Evaluating o3 and o4-mini on R coding performance</title>
  <link>https://simonpcouch.com/blog/2025-04-18-o3-o4-mini/</link>
  <description><![CDATA[ 48 hours after releasing the GPT 4.1 series of models, a trio of non-reasoning models focused on “real-world developer needs,” OpenAI dropped another set of models, o3 and o4-mini. These two models are the latest generation of thinking models from OpenAI, and they form the backbone of <a href="https://github.com/openai/codex">Codex</a>, a new Claude Code competitor from OpenAI. In short, OpenAI wants market share among developers. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-04-18-o3-o4-mini/</guid>
  <pubDate>Fri, 18 Apr 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-04-18-o3-o4-mini/featured.png" medium="image" type="image/png" height="144" width="144"/>
</item>
<item>
  <title>How good are the GPT 4.1 models at writing R code?</title>
  <link>https://simonpcouch.com/blog/2025-04-15-gpt-4-1/</link>
  <description><![CDATA[ Yesterday, OpenAI dropped <a href="https://openai.com/index/gpt-4-1/">a new series of models</a> called GPT 4.1, 4.1 mini, and GPT 4.1 nano. This line from their release post, specifically, caught my eye: ]]></description>
  <guid>https://simonpcouch.com/blog/2025-04-15-gpt-4-1/</guid>
  <pubDate>Tue, 15 Apr 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-04-15-gpt-4-1/featured.png" medium="image" type="image/png" height="142" width="144"/>
</item>
<item>
  <title>Introducing chores</title>
  <link>https://simonpcouch.com/blog/2025-04-11-chores/</link>
  <description><![CDATA[ The following is a cross-post of a post I put together for the Posit Blog; you can read that post <a href="https://posit.co/blog/introducing-chores/">here</a>. ]]></description>
  <guid>https://simonpcouch.com/blog/2025-04-11-chores/</guid>
  <pubDate>Fri, 11 Apr 2025 05:00:00 GMT</pubDate>
  <media:content url="https://simonpcouch.com/blog/2025-04-11-chores/featured.png" medium="image" type="image/png" height="144" width="144"/>
</item>
</channel>
</rss>
