登录查看更多内容

Ten random useful things you may not know about R

Keith McNulty

Leader in Technology, Science and Analytics | Mathematician, Statistician and Psychometrician | Author and Teacher | Coder, Engineer, Architect

发布日期: 2019年4月5日

Often I find myself telling my colleagues and fellow coders simple things that I use in R that really help me with the tasks that I need to progress on. These range from trivial shortcuts, to little known functions, to handy little tricks.

Because the R ecosystem is so rich and constantly growing, people can often miss out on knowing about something that can really help them in a task that they have to complete. So I often get a surprised reaction from my audience along the lines of ‘I never knew about that!’.

Here are ten things that make my life easier working in R. If you already know them all, sorry for wasting your reading time, and please consider adding a comment with something else that you find useful for the benefit of other readers.

1. The switch function

I LOVE switch(). It’s basically a convenient shortening of an if statement that chooses its value according to the value of another variable. I find it particularly useful when I am writing code that needs to load a different dataset according to a prior choice you make. For example, if you have a variable called animal and you want to load a different set of data according to whether animal is a dog, cat or rabbit you might write this:

data <- read.csv(
  switch(animal, 
         "dog" = "dogdata.csv", 
         "cat" = "catdata.csv",
         "rabbit" = "rabbitdata.csv")
)

This is particularly useful in Shiny apps where you might want to load different datasets or even environment files depending on one or more of the input menu choices.

2. RStudio shortcut keys

This is less an R hack and more about the RStudio IDE, but the shortcut keys available for common commands are super useful and can save a lot of typing time. My two favourite are Ctrl+Shift+M for the pipe operator %>% and Alt+- for the assignment operator <-. If you want to see a full set of these awesome shortcuts just type Atl+Shift+K in RStudio.

3. The flexdashboard package

If you want to get a quick Shiny dashboard up and running with a minimum of fuss, the flexdashboard package has everything you need. It provides simple HTML shortcuts that allow easy construction of sidebars and the organization of your display into rows and columns. It also has a super flexible title bar where you can organize your app into different pages and put in icons and links to Github code or an email address or whatever. As a package which operates within R Markdown it also allows you to keep all your app in one Rmd file rather than needing to break out out into separate server and UI files, like for example shinydashboard. I use flexdashboard whenever I need to create a simple prototype version of a dashboard before moving it on to more advanced design. I can often get dashboards up and running within an hour using flexdashboard.

4. The req and validate functions in R Shiny

R Shiny development can be frustrating, especially when you get generic error messages that don’t help you understand what is going wrong under the hood. As Shiny develops, more and more validation and testing functions are being added to help better diagnose and alert when specific errors occur. The req() function allows you to prevent an action from occurring unless a certain condition is satisfied, but does so silently and without displaying an error. So you can make the display of UI elements conditional on previous actions. For example, with reference to my example no 1 above:

output$go_button <- shiny::renderUI({
  
  # only display button if an animal input has been chosen  
  shiny::req(input$animal)

  # display button
  shiny::actionButton("go", 
                      paste("Conduct", input$animal, "analysis!") 
  )
})

validate() checks before rendering an output and enables you to return a tailored error message should a certain condition not be fulfilled, for example if the user uploaded the wrong file:

# get csv input file
inFile <- input$file1
data <- inFile$datapath

# render table only if it is dogs
shiny::renderTable({
  # check that it is the dog file, not cats or rabbits
  shiny::validate(
    need("Dog Name" %in% colnames(data)),
    "Dog Name column not found - did you load the right file?"
  )

  data
})

For more on these functions see my other article here.

5. Keeping all my credentials to myself using the system environment

If you are sharing code that requires login credentials to databases and the like, you can use the system environment to avoid posting those credentials to Github or other spaces where they might be at risk. You can put credentials as named environment variables in your R session, for example:

Sys.setenv(
  DSN = "database_name",
  UID = "User ID",
  PASS = "Password"
)

Then in your shared script, you can log in using these environment variables. For example:

db <- DBI::dbConnect(
  drv = odbc::odbc(),
  dsn = Sys.getenv("DSN"),
  uid = Sys.getenv("UID"),
  pwd = Sys.getenv("PASS")
)

Even more handy, if these are credentials you use frequently, you can set them as environment variables in your operating system, so that they will always be available to you whenever you are working in R, but you’ll never have to display them in your code. (Just be careful about who has access to these!)

6. Automate tidyverse styling with styler

It's been a tough day, you’ve had a lot on your plate. Your code isn’t as neat as you’d like and you don’t have time to line edit it. Fear not. The styler package has numerous functions to allow automatic restyling of your code to match tidyverse style. It’s a simple as running styler::style_file() on your messy script and it will do a lot (though not all) of the work for you.

7. Parameterizing R Markdown documents

So you write a lovely R Markdown document where you’ve analyzed a whole bunch of facts about dogs. And then you get told — ‘nah, I’m more interested in cats’. Never fear. You can automate a similar report about cats in just one command if you parameterize your R Markdown document.

You can do this by defining parameters in the YAML header of your R Markdown document, and giving each parameter a value. For example:

---
title: "Animal Analysis"
author: "Keith McNulty"
date: "21 March 2019"
output:
  html_document:
    code_folding: "hide"
params:
  animal_name:
    value: Dog
    choices:
      - Dog
      - Cat
      - Rabbit
  years_of_study:
    input: slider
    min: 2000
    max: 2019
    step: 1
    round: 1
    sep: ''value: [2010, 2017]
---

Now you can write these variables into the R code in your document as params$animal_name and params$years_of_study. If you knit your document as normal, it will knit with the devault values of these parameters as per the value variable. However, if you knit with parameters by selecting this option in RStudio’s Knit dropdown (or by using knit_with_parameters()), a lovely menu option appears for you to select your parameters before you knit the document. Awesome!

8. revealjs

revealjs is a package which allows you to create beautiful presentations in HTML with in intuitive slide navigation menu, with embedded R code. It can be used inside R Markdown and has very intuitive HTML shortcuts to allow you to create a nested, logical structure of pretty slides with a variety of styling options. The fact that the presentation is in HTML means that people can follow along on their tablets or phones as they listen to you speak, which is really handy. You can set up a revealjs presentation by installing the package and then calling it in your YAML header. Here’s an example YAML header of a talk I gave recently using revealjs

---
title: "Exporing the Edge of the People Analytics Universe"
author: "Keith McNulty"
output:
  revealjs::revealjs_presentation:
    center: yes
    template: starwars.html
    theme: black
date: "HR Analytics Meetup London - 18 March, 2019"
resource_files:
- darth.png
- deathstar.png
- hanchewy.png
- millenium.png
- r2d2-threepio.png
- starwars.html
- starwars.png
- stormtrooper.png
---

and here’s an example slide. You can find the code here and the presentation here.

9. HTML tags in R Shiny (eg play audio in your shiny app)

Most people don’t take full advantage of the HTML tags available in R Shiny. There are 110 tags which offer shortcuts to various HTML formatting and other commands. Recently I built a shiny app that took a long time to perform a task. Knowing that the user would likely multitask while waiting for it to complete, I used tags$audio to have the app play a victory fanfare to alert the user when the task was complete.

10. The praise package

Ridiculously simple but also awesome, the praise package delivers praise to users. While this can appear like pointless self-admiration, it's actually super useful in writing R packages where you can offer praise or encouragement to someone if they do something right, for example if a process completes successfully. You can also just put it at the end of a complicated script to give you that extra shot of happiness when it runs successfully.

I lead McKinsey's internal People Analytics and Measurement function. Originally I was a Pure Mathematician, then I became a Psychometrician. I am passionate about applying the rigor of both those disciplines to complex people questions. I'm also a coding geek and a massive fan of Japanese RPGs.

Jaclyn Sawyer

Community created and supported tech | Builder, Connector.

5 年

This was very useful, thanks!

William Foote

Decision Analytics

5 年

Awesome: code-folding has revolutionized my tutorials. Students still don’t seem to appreciate switch()! Much thanks

Arian J Evans

Results-driven leader @Amazon/Meta/multiple startups. 25 years launching Cybersecurity innovations zero-one, and scaling 8-figure revenue w/data-driven impact.

5 年

Great article. Hey Mickey L. - you might like this! ^

1 次回应

Andrés Acosta Escobar

Marketing AI & Analytics Executive | AI Strategy | $3M+ Revenue | Global Leader

5 年

Nice post, thanks.

Robert Sutor

Quantum Computing and AI, but not necessarily together: Tech Leader, Ph.D., Non-Executive Director, Author, Advisor, Pundit, Keynote Speaker, Professor, Cat Lover

5 年

I know that Python 3 is a far better programming language.

7 次回应

查看更多评论

要查看或添加评论，请登录

Keith McNulty的更多文章

A Somewhat Psychopathic Introduction to Bayesian Statistics

2025年1月6日

A Somewhat Psychopathic Introduction to Bayesian Statistics

I was recently taken aback to discover a question about bodies in a refrigerator on an old mathematics exam paper from…

8 条评论
Can a Horse Do Math? The Story Of Clever Hans and a Statistics Problem He Inspired

2024年12月17日

Can a Horse Do Math? The Story Of Clever Hans and a Statistics Problem He Inspired

In 1904, a panel convened by the German Board of Education decided that a horse was capable of doing math. Read that…

2 条评论
A Fun Introduction to the Concept of Bayesian Statistics

2024年11月25日

A Fun Introduction to the Concept of Bayesian Statistics

I recently came across this very hard looking and creatively articulated problem in an old entrance examination paper…

20 条评论
The Italian Origins of Imaginary Numbers

2024年9月23日

The Italian Origins of Imaginary Numbers

If you happened to be taking a stroll around Bologna or Milan in the mid-16th century, it’s possible you might have…

11 条评论
The Beauty of the Binomial Expansion

2024年8月28日

The Beauty of the Binomial Expansion

I’m going to take a sum of two terms a+b and I am going to square it. If you remember from your quadratic expansions…

7 条评论
My Top Tip for Tackling Tough Math Problems

2024年8月21日

My Top Tip for Tackling Tough Math Problems

I recently came across an algebra problem which doesn’t require any advanced math skills to solve, but still takes…

21 条评论
The Three Most Common Statistical Tests You Should Deeply Understand

2024年8月12日

The Three Most Common Statistical Tests You Should Deeply Understand

If, like me, you are not a fan of code formatting in LinkedIn articles, you can also read this article on Medium…

10 条评论
The Trick That Helps All Statisticians Survive

2024年8月6日

The Trick That Helps All Statisticians Survive

If, like me, you are not a fan of the code formatting in LinkedIn articles, you can view this article on Medium. I have…

8 条评论
How To Pipe Real-Time Info Into Your LLM Responses Using Tools

2024年7月31日

How To Pipe Real-Time Info Into Your LLM Responses Using Tools

If you don't want to deal with the poor code formatting in LinkedIn articles, you can also read this article via…

2 条评论
Two Fascinating Properties of the Fibonacci Sequence

2024年7月16日

Two Fascinating Properties of the Fibonacci Sequence

The Fibonacci sequence is a very well known and studied sequence of numbers which is often used in schools and in…

See all articles

Ten random useful things you may not know about R

Keith McNulty

Leader in Technology, Science and Analytics | Mathematician, Statistician and Psychometrician | Author and Teacher | Coder, Engineer, Architect

1. The switch function

2. RStudio shortcut keys

3. The flexdashboard package

4. The req and validate functions in R Shiny

5. Keeping all my credentials to myself using the system environment

6. Automate tidyverse styling with styler

7. Parameterizing R Markdown documents

8. revealjs

9. HTML tags in R Shiny (eg play audio in your shiny app)

10. The praise package

Keith McNulty的更多文章

社区洞察

其他会员也浏览了

constexpr if

Date - 29/1/2025 DSA Day 4: Understanding Time Complexity with Real-World Impact

"Why doesn't my code work?" — to anyone learning the art of programming and writing to the Stack Overflow community

Solve Leetcode like a pro

Understanding and Measuring Code Complexity: Big O Notation

Week of August 19th

Teach Your (Domain) Models How to Fish (And Scale)

Day-17: Form Handling and Validation in React.

Huffman Coding

What is Quarto? RStudio quietly rolls out next-generation R Markdown

1. The switch function

2. RStudio shortcut keys

3. The flexdashboard package

4. The req and validate functions in R Shiny

5. Keeping all my credentials to myself using the system environment

6. Automate tidyverse styling with styler

7. Parameterizing R Markdown documents

8. revealjs

9. HTML tags in R Shiny (eg play audio in your shiny app)

10. The praise package

Keith McNulty的更多文章

A Somewhat Psychopathic Introduction to Bayesian Statistics

Can a Horse Do Math? The Story Of Clever Hans and a Statistics Problem He Inspired

A Fun Introduction to the Concept of Bayesian Statistics

The Italian Origins of Imaginary Numbers

The Beauty of the Binomial Expansion

My Top Tip for Tackling Tough Math Problems

The Three Most Common Statistical Tests You Should Deeply Understand

The Trick That Helps All Statisticians Survive

How To Pipe Real-Time Info Into Your LLM Responses Using Tools

Two Fascinating Properties of the Fibonacci Sequence

社区洞察

其他会员也浏览了

constexpr if

Date - 29/1/2025 DSA Day 4: Understanding Time Complexity with Real-World Impact

"Why doesn't my code work?" — to anyone learning the art of programming and writing to the Stack Overflow community

Solve Leetcode like a pro

Understanding and Measuring Code Complexity: Big O Notation

Week of August 19th

Teach Your (Domain) Models How to Fish (And Scale)

Day-17: Form Handling and Validation in React.

Huffman Coding

What is Quarto? RStudio quietly rolls out next-generation R Markdown