登录查看更多内容

Tracking Progress in R

Samantha Bell

Veterinary Data Analysis | Dashboards & Reporting | LVT | E-commerce | Bioinformatics

发布日期: 2021年3月15日

It sure does seem like "a watched pot never boils" when waiting for loops or mapped functions to complete many iterations. Adding a progress bar or other indicator to your code can give the user some peace of mind - making the wait seem more reasonable.

Clear expectations = Happy users

With analyses that are frequently repeated, the input data can vary in size. This means that we might not be able to predict how long a particular manipulation may take. Similarly, when writing new code, it is nice to get an indication of the expected processing time of a loop or function call.

Seeing a line of code processing but not having any measure of progress makes time slow... to... a... crawl.

Let's take a simple for loop, where we run through every row of a matrix. For the sake of simplicity, we will just have our action be a short pause for each iteration:

m <- 1:150 # Possible lengths the data might have
data <- matrix(rnorm(sample(m, 1), 1, .5)) # Random data matrix


for(i in 1:dim(data)[1]){
  Sys.sleep(0.2) # the action
}

Depending on how many rows the data has at any time and how much of a lift the action is, this can take longer or shorter. We have no way to see what iteration we are in or how much remains as the code runs.

Option 1: cat

cat() can be used to print from within a loop (Look here or here if not familiar with cat). This can let us know many things, such as the current contents of a variable, the number of the iteration, or even the percent completion.

Printing a dot (.) each iteration gives you a sign how quickly your code is running and that it is working, but does not say how much time is left. However, this can be useful if you print n item each iteration and want to see the progress for that specific variable, file, etc.:

for(i in 1:dim(data)[1]){
   cat("Now working on", i, ".")
   for(j in 1:10){
      Sys.sleep(0.2) # the action
      cat(".")
   }
}

Option 2: cat with modulus

Using a modulus (explanation here) will let us print our progress every 10 iterations instead of each time. If you have a large number of iterations, try every 100 or every 1000 iterations.

for(i in 1:dim(data)[1]){
  if(i %% 10==0) {cat(round((i/dim(data)[1])*100, digits=0), "% completed...")} 
  Sys.sleep(0.2) # the action
}
cat("Done!")

Option 3: progress bar

Progress bars (available in base R as shown here) can be used with or without a modulus and will grow in size as progress is made. The percent of completion shows at the right side of the bar.

The bar must be initialized with a min and max for the size of the chunks (a larger number means more small additions to the bar as it loads), and comes in a 3 styles.

We control the progress of the bar by setTxtProgressBar() in the loop. The first item is the bar you created, the second how you are counting your iterations (in this case, i).

# Number of iterations
imax<-c(10)
# Initiate the bar
bar <- txtProgressBar(min = 0, max = imax, style = 3)


for(i in 1:dim(data)[1]){
  Sys.sleep(0.2) # the action
   # Update the progress bar
   setTxtProgressBar(bar, i)
}

Where does it belong?

Place your progress bar, cat(), or other indicator at a spot in your loop where it will be progressed once for each time the main action is run.

Most of the time, this will be at the end of the outer loop, before the closing bracket. In the case of our cat statement where we wanted to add dots to each thing that was done to an item in the main loop, we placed the progress indicator in a nested loop. Play around with placement to get progress displayed in the way that is most meaningful to your situation!

HAPPY PROGRAMMING!

要查看或添加评论，请登录

Samantha Bell的更多文章

Standardize and clean those phone numbers using the new CleanPhoneNumbers R package!

2022年2月18日

Standardize and clean those phone numbers using the new CleanPhoneNumbers R package!

Have some dirty phone numbers in your data? This package can help! THE TASK Many data analysts will encounter projects…
Grow your plot expertise in R with drag-and-drop from esquisse

2021年12月13日

Grow your plot expertise in R with drag-and-drop from esquisse

Ever felt overwhelmed by ggplot? Are you unsure of how to get started with building your own visuals in R? The esquisse…
UPDATE - Cleaning addresses (with a new package)

2021年6月8日

UPDATE - Cleaning addresses (with a new package)

If you have made use of code to simplify, clean, geocode, or round address coordinates, this package may be the one for…

2 条评论
Freshen up - Update your R version and packages from within R Studio!

2021年5月25日

Freshen up - Update your R version and packages from within R Studio!

Is it time for an update? If you can't remember the last time you updated R, the answer is most likely, "yes". Noticing…
Spot the difference - comparing tables in R

2021年5月17日

Spot the difference - comparing tables in R

Ever wondered how to compare code output without looking over each row and column by hand? This handy use of…
Simplifying and Grouping Address Fields Using R

2021年2月15日

Simplifying and Grouping Address Fields Using R

Trying to group records by street address can be a daunting task. Although hotspot analyses are a key part of writing…

1 条评论
Tying it all together with stringr

2020年12月3日

Tying it all together with stringr

Manipulating strings and pulling patterns of text is a frequent coding task and can be a challenge. Among the many…
Exporting Multiple Pages to an Excel Workbook from R

2020年11月6日

Exporting Multiple Pages to an Excel Workbook from R

Reports exported from R language can become unwieldy as results quickly start to fill up your destination folders…
FUN FACT: find those duplicates!

2020年10月22日

FUN FACT: find those duplicates!

Using duplicated() in R I thought I would share this fun & helpful R function which can be used to easily find…

1 条评论
Understanding the Chronic Optimist in Your Life

2019年12月20日

Understanding the Chronic Optimist in Your Life

In a world becoming increasingly aware of everyday anxieties, those of us who approach life with perpetual optimism can…

See all articles

社区洞察

Algorithms

How do you handle duplicate elements in randomized quicksort?

Tracking Progress in R

Samantha Bell

Veterinary Data Analysis | Dashboards & Reporting | LVT | E-commerce | Bioinformatics

Clear expectations = Happy users

Option 1: cat

Option 2: cat with modulus

Option 3: progress bar

Where does it belong?

HAPPY PROGRAMMING!

Samantha Bell的更多文章

社区洞察

其他会员也浏览了

LeetCode 2790 (Hard). Maximum Number of Groups With Increasing Length. O(N logN). Math.

Smart Tricks with Parameter Packs and Fold Expressions

Business Logic Component [4 of 4]

What are closures in?Rust?

Array iteration in Rust

The SplitTo Function...

A Beginner's Tutorial on Implementing IEnumerable Interface and Understanding yield Keyword

Closure != Function pointer

Use of .then() function in Cypress

Example 16

Clear expectations = Happy users

Option 1: cat

Option 2: cat with modulus

Option 3: progress bar

Where does it belong?

HAPPY PROGRAMMING!

Samantha Bell的更多文章

Standardize and clean those phone numbers using the new CleanPhoneNumbers R package!

Grow your plot expertise in R with drag-and-drop from esquisse

UPDATE - Cleaning addresses (with a new package)

Freshen up - Update your R version and packages from within R Studio!

Spot the difference - comparing tables in R

Simplifying and Grouping Address Fields Using R

Tying it all together with stringr

Exporting Multiple Pages to an Excel Workbook from R

FUN FACT: find those duplicates!

Understanding the Chronic Optimist in Your Life

社区洞察

其他会员也浏览了

LeetCode 2790 (Hard). Maximum Number of Groups With Increasing Length. O(N logN). Math.

Smart Tricks with Parameter Packs and Fold Expressions

Business Logic Component [4 of 4]

What are closures in?Rust?

Array iteration in Rust

The SplitTo Function...

A Beginner's Tutorial on Implementing IEnumerable Interface and Understanding yield Keyword

Closure != Function pointer

Use of .then() function in Cypress

Example 16