Field Data and Reliability
This image comes from Dictionary of French Architecture from 11th to 16th Centruy (1856) by Eugene Viollet-le-Duc (1814-1879).

Field Data and Reliability

Importance the Field Data

Customers experience product failures. Understanding these failures that occur in the hands of customers is an essential undertaking. We need this information to identify increasing failure rates, component batch or assembly errors, or design mistakes.

Our work to design for reliability includes assumptions about customer expectations and use stresses. The field performance either validates our work or illuminates the errors. Our work to select suppliers and build a stable assembly processes attempts to identify the highest risk elements for reliability (and quality) with plenty of assumptions. The field performance again validates our work or illuminates the errors.

The field reliability performance impacts the business directly. Customer satisfaction, brand loyalty, and warranty expenses. The business objectives hinge of reliability performance.

Customers may expect product improvements even if they did not experience the failures personally. The internet provides many venues for customers to compare notes and discuss failures. Customers increasingly do not simply want a replacement they may demand an improved design instead.

The Nature of Field Data

This data is never perfect. It is better than anything we create in the lab though. Field data is actual data. It is a record of how the product performs for those using the product. All the expectations, stresses, and component variation are present. No sample sizes or confidence bounds necessary.

Part of issue is we the data from customers is noisy. We do not know exactly when the first turn on or use occurs after purchase. Nor do we know exactly and under what circumstances failure occurs. Often, we do not know the exact failure. Just that a customer reports a failure. Not all customers even provide a complaint or report.

Yet it is the best data we have available.

Find the Field Data

Your organization most likely gathers information about customer experienced field failures. Call centers, return authorizations, replacements, repairs, warranty claims, all provide information on field failures.

Ideally, you will have date installed, date of failure, use conditions, symptoms or failure mode, plus a root causes analysis of each failure to the specific mechanism. Right. More likely we have the date the customer reported the failure, which may not be the same as when it actually failed.

When first looking for the field data it is often gathered for other purposes. The databases and records are to help serve the customer and track costs, not to reveal reliability performance. As you and the organization realize the value of the field data analysis, you’ll be able to establish better data capture processes.

Another element of data you require for an analysis is the number of units placed in service, both those that have failed and those that have not failed. The shipments data is often a good source. Better would be records of initialization or turn on.

This maybe complicated by delays in shipping and warehousing. Or with the use of units as spares. Likewise, not all units installed and operated continue to operate indefinitely. This may take some work and investigation to determine the nominal and range of operating durations. Most simply assume every unit shipped is still operating unless reported as failed.

Gather the Field Data

A common mistake is to simply count the number of returns each month and report the count on a month by month bar chart. This is easy and generally non-informative. Trends are likely to causes by variation in shipments as any other reason.

Beyond how many failures occur you need to gather the time to failure information as a minimum. When was it shipped, installed, failed and reported would be great, yet knowing the month of shipment and month of failure is often the best we can do. The time to failure data allows Weibull analysis or similar to estimate the overall failure rate trend versus the age of the product. Time zero is when installed (or shipped) for each unit. Do they show signs of wear out (increasing failure rate) after 3 months?

The conditions of use and reported failure mode provide a way to Pareto the issues. Adding the cost to the customer or the manufacturer may provide a way to refine the priorities for improvement work. Plot the various failure modes or better failure mechanisms with the Weibull analysis as each one is likely to be on a different failure rate trend. Some will indicate early life failures and other may show wear out behavior. At different points in the age of the product the Pareto of issues is likely to be different.

The last of often most useful element of the field data, is find out what happened. Do the root cause analysis on as many units as possible. Determine the sequence of events or stresses that leads to the product failure. If it’s not possible to redesign or improve the current product, you can on the next design cycle.

We’ll explore how to analyze the data in another article, yet gathering the data is often the difficult part of the exercise. Do you have good data? Where do you find is the best source of field data?

 

Fred Schenkelberg is an experienced reliability engineering and management consultant with his firm FMS Reliability. His passion is working with teams to create cost-effective reliability programs that solve problems, create durable and reliable products, increase customer satisfaction, and reduce warranty costs. If you enjoyed this articles consider subscribing to the ongoing series at Accendo Reliability.

Don Fitchett ??

Industrial Training Instructor - Distance learning at BIN95.com (All industrial training topics covered). PLC Training at your location too.

8 年

In reading your great take on this industry issue Fred Schenkelberg, it gave me an idea. When we are online with the machine's automation control (PLC/PAC), we teach students, when you click 'save as' the PLC uploads to computer a mirror image of the data table. This is what we tell them, "freezing a moment in time". Then they can go back and analyze all the data, what setting where made, if a limit switch was made or not during the time of failure, etc. The idea you gave me Fred is that machine makers could automate saving a copy of the PLC program when an error occurs, and have snapshots of a moment in time when the error occurred, with all the variables recorded for their later analysis. In the past hard drive memory size was an issue, but now days we have plenty of room. OEMs need only implement this idea. Once they all catch on to this idea, the industry will evolve to adding even more sensors to machines, to capture more environmental variables like ambient temperature during the time of fault, etc. This idea will also help alleviate the fact that often when failure occurs, a human is not present to report back to designer all they heard, felt, smelled during time of failure. :) I can even envision video camera recording machine, and on fault, 5 minutes of video footage before and after fault are also saved to the hard drive with automation control data table image.

Freddie Appoh PhD MBA CEng

Railway Engineering Consultant | Executive Director

8 年

Fred is always a pleasure to read your article. I must suggest from my experience the best place to acquire good quality data is from the guys looking after the assets. Experience technicians and operations teams always have considerable information in their heads that could provide valuable information about the assets, hence data. The best way to get the right level of data is about asking the right questions and making them feel that their contribution is valuable . This provide a basis for ownership of data management as well. Secondly, data management training and effective standard operating procedures are required to ensure 1) quantity of data recording i.e the efficient part and 2) quality data recording ie the effective part. Simple in-process checks by supervisors of maintenance organisation and common standard of recording across any organisation assures good quality data could be captured. Though, it might be resource intensive,it normally pays off. The third and final option is for organisations to reduce their heavily reliance on CMMSs as the magic tool . Though having one is good provided you used the CMMS for the right reasons ( supporting effective failure and decision analysis ) rather than having it for the sake of it. A lot of organisations are striving for magic tools and capabilities from their CMMSs with the hope that it will solve all their problems. In fact the reality is that , CMMS is not cheap and cost millions and still require human involvement to ensure the right data is recorded. I will end by saying that the need for effective training, development and ownership of assets by production and maintenance teams always play key in ensuring data integrity.

要查看或添加评论,请登录

Fred Schenkelberg的更多文章

  • Accendo Weekly Update #490 March 23, 2025

    Accendo Weekly Update #490 March 23, 2025

    Reliability Analysis Courses One using Minitab, the other using Reliasoft Weibull++. Both are collaborations between…

  • Accendo Weekly Update #489 March 16, 2025

    Accendo Weekly Update #489 March 16, 2025

    Course offered by Industrial Metallurgist Hosted on imetllc.com and taught by Michael Pfeifer.

    3 条评论
  • Accendo Weekly Update #488 March 9, 2025

    Accendo Weekly Update #488 March 9, 2025

    Barringer Process Reliability Introduction A new course by André-Michel Ferrari This is a Beta Launch with a 50%…

    1 条评论
  • Accendo Weekly Update #487 March 2, 2025

    Accendo Weekly Update #487 March 2, 2025

    CMMSradio A podcast series by Greg Christensen All things CMMS, Computerized Maintenance Management Software, including…

    2 条评论
  • Accendo Weekly Update #486 February 23, 2025

    Accendo Weekly Update #486 February 23, 2025

    NoMTBF An article series by Fred Schenkelberg and friends A series of articles devoted to the eradication of the misuse…

    4 条评论
  • Accendo Weekly Update #485 February 16, 2025

    Accendo Weekly Update #485 February 16, 2025

    The RCA An article series by Bob and Ken Latino According to Bob, "I tend to write about all things Root Cause Analysis…

    6 条评论
  • Accendo Weekly Update #484 February 9, 2025

    Accendo Weekly Update #484 February 9, 2025

    Courses offered by Integral Concepts A set of courses offered by Allise and Steven Wachs More than just Applied…

    4 条评论
  • Accendo Weekly Update #483 February 2, 2025

    Accendo Weekly Update #483 February 2, 2025

    The Manufacturing Academy A set of courses offered by Ray Harkins and team Courses designed to teach foundational…

    2 条评论
  • Accendo Weekly Update #482 January 26, 2025

    Accendo Weekly Update #482 January 26, 2025

    Speaking of Reliability A podcast where friends talk shop Enjoy an episode of Speaking of Reliability. Where you can…

    1 条评论
  • Accendo Weekly Update #481 January 19, 2025

    Accendo Weekly Update #481 January 19, 2025

    Everyday RCM Short videos and some articles by Nancy Regan Reliability Centered Maintenance (RCM) is a time-honored…

    1 条评论

社区洞察

其他会员也浏览了