
Different Types of Hardware Faults in ISO 26262 and How to Measure Them

Duong TRAN

Technical (Project/Department) Manager | Senior Team Leader | Senior R&D Engineer | +20 Years Experience

Published: September 19, 2024

Hi there! In one of my previous articles, I pointed out the differences between the concepts of "Fault", "Error" and "Failure" in the functional safety context. If you are not sure how these concepts differ, you can read it here: How different is between "Fault", "Error" and "Failure" in context of functional safety?

In this article, I will summarize the different types of hardware faults in ISO 26262 and how to measure them.

1. A brief introduction to Hardware metrics vs. Safety Life-cycle

The ISO 26262 reference safety lifecycle encompasses the principal safety activities during the concept phase, product development, production, operation, service and decommissioning. Fault classification is performed during the product development phase at the hardware level.

Figure 1: Overview of development phase at the HW level (ISO26262-2:2018)

First of all, let me recall the failure-related definitions used in the rest of this article:

Failure: termination of an intended behavior of an element or an item due to a fault manifestation. Termination can be permanent or transient.
Failure Mode: manner in which an element or an item fails to provide the intended behavior.
Failure Mode Coverage (FMC): proportion of the failure rate of a failure mode of a hardware element that is detected or controlled by the implemented safety mechanism.
Failure Rate: probability density of failure divided by probability of survival for a hardware element. The failure rate is assumed to be constant and is generally denoted as “λ”.
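
To make the last two definitions more concrete, here is a minimal Python sketch. It assumes a constant failure rate, so the probability of survival is R(t) = exp(-λt), and it uses the common FIT unit (1 FIT = one failure per 10^9 device-hours). The function names and the 100 FIT / 90 % figures are my own illustrative assumptions, not values from the standard.

```python
import math

FIT = 1e-9  # 1 FIT = one failure per 10^9 device-hours


def survival_probability(failure_rate_per_hour: float, hours: float) -> float:
    """Probability of survival R(t) = exp(-lambda * t) for a constant failure rate."""
    return math.exp(-failure_rate_per_hour * hours)


def uncovered_rate(failure_mode_rate_fit: float, fmc: float) -> float:
    """Part of a failure mode's rate (in FIT) that the safety mechanism does not cover."""
    if not 0.0 <= fmc <= 1.0:
        raise ValueError("FMC must be between 0 and 1")
    return failure_mode_rate_fit * (1.0 - fmc)


# Hypothetical element: 100 FIT, one failure mode covered at 90 %.
print(survival_probability(100 * FIT, 10_000))  # chance of surviving 10,000 h
print(uncovered_rate(100.0, 0.90))              # FIT left uncovered
```

With these numbers, roughly 10 FIT of the failure mode stay uncovered, which is the quantity that feeds the residual fault rate later on.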

2. The types of faults mentioned in ISO26262

Safe Fault (S): a fault whose occurrence will not significantly increase the probability of violation of a safety goal.
Single-Point Fault (SPF): a fault that is not covered by any safety mechanism and that leads directly to the violation of a safety goal.
Multiple-Point Fault (MPF): an individual fault that, in combination with other independent faults, leads to the violation of a safety goal. Dual-point faults (DPF) are a subset of multiple-point faults in which an individual fault, in combination with one other independent fault, leads to the violation of a safety goal.
Latent Fault (LF): a multiple-point fault that is neither detected by a safety mechanism nor perceived by the driver, i.e. the fault remains latent until another fault occurs which, together with the latent fault, violates a safety goal.
Residual Fault (RF): the portion of a fault in a hardware component that is not covered by a safety mechanism and that leads to the violation of a safety goal. In other words, for a fault on a hardware component to be a residual fault rather than a single-point fault, the component must be protected by a safety mechanism, but that safety mechanism does not cover this particular fault.

A Multiple-Point Fault may be:

Detected MPF: a multiple-point fault that is detected by a safety mechanism within a prescribed time, which prevents it from being latent.
Perceived MPF: a multiple-point fault whose presence is deduced by the driver within a prescribed time interval.
Latent MPF: a multiple-point fault whose presence is neither detected by a safety mechanism nor perceived by the driver within the multiple-point fault detection interval.

Figure 2: Classification of faults according to ISO26262 [1]
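
The classification above can be read as a small decision tree. The sketch below is my simplified interpretation of it in Python, with one yes/no answer per question. The function, its parameter names and the enum are hypothetical. A real analysis works on failure-rate portions rather than on whole faults, so treat this only as a reading aid.

```python
from enum import Enum


class FaultClass(Enum):
    SAFE = "safe fault"
    SINGLE_POINT = "single-point fault"
    RESIDUAL = "residual fault"
    DETECTED_MPF = "detected multiple-point fault"
    PERCEIVED_MPF = "perceived multiple-point fault"
    LATENT_MPF = "latent multiple-point fault"


def classify_fault(
    *,
    violates_directly: bool,
    has_safety_mechanism: bool = False,
    covered_by_mechanism: bool = False,
    violates_in_combination: bool = False,
    detected: bool = False,
    perceived: bool = False,
) -> FaultClass:
    """Classify one fault of a safety-related element (simplified)."""
    if violates_directly:
        if not has_safety_mechanism:
            return FaultClass.SINGLE_POINT
        if not covered_by_mechanism:
            return FaultClass.RESIDUAL
        # Covered faults only matter together with a failure of the
        # safety mechanism, so they continue as multiple-point faults.
    elif not violates_in_combination:
        return FaultClass.SAFE
    if detected:
        return FaultClass.DETECTED_MPF
    if perceived:
        return FaultClass.PERCEIVED_MPF
    return FaultClass.LATENT_MPF


print(classify_fault(violates_directly=True).value)
print(classify_fault(violates_directly=False, violates_in_combination=True).value)
```

Note the design choice in the middle branch: a fault that would violate the safety goal directly but is covered by a safety mechanism falls through to the multiple-point branch, because it can only cause harm if the mechanism itself also fails.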

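The total failure rate λ of a safety-related hardware element can be broken down into:

λ = λ_SPF + λ_RF + λ_MPF + λ_S

where:

λ_SPF: Single-Point Faults (i.e. a dangerous undetected (DU) fault where there are no diagnostics)

λ_RF: Residual Faults (i.e. a DU fault not covered by the existing diagnostics)

λ_MPF: Multiple-Point Faults (i.e. faults that violate a safety goal only in combination with other independent faults; they may be detected, perceived or latent)

λ_S: Safe Faults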
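
As a quick arithmetic illustration of this breakdown, the following Python sketch stores the four contributions of one element and checks that they add up to the total. The class name and the FIT values are hypothetical and chosen only to make the sums easy to follow.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FailureRateBreakdown:
    """Failure rates of one hardware element in FIT (hypothetical values)."""
    spf: float   # single-point faults
    rf: float    # residual faults
    mpf: float   # multiple-point faults (detected, perceived and latent)
    safe: float  # safe faults

    @property
    def total(self) -> float:
        """Total failure rate: lambda = SPF + RF + MPF + S."""
        return self.spf + self.rf + self.mpf + self.safe

    def shares(self) -> dict[str, float]:
        """Fraction of the total failure rate in each category."""
        names = ("spf", "rf", "mpf", "safe")
        return {name: getattr(self, name) / self.total for name in names}


element = FailureRateBreakdown(spf=0.0, rf=2.0, mpf=38.0, safe=60.0)
print(element.total)     # 100.0
print(element.shares())
```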

3. ISO26262 Hardware Fault Metric

The hardware architectural metrics evaluate the effectiveness of the hardware architecture with respect to safety. They must be calculated for each safety goal defined in the safety requirements specification, considering all safety-related hardware elements of the item (denoted "SR, HW"). The hardware architectural metrics are required for ASIL C and D and recommended for ASIL B, which the standard writes as ASIL (B).

Figure 3a: Simplified flow diagram of [1] for manual determination of fault classification

Figure 3b: SPFM and LFM definition in ISO26262

SPFM (Single-Point Fault Metric) reflects the robustness of the item to single-point and residual faults. A high SPFM implies that the proportion of single-point faults and residual faults in the hardware of the item is low.
LFM (Latent Fault Metric) reflects the robustness of the item to latent faults. A high LFM implies that the proportion of latent faults in the hardware is low.
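
For readers who cannot see Figure 3b, the two metrics can be written as below. This is how I recall the formulas from ISO 26262-5 (Annex C), so please check them against the standard before using them. The sums run over all safety-related hardware elements (SR, HW), and the symbol for the latent part of the multiple-point faults is my own notation.

```latex
\mathrm{SPFM} = 1 - \frac{\sum_{SR,HW} \left( \lambda_{SPF} + \lambda_{RF} \right)}{\sum_{SR,HW} \lambda}
\qquad
\mathrm{LFM} = 1 - \frac{\sum_{SR,HW} \lambda_{MPF,latent}}{\sum_{SR,HW} \left( \lambda - \lambda_{SPF} - \lambda_{RF} \right)}
```

As a hypothetical example, with a total of 100 FIT, 2 FIT of single-point plus residual faults and 3.8 FIT of latent faults, SPFM = 1 - 2/100 = 98 % and LFM = 1 - 3.8/98 ≈ 96.1 %.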

ISO 26262:2018 Part 5 defines the achievable ASIL as a function of the hardware architectural metrics, as shown in the following table:

Table 1: Recommended target values for the hardware architecture metrics [1] Part 5
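
The table is an image in the original post, so here is a small Python lookup as a sketch. The target values are the ones I remember from ISO 26262-5 (SPFM: 90 % / 97 % / 99 % and LFM: 60 % / 80 % / 90 % for ASIL B / C / D) and should be verified against the standard. The function name is hypothetical.

```python
from typing import Optional

# Target values as fractions, quoted from memory of ISO 26262-5.
# ASIL A has no target for these metrics.
SPFM_TARGETS = {"B": 0.90, "C": 0.97, "D": 0.99}
LFM_TARGETS = {"B": 0.60, "C": 0.80, "D": 0.90}


def highest_asil_met(spfm: float, lfm: float) -> Optional[str]:
    """Return the highest ASIL whose SPFM and LFM targets are both met."""
    for asil in ("D", "C", "B"):
        if spfm >= SPFM_TARGETS[asil] and lfm >= LFM_TARGETS[asil]:
            return asil
    return None  # below the ASIL B targets


print(highest_asil_met(spfm=0.98, lfm=0.95))  # C
```

In this example the LFM would be good enough for ASIL D, but the SPFM of 98 % is below the 99 % target, so the architecture only reaches the ASIL C targets.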

How to evaluate Random Hardware Failures?

To evaluate random hardware failures, ISO 26262 proposes the PMHF (Probabilistic Metric for random Hardware Failures), which is the most widely used method. Its target values per ASIL are given below:

Table 2: Recommended target values for PMHF and PFH
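
This table is also an image in the original post. As a sketch, the PMHF targets I remember from ISO 26262-5 are below 10^-7 per hour (100 FIT) for ASIL B and C and below 10^-8 per hour (10 FIT) for ASIL D. Please verify them against the standard. The helper name is hypothetical, and the PFH values from the figure are not covered here.

```python
FIT = 1e-9  # 1 FIT = one failure per 10^9 device-hours

# PMHF targets in failures per hour, quoted from memory of ISO 26262-5.
PMHF_TARGETS_PER_HOUR = {"B": 1e-7, "C": 1e-7, "D": 1e-8}


def pmhf_target_met(pmhf_per_hour: float, asil: str) -> bool:
    """Check a computed PMHF value against the target for the given ASIL."""
    return pmhf_per_hour < PMHF_TARGETS_PER_HOUR[asil]


pmhf = 5 * FIT  # hypothetical result of a quantitative analysis
print(pmhf_target_met(pmhf, "D"))       # True
print(pmhf_target_met(50 * FIT, "D"))   # False
```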

Lastly, FMEDA ends the Failure Classification process

To classify the failure rates methodically for each safety goal, we can use the FMEDA (Failure Modes, Effects and Diagnostic Analysis) method.

Here is an example of a complete calculation using the FMEDA method:
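
The worked example in the original post is a figure. As a stand-in, the Python sketch below runs a very small FMEDA with three invented rows. Every name and number in it is a hypothetical assumption of mine. The splitting rule is also a simplification: the safe fraction is removed first, then the diagnostic coverage splits the rest into a residual and a multiple-point part.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class FmedaRow:
    failure_mode: str
    rate_fit: float               # failure rate of this failure mode in FIT
    safe_fraction: float          # share of the rate that is safe
    violates_directly: bool       # could violate the safety goal on its own
    dc_residual: Optional[float]  # coverage against direct violation, None = no mechanism
    dc_latent: float              # coverage against latent faults


def classify(row: FmedaRow) -> dict[str, float]:
    """Split one row into SPF, RF, MPF, latent MPF and safe parts (FIT)."""
    safe = row.rate_fit * row.safe_fraction
    not_safe = row.rate_fit - safe
    spf = rf = 0.0
    mpf = not_safe
    if row.violates_directly:
        if row.dc_residual is None:
            spf, mpf = not_safe, 0.0   # no safety mechanism at all
        else:
            rf = not_safe * (1.0 - row.dc_residual)
            mpf = not_safe - rf
    latent = mpf * (1.0 - row.dc_latent)
    return {"spf": spf, "rf": rf, "mpf": mpf, "latent": latent, "safe": safe}


def hardware_metrics(rows: list[FmedaRow]) -> tuple[float, float]:
    """Return (SPFM, LFM) over all rows."""
    parts = [classify(r) for r in rows]
    total = sum(r.rate_fit for r in rows)
    spf_rf = sum(p["spf"] + p["rf"] for p in parts)
    latent = sum(p["latent"] for p in parts)
    return 1.0 - spf_rf / total, 1.0 - latent / (total - spf_rf)


rows = [
    FmedaRow("MCU core: wrong computation", 50.0, 0.5, True, 0.99, 0.90),
    FmedaRow("Sensor supply: drift", 30.0, 0.2, True, 0.90, 0.60),
    FmedaRow("Watchdog: stuck inactive", 20.0, 0.0, False, None, 0.90),
]
spfm, lfm = hardware_metrics(rows)
print(f"SPFM = {spfm:.2%}, LFM = {lfm:.2%}")
```

With these invented rows the SPFM comes out at about 97.35 % and the LFM at about 86.5 %. A real FMEDA has one row per failure mode of every safety-related component and takes the failure rates from a recognized reliability data source.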

In addition, ISO 26262 also addresses the following kinds of faults:

Permanent Faults: These are faults that remain until the system is repaired. Examples include hardware failures like a short circuit or broken components.
Transient Faults: These faults occur temporarily and may not indicate a permanent issue. They can arise from environmental factors, such as electromagnetic interference.
Intermittent Faults: These faults appear and disappear sporadically. They can be challenging to diagnose since they do not manifest consistently.
Systematic Faults: These are faults caused by design flaws, implementation errors, or insufficient testing processes. Systematic faults often stem from incorrect assumptions made during development.
Random Faults: These faults arise unpredictably, often due to hardware wear and tear or external conditions, such as temperature extremes.
Human Errors: Errors made during design, coding, testing, or maintenance can lead to faults. ISO 26262 emphasizes the need for processes to minimize human error.

In summary, ISO 26262 describes various types of hardware faults that can affect the safety and functionality of automotive systems. Understanding these faults and how to measure them is essential for compliance and safety assurance.

References:

[1] ISO 26262:2018, Part 1, Part 2, Part 5
[2] https://www.byhon.it/what-iso-26262-says-about-fault-classification/
[3] https://functionalsafetyengineer.com/intro-to-iso-26262-fault-metrics/
[4] Google Photos
