
Risks, Their Sources, Root Causes, and Handling Strategies

Glen Alleman MSSM

Vietnam Veteran, Applying Systems Engineering Principles, Processes & Practices to Increase the Probability of Program Success for Complex Systems in Aerospace & Defense, Enterprise IT, and Process and Safety Industries

Published: December 1, 2022

Risk identification during early design phases of complex systems is commonly implemented but often fails to identify events and circumstances that challenge program performance.

Inefficiencies in cost and schedule estimates are usually held accountable for cost and schedule overruns, but the true root cause is often the realization of programmatic risks. A deeper understanding of frequent risk identification trends and biases pervasive during space system design and development is needed, for it would lead to improved execution of existing identification processes and methods.

Risk management means building a model of the risk, a model of the impact of the risk on the program, and a model for handling the risk. Since it is a risk, the corrective or preventive action has not occurred yet.

Probabilistic Risk Assessment (PRA) is the basis of these models and provides the Probability of Program Success. Probabilities result from uncertainty and are central to the analysis of the risk. Scenarios and model assumptions, with their model parameters, are based on current knowledge of the behavior of the system under a given set of uncertainty conditions.

Since risk is the outcome of Uncertainty, distinguishing between the types of uncertainty in the definition and management of risk on complex systems is useful when building risk assessment and management models.

Epistemic uncertainty, from the Greek επιστημη (episteme), meaning knowledge, is uncertainty due to a lack of knowledge of quantities or processes of the system or the environment. Epistemic uncertainty is represented by the ranges of values for parameters, a range of workable models, the level of model detail, multiple expert interpretations, and statistical confidence. Epistemic uncertainty derives from a lack of knowledge about the appropriate value to use for a quantity that is assumed to have a fixed value in the context of a particular analysis. The accumulation of information and implementation of actions reduce epistemic uncertainty to eliminate or reduce the likelihood and/or impact of risk. This uncertainty is modeled as a subjective assessment of the probability of our knowledge and the probability of occurrence of an undesirable event.

Incomplete knowledge about some characteristics of the system or its environment is the primary source of Epistemic uncertainty.

Aleatory uncertainty, from the Latin alea (a single die), is the inherent variation associated with a physical system or the environment. Aleatory uncertainty arises from inherent randomness, natural stochasticity, and environmental or structural variation across space and time in the properties or behavior of the system under study. The accumulation of more data or additional information cannot reduce aleatory uncertainty. This uncertainty is modeled as a stochastic process of an inherently random physical model. The projected impact of the risk produced by Aleatory uncertainty can be managed through cost, schedule, and/or technical margin.

Naturally occurring variations associated with the physical system are primary sources of Aleatory uncertainty.

There is a third uncertainty found on some programs that is not addressed here, since this uncertainty is not correctable.

Ontological Uncertainty is attributable to the complete lack of knowledge of the states of a system. This is sometimes labeled an Unknowable Risk. Ontological uncertainty cannot be measured directly.

Ontological uncertainty creates risk from inherent variations and incomplete information that is not knowable.

Separating Aleatory and Epistemic Uncertainty for Risk Management

Knowing the percentage of reducible uncertainty versus irreducible uncertainty is needed to construct a credible risk model. Without this separation, not knowing which uncertainty is reducible and which is irreducible inhibits the design of the corrective and preventive actions needed to increase the probability of program success.

Separating the types of uncertainty serves to increase the clarity of risk communication, making it clear which type of uncertainty can be reduced and which types cannot be reduced. For the latter (irreducible risk), only a margin can be used to protect the program from uncertainty.
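One way to estimate the share of reducible versus irreducible uncertainty is a two-loop Monte Carlo simulation, sketched below. This is an illustrative technique, not a method the article prescribes. The outer loop samples what we do not know (here, a hypothetical range for the true mean duration of a task), the inner loop samples natural variation around that value, and the law of total variance splits the result. The function name, the 18 to 26 day range, and the 2-day standard deviation are all assumptions made for the example.

```python
import random
import statistics


def decompose_uncertainty(n_outer=1000, n_inner=100, seed=1):
    """Split total variance into reducible (epistemic) and irreducible (aleatory) shares."""
    rng = random.Random(seed)
    conditional_means = []
    conditional_variances = []
    for _ in range(n_outer):
        # Epistemic: the true mean duration is unknown (hypothetical range, in days).
        true_mean = rng.uniform(18.0, 26.0)
        # Aleatory: natural variation around whatever the true mean turns out to be.
        samples = [rng.gauss(true_mean, 2.0) for _ in range(n_inner)]
        conditional_means.append(statistics.fmean(samples))
        conditional_variances.append(statistics.pvariance(samples))
    reducible = statistics.pvariance(conditional_means)      # Var(E[Y | theta])
    irreducible = statistics.fmean(conditional_variances)    # E[Var(Y | theta)]
    total = reducible + irreducible
    return reducible / total, irreducible / total


reducible_share, irreducible_share = decompose_uncertainty()
print(f"reducible: {reducible_share:.0%}, irreducible: {irreducible_share:.0%}")
```

Under these assumed inputs, a bit more than half of the variance comes from the unknown mean, which is the part that buying knowledge could reduce. The remainder can only be covered with margin.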

As uncertainty increases, the ability to precisely measure the uncertainty is reduced to where a direct estimate of the risk can no longer be assessed through a mathematical model. While a decision in the presence of uncertainty must still be made, deep uncertainty and poorly characterized risks lead to the absence of data and risk models.

Epistemic Uncertainty Creates Reducible Risk

The risk created by Epistemic Uncertainty represents resolvable knowledge, with elements expressed as probabilistic uncertainty of a future value related to a loss in a future period of time. Awareness of this lack of knowledge provides the opportunity to reduce this uncertainty through direct corrective or preventive actions.

Epistemic uncertainty, and the risk it creates, is modeled by defining the probability that the risk will occur, the time frame in which that probability is active, the probability of an impact or consequence from the risk when it does occur, and finally, the probability of the residual risk when the handling of that risk has been applied.
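The four elements named above can be captured in a small record, shown here as a minimal sketch. The class name `EventRisk`, its field names, and the sample values are hypothetical. The expected-impact formula (probability of occurrence times probability of impact times impact size) is one simple convention and not the only way to score a risk.

```python
from dataclasses import dataclass


@dataclass
class EventRisk:
    name: str
    p_occurrence: float      # probability the risk occurs while it is active
    active_from_day: int     # start of the time frame in which the risk is active
    active_to_day: int       # end of that time frame
    p_impact: float          # probability of a consequence, given the risk occurs
    impact_days: float       # schedule impact if the consequence is realized
    p_residual: float        # probability of occurrence after handling is applied

    def is_active(self, day: int) -> bool:
        return self.active_from_day <= day <= self.active_to_day

    def expected_impact(self, handled: bool = False) -> float:
        """Expected schedule impact in days, before or after handling."""
        p = self.p_residual if handled else self.p_occurrence
        return p * self.p_impact * self.impact_days


risk = EventRisk("late subcontractor status data", 0.4, 30, 90, 0.5, 20.0, 0.1)
print(risk.expected_impact(), risk.expected_impact(handled=True))
```

With these made-up numbers the expected impact falls from 4 days to 1 day once handling is applied, which is the kind of comparison a Risk Register entry needs to justify the cost of the handling work.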

Epistemic uncertainty statements define and model these event-based risks:

If-Then: if we miss our next milestone, then the program will fail to achieve its business value during the next quarter.
Condition-Concern: our subcontractor has not provided enough information for us to status the schedule, and our concern is the schedule is slipping and we do not know it.
Condition-Event-Consequence: our status shows there are some tasks behind schedule, so we could miss our milestone, and the program will fail to achieve its business value in the next quarter.

For these types of risks, an explicit or implicit risk-handling plan is needed. The word handling is used with a special purpose. “We Handle risks” in a variety of ways. Mitigation is one of those ways. In order to mitigate the risk, new effort (work) must be introduced into the schedule. We are buying down the risk, or we are retiring the risk, by spending money and/or consuming time to reduce the probability of the risk occurring. Or we could be spending money and consuming time to reduce the impact of the risk when it does occur. In both cases, actions are taken to address the risk.

Reducible Cost Risk

Reducible cost risk is often associated with unidentified reducible Technical risks, changes in technical requirements, and their propagation that impact cost. Understanding the uncertainty in cost estimates supports decision-making for setting targets and contingencies, risk treatment planning, and the selection of options in the management of program costs. Before reducible cost risk analysis can take place, the cost structure must be understood. Cost risk analysis goes beyond capturing the cost of WBS elements in the Basis of Estimate and Cost Estimating Relationships. This involves:

Development of quantitative modeling of integrated cost and schedule, incorporating the drivers of reducible uncertainty in quantities, rates, and productivity, and the recording of these drivers in the Risk Register.
Determining how cost and schedule uncertainty can be integrated into the analysis of the cost risk model.
Performing sensitivity analysis to provide an understanding of the effects of reducible uncertainty and the allocation of contingency amounts across the program.

Reducible Schedule Risk

Schedule Risk Analysis (SRA) is an effective technique to connect the risk information of program activities to the baseline schedule, to provide sensitivity information of individual program activities to assess the potential impact of uncertainty on the final program duration and cost.

Schedule risk assessment is performed in 4 steps:

Baseline Schedule: Construct a credible activity network compliant with GAO-16-89G, “Schedule Assessment Guide: Best Practices for Project Schedules.”
Define Reducible Uncertainties: for activity durations and cost distributions from the Risk Register, and assign these to work activities affected by the risk and/or the work activities assigned to reduce the risk.
Run Monte Carlo simulations: for the schedule using the assigned Probability Distribution Functions (PDFs), using the Min/Max values of the distribution, for each work activity in the IMS.
Interpret Simulation Results: using data produced by the Monte Carlo Simulation, including at least:

Criticality Index (CI): Measures the probability that an activity is on the critical path.
Significance Index (SI): Measures the relative importance of an activity.
Schedule Sensitivity Index (SSI): Measures the relative importance of an activity taking the CI into account.
Cruciality Index (CRI): Measures the correlation between the activity duration and the total program duration.
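The steps above can be sketched on a toy network to show how two of the indices fall out of the simulation data. Everything specific here is an assumption for illustration: the three-activity network (A and B in parallel, then C), the triangular distributions, and the function names. The CRI is computed as the absolute Pearson correlation, which is one common formulation. A real SRA would run against the full IMS in a scheduling tool.

```python
import random
import statistics

# Hypothetical network: A and B run in parallel, C follows both.
# (min, most likely, max) durations in days are illustrative assumptions.
ACTIVITIES = {"A": (8, 10, 16), "B": (7, 10, 13), "C": (4, 5, 8)}


def pearson(xs, ys):
    """Sample Pearson correlation coefficient."""
    mx, my = statistics.fmean(xs), statistics.fmean(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sx = sum((x - mx) ** 2 for x in xs) ** 0.5
    sy = sum((y - my) ** 2 for y in ys) ** 0.5
    return cov / (sx * sy)


def run_sra(n=5000, seed=7):
    rng = random.Random(seed)
    draws = {name: [] for name in ACTIVITIES}
    on_critical_path = {name: 0 for name in ACTIVITIES}
    totals = []
    for _ in range(n):
        d = {name: rng.triangular(lo, hi, mode)
             for name, (lo, mode, hi) in ACTIVITIES.items()}
        totals.append(max(d["A"], d["B"]) + d["C"])
        # The longer of the two parallel activities is critical; C always is.
        on_critical_path["A" if d["A"] >= d["B"] else "B"] += 1
        on_critical_path["C"] += 1
        for name, value in d.items():
            draws[name].append(value)
    ci = {name: on_critical_path[name] / n for name in ACTIVITIES}
    cri = {name: abs(pearson(draws[name], totals)) for name in ACTIVITIES}
    return ci, cri, totals


ci, cri, totals = run_sra()
for name in ACTIVITIES:
    print(f"{name}: CI={ci[name]:.2f}  CRI={cri[name]:.2f}")
```

Activity C has a CI of 1.0 because it sits on every path. A and B split the remaining criticality, with A ahead because its distribution is skewed toward longer durations.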

Reducible Technical Risk

Technical risk is the impact on a program, system, or entire infrastructure when the outcomes from engineering development do not work as expected, do not provide the needed technical performance, or create higher than planned risk to the performance of the system. Failure to identify or properly manage this technical risk results in performance degradation, security breaches, system failures, increased maintenance time, and a significant amount of technical debt [1], plus additional cost and time for end item delivery for the program.

Reducible Cost Estimating Risk

Reducible cost-estimating risk is dependent on technical, schedule, and programmatic risks, which must be assessed to provide an accurate picture of program cost. Cost risk estimating assessment addresses the cost, schedule, and technical risks that impact the cost estimate. To quantify these cost impacts from the reducible risk, sources of risk need to be identified. This assessment is concerned with three sources of risk and ensures that the model calculating the cost also accounts for these risks:

The risk inherent in the cost estimating method: the Standard Error of the Estimate (SEE), confidence intervals, and prediction intervals.
The risk inherent in technical and programmatic processes: the technology’s maturity, design and engineering, integration, manufacturing, schedule, and complexity. [395]
The risk inherent in the correlation between WBS elements, which determines to what degree one WBS element’s change in cost is related to another and in which direction. WBS elements within space systems have positive correlations with each other, and the cumulative effect of this positive correlation increases the range of the costs. [2]
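The effect of positive correlation on the range of total cost follows from the variance of a sum, sketched below. The function name, the use of a single common pairwise correlation `rho`, and the three sample standard deviations are simplifying assumptions. Real WBS elements would each have their own pairwise correlations.

```python
import math


def total_cost_sigma(sigmas, rho):
    """Standard deviation of summed WBS element costs with a common pairwise correlation."""
    variance = sum(s * s for s in sigmas)
    for i in range(len(sigmas)):
        for j in range(i + 1, len(sigmas)):
            variance += 2.0 * rho * sigmas[i] * sigmas[j]
    return math.sqrt(variance)


wbs_sigmas = [10.0, 20.0, 30.0]  # hypothetical cost standard deviations, in $K
for rho in (0.0, 0.3, 1.0):
    print(f"rho={rho}: sigma of total = {total_cost_sigma(wbs_sigmas, rho):.1f}")
```

Treating the elements as independent gives a total standard deviation of about 37.4, while full positive correlation gives 60. Ignoring correlation therefore understates the spread of the cost estimate.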

Aleatory Uncertainty Creates Irreducible Risk

Aleatory uncertainty and the risk it creates come not from the lack of information, but from the naturally occurring processes of the system. For aleatory uncertainty, more information cannot be bought, nor can specific risk reduction actions be taken, to reduce the uncertainty and resulting risk. The objective of identifying and managing aleatory uncertainty is to be prepared to handle the impacts when risk is realized.

The method for handling these impacts is to provide the margin for this type of risk, including cost, schedule, and technical margin.

Margin is the difference between the maximum possible value and the maximum expected value and is separate from Contingency. Contingency is the difference between the current best estimate and the maximum expected estimate. For systems under development, the technical resources and the technical performance values carry both margin and contingency.
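The two definitions reduce to simple arithmetic on three values. The sketch below applies them to a made-up spacecraft mass budget. The function name and the numbers are hypothetical.

```python
def margin_and_contingency(current_best_estimate, maximum_expected, maximum_possible):
    """Return (margin, contingency) using the definitions in the text."""
    if not current_best_estimate <= maximum_expected <= maximum_possible:
        raise ValueError("expected current best <= maximum expected <= maximum possible")
    contingency = maximum_expected - current_best_estimate
    margin = maximum_possible - maximum_expected
    return margin, contingency


# Hypothetical mass budget in kg: the launch vehicle can lift 500 kg,
# the design is expected to grow to 450 kg, and it currently weighs 410 kg.
margin, contingency = margin_and_contingency(410.0, 450.0, 500.0)
print(f"margin = {margin} kg, contingency = {contingency} kg")
```

Here the contingency of 40 kg covers expected growth in the design, and the margin of 50 kg is what remains between the expected outcome and the hard limit.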

Schedule Margin should be used to cover the naturally occurring variances in how long it takes to do the work. The Cost Margin is held to cover the naturally occurring variances in the price of something being consumed in the program. Technical margin is intended to cover the naturally occurring variation of technical products.

Aleatory uncertainty and the resulting risk are modeled with a Probability Distribution Function (PDF) that describes the possible values the process can take and the probability of each value. The PDF for the possible durations of the work in the program can be determined. Knowledge can be bought about the aleatory uncertainty through Reference Class Forecasting and past performance modeling. This new information then allows us to update, or adjust, our estimates, since our past performance on similar work provides information about our future performance. But the underlying processes are still random, and our new information simply creates a new aleatory uncertainty PDF.
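A minimal sketch of using past performance as a reference class: take the ratios of actual to planned duration from similar completed work, treat them as an empirical distribution, and scale the new plan by the ratio at a chosen confidence level. The function name, the nearest-rank quantile rule, and the ten sample ratios are assumptions made for the example, not data from any program.

```python
import math


def forecast_duration(planned_days, past_ratios, confidence=0.8):
    """Scale the plan by the empirical quantile of past actual/planned ratios."""
    ordered = sorted(past_ratios)
    # Nearest-rank quantile: the smallest ratio with at least `confidence`
    # of the reference class at or below it.
    rank = max(1, math.ceil(confidence * len(ordered)))
    return planned_days * ordered[rank - 1]


# Hypothetical actual/planned duration ratios from ten similar work packages.
reference_class = [0.9, 1.0, 1.05, 1.1, 1.1, 1.2, 1.25, 1.3, 1.5, 1.8]
print(forecast_duration(100, reference_class, confidence=0.8))
```

For a 100-day plan, eight of the ten reference ratios are at or below 1.3, so the 80 percent confidence forecast is about 130 days. The spread in the ratios remains, which is the point of the paragraph above: the new data reshapes the PDF but does not remove the randomness.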

The first step in handling Irreducible Uncertainty is the creation of Margin (Schedule Margin, Cost Margin, and Technical Margin) to protect the program from the risk of irreducible uncertainty. Margin is defined as the allowance in the budget, and programmed schedule … to account for uncertainties and risks. [255]

Margin needs to be quantified by:

Identifying WBS elements that contribute to margin.
Identifying uncertainty and risk that contributes to margin.

Irreducible Schedule Risk

Programs are over budget and behind schedule, to some extent because uncertainties are not accounted for in schedule estimates. Research and practice are now addressing this problem, often by using Monte Carlo methods to simulate the effect of variances in work package costs and durations on total cost and date of completion. However, many such program risk approaches ignore the significant impact of probabilistic correlation on work package cost and duration predictions.

Irreducible schedule risk is handled with Schedule Margin, which is defined as the amount of added time needed to achieve a significant event with an acceptable probability of success. Significant events are major contractual milestones or deliverables.

With minimal or no margins in schedule, technical, or cost present to deal with unanticipated risks, the successful acquisition is susceptible to cost growth and cost overruns.

The Program Manager owns the schedule margin. It does not belong to the client, nor can it be negotiated away by the business management team or the customer. This is the primary reason to CLEARLY identify the Schedule Margin in the Integrated Master Schedule. [108] It is there to protect the program deliverable(s). The schedule margin is not allocated to over-running tasks, but rather is planned to protect the end item deliverables.

The schedule margin should protect the delivery date of major contract events or deliverables. This is done with a Task in the IMS that has no budget (BCWS). The duration of this Task is derived from Reference Classes or Monte Carlo Simulation of aleatory uncertainty that creates a risk to the event or deliverable.
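The duration of that margin task can be derived from a Monte Carlo run as the gap between the deterministic baseline and the simulated duration at the chosen confidence level. The sketch below assumes three serial tasks with triangular duration distributions and an 80 percent target. The task values, the function name, and the nearest-rank percentile are illustrative assumptions.

```python
import random

# Hypothetical serial tasks leading to a milestone: (min, most likely, max) days.
TASKS = [(8, 10, 16), (4, 5, 9), (18, 20, 30)]


def schedule_margin(tasks, confidence=0.8, n=10000, seed=11):
    """Return (baseline, duration at confidence, margin) in days."""
    rng = random.Random(seed)
    # Deterministic baseline: the sum of the most likely durations.
    baseline = sum(mode for _, mode, _ in tasks)
    totals = sorted(
        sum(rng.triangular(lo, hi, mode) for lo, mode, hi in tasks)
        for _ in range(n)
    )
    # Nearest-rank percentile of the simulated durations.
    target = totals[min(n - 1, int(confidence * n))]
    return baseline, target, max(0.0, target - baseline)


baseline, target, margin = schedule_margin(TASKS)
print(f"baseline={baseline} d, P80={target:.1f} d, margin task={margin:.1f} d")
```

Because each distribution is skewed toward longer durations, the 80th percentile lands well beyond the 35-day baseline. That difference is the duration to give the zero-budget margin task in front of the milestone.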

The Master Schedule, with a schedule margin to protect against the impact of aleatory uncertainty, represents the most likely and realistic risk-based plan to deliver the needed capabilities of the program.

Cost Contingency

Cost Contingency is a reserved fund held by the Government Program Manager, added to the base cost estimate to account for cost uncertainty. It is the estimated cost of the “known-unknowns” cost risk that impacts the planned budget of the program. This contingency is not the same as Management Reserve. Cost Contingency is not intended to absorb the impacts of scope changes, escalation of costs, and unforeseeable circumstances beyond management control. Contingency is funding that is expected to be spent and therefore is tightly controlled. Contingency funds are for risks that have been identified in the program.

Irreducible cost risk is handled with Management Reserve and Cost Contingency, which are program cost elements related to program risks and are an integral part of the program’s cost estimate. Cost Contingency addresses the Ontological Uncertainties of the program. The Confidence Levels for the Management Reserve and Cost Contingency are based on the program’s risk assumptions, program complexity, program size, and program criticality.

When estimating the cost of work, that resulting cost number is a random variable. Point estimates of cost have little value in the presence of uncertainty. The planned unit cost of a deliverable is rarely the actual cost of that item. Covering the variance in the cost of goods may or may not be appropriate for Management Reserve.
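To illustrate why a point estimate says little on its own, the sketch below treats cost as a lognormal random variable and computes the amount that must be added to the median estimate to reach a given confidence level. The lognormal shape, the function name, and the parameter values are assumptions for the example. The article does not specify a distribution.

```python
import math
from statistics import NormalDist


def contingency_for_confidence(median_cost, sigma_log, confidence=0.8):
    """Amount above the median needed to reach `confidence`, assuming lognormal cost."""
    z = NormalDist().inv_cdf(confidence)  # standard normal quantile
    return median_cost * (math.exp(z * sigma_log) - 1.0)


# Hypothetical: median estimate of $1,000K with a log-space standard deviation of 0.25.
for level in (0.5, 0.7, 0.8, 0.9):
    extra = contingency_for_confidence(1000.0, 0.25, level)
    print(f"{level:.0%} confidence: add ${extra:,.0f}K")
```

A budget set at the median is met only half the time under this model. Reaching 80 percent confidence takes roughly 23 percent more with these assumed inputs, and the required amount grows quickly at higher confidence levels.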

Assigning Cost Reserves needs knowledge of where in the Integrated Master Schedule these cost reserves are needed. The resulting Integrated Master Schedule, with schedule margin, provides locations in the schedule where cost reserves are aligned with the planned work and provides the ability to layer cost reserves across the baseline to determine the funding requirements for the program. This allows the program management to determine realistic target dates for program deliverables and the cost reserves, and schedule margin, needed to protect those delivery dates.

Irreducible Technical Risk

The last 10 percent of the technical performance generates one-third of the cost and two-thirds of the problems. (Norman Augustine’s 15th Law)

If margin is the difference between the maximum possible value and the maximum expected value, and contingency is the difference between the current best estimate and the maximum expected estimate, then for systems under development the technical outcome and technical performance values both carry margin and contingency.

Technical Margin and Contingency serve several purposes:

Describe the need for and use of resource margins and contingency in system development.
Define and distinguish between margins and contingency.
Demonstrate that, historically, resource estimates grow as designs mature.
Provide a representative margin depletion table showing prudent resource contingency as a function of the program phase.

For any system, in any stage of its development, there is a maximum possible, maximum expected, and current best estimate for every technical outcome. The current best estimate of a technical performance value changes as the development team improves the design and the understanding of that design matures.

For a system in development, most technical outcomes should carry both margin and contingency. As designs mature, the estimate of any technical resource usually grows. This is true historically and, independent of exactly why, development programs must plan for it to occur.

The goal of Technical Margin (unlike Cost and Schedule Margin) is to reduce the margins (for example, Size, Weight, and Power) to as close to zero as possible, to maximize mission capabilities. The technical growth and its handling include:

Expected technical growth: contingency accounts for expected growth

Recognize mass growth is historically inevitable.
As systems mature through their development life cycle, a better understanding of the design emerges from conceptual to actual designs.
Requirements changes often increase resource use.

Unplanned technical growth: margins account for unexpected growth

Recognize space system development is challenging.
Programs encounter “unknown unknowns” with the use of new technology that is difficult to gauge, which develop into uncertainties in the design and execution of the program, all the way to manufacturing variations.

Ontological Uncertainty

On the scale of uncertainty, from random naturally occurring processes (aleatory) to the lack of knowledge (epistemic), Ontological uncertainty lies at the far end of this continuum and is a state of complete ignorance. [344] Not only are the uncertainties not known, but the uncertainties also may not be knowable. While the truth is out there, [3] it cannot be accessed because it is simply not known where to look in the first instance. Ontological uncertainty is what is called operating outside the experience base, where things are done for the first time and operate in a state of ignorance.

Management of uncertainty is the critical success factor of effective program management. Complex programs and the organizations that deliver them can involve people with different genders, personality types, cultures, first languages, social concerns, and/or work experiences.

Such differences can lead to ontological uncertainty and semantic uncertainty. Ontological uncertainty involves different parties in the same interactions having different conceptualizations about what kinds of entities inhabit their world, what kinds of interactions these entities have, and how the entities and their interactions change as a result of these interactions. Semantic uncertainty involves different participants in the same interactions giving different meanings to the same term, phrase, and/or actions. Ontological uncertainty and semantic uncertainty can lead to intractable misunderstandings between people.

When new technologies are introduced into these complex organizations, concepts, principles, and techniques may be fundamentally unfamiliar and carry a higher degree of ontological uncertainty than more mature technologies. A subtler problem is that these uncertainties are often implicit rather than explicit and become an invisible and unquestioned part of the system. When epistemic and ontological uncertainty represent our lack of knowledge, then reducing the risks produced by this uncertainty requires improvement in our knowledge of the system of interest or avoiding situations that increase these types of uncertainty. To reduce Epistemic and Ontological uncertainty, there must be a reduction in the uncertainty of the model of system behavior (ontology) or in the model’s parameters (epistemology).

[1] Technical debt is a term used in the Agile community to describe the eventual consequences of deferring complexity or implementing incomplete changes. As a development team progresses, there may be additional coordinated work that must be addressed in other places in the schedule. When these changes do not get addressed within the planned time or get deferred for a later integration, the program accumulates a debt that must be paid at some point in the future.

[2] The use of the Design Structure Matrix provides visibility and modeling of these dependencies. Many models consider the dependencies as static or fixed at some value. But the dependencies are dynamic, driven by nonstationary stochastic processes that evolve as the program evolves.

[3] Apologies to Mulder and Scully and the X-Files.
