登录查看更多内容

Test Engineer Perspective-Part 1:

Arif Chauhan

Test Automation | Performance Testing | Product Development | Python | K6 | Microservices |Bash Shell | TestOps| Kubernetes | Robot Framework | Azure | Telecom @Tanla Platforms Limited

发布日期: 2024年4月21日

When we build a high availability and enterprise scale platform, it is important to measure the availability and reliability before going live.

But the question arises, is it feasible to do it given the constraints of infra, time and sufficient data points?

Would love to hear opinions from like-minded professionals.

Personally, based on my experience in evaluating such kind of platforms, would follow the below approach - ?

1.? Setup test environment like production with all external interfaces connected with simulators.

2. Setup tools to generate input traffic like K6, JMeter, Locust etc.

3. Setup APM tools like Dynatrace, Grafana, ELK, cloud-native services etc.

4. Setup alerts in application stack, network, and server resources (CPU, Mem, storage etc.). One of the tools for this is Nagios or one can create custom scripts if feasible.

5. Create a workgroup of experts having skills of performance testing, network/Infra, Dev, DB and Ops.

6. Identify the metrics to be collected. In this context, we need to collect the following.

a. Uptime

b.?Downtime

c. MTBF – Avg time taken between consecutive failures.

d. MTTR – Average time taken to restore the platform.

7. Execute the endurance test for longer duration say for a week. Observe the platform and its components. And leverage the respective tools as mentioned above to record following -

- Number of failures

领英推荐

Top 10 Steps to Conduct Successful Performance Testing

PixelQA - Software Testing Company 10 个月前

Difference between Bug and Defect

QACraft - Software Testing Company 2 年前

The Single Most Expensive Element of Testing

Fortitude 17 Limited 2 年前

- Critical alerts

- Failure duration

- Time taken to repair/restore after failure

8. The interesting aspect is what if the platform does not fail. It sounds great; (however, it is unlikely ??).? So, we need to use chaos engg on critical components during the test run. And observe the above parameters.

?9. Calculate the availability as

(Uptime/Total Test Duration) *100

Where, Uptime = Total Test Duration - ∑ Downtime

10.? Calculate the MTBF as

(∑ Up Time)/Total No of Failures

11.? Calculate the MTTR

∑ (Down Time)/Total No of Failures ?

12.? Calculate reliability as

? (MTBF/ (MTBF + MTTR)) * 100

Would like to cover more practical aspects in my next post.

Arif Chauhan的更多文章

A Test Approach for Data Migration

2020年6月28日

A Test Approach for Data Migration

The consolidation of businesses continues to happen for obvious reasons particularly in Telecom, Banking, and the…

2 条评论

Test Engineer Perspective-Part 1:

Arif Chauhan

Test Automation | Performance Testing | Product Development | Python | K6 | Microservices |Bash Shell | TestOps| Kubernetes | Robot Framework | Azure | Telecom @Tanla Platforms Limited

领英推荐

Arif Chauhan的更多文章

社区洞察

其他会员也浏览了

Improve Test Coverage To Account For Run-Time Environmental Variations

Our Cross-Platform?Testautomation Expertise

PUT vs PATCH Requests in API Testing

Performance Testing Best Practices & Tutorial

The Importance of API Testing

Tips for API Testing

Test Environment Management: A Best Practices Guide

Software Performance Testing in 3min

How Integration Tests reduce developer fatigue?

Smoke Tests: Ensuring Stability with Quick and Essential Checks

领英推荐

Arif Chauhan的更多文章

A Test Approach for Data Migration

社区洞察

其他会员也浏览了

Improve Test Coverage To Account For Run-Time Environmental Variations

Our Cross-Platform?Testautomation Expertise

PUT vs PATCH Requests in API Testing

Performance Testing Best Practices & Tutorial

The Importance of API Testing

Tips for API Testing

Test Environment Management: A Best Practices Guide

Software Performance Testing in 3min

How Integration Tests reduce developer fatigue?

Smoke Tests: Ensuring Stability with Quick and Essential Checks