登录查看更多内容

Check out these new releases! Plus: why observability and testing go together

Gremlin

The Reliability Management Platform for high-velocity engineering teams

发布日期: 2024年7月16日

+ 关注

?? Latest Releases

Intelligent Health Checks: one-click observability for reliability tests

With Intelligent Health Checks, simply click a checkbox, and Gremlin creates a full set of Health Checks that can be used to determine service health during reliability tests—no third-party observability tools required.

In this blog post , we’ll explain how Intelligent Health Checks work, how they automate reliability testing, and how you can get them up and running in just a few minutes.

What is the Well-Architected Cloud Test Suite?

Designed around cloud reliability principles and best practices, the Well-Architected Cloud Test Suite gives you a testing foundation that covers the most common reliability failures out of the box. Based on cloud best practice guides like the Well-Architected Framework, it helps you automate and standardize resilience testing to make your system more reliable.

Read the blog post to find out more about test suites and which tests are included in the Well-Architected Test Suite.

——

??How-tos and best practices

How to load-balance across multiple availability zones for improved redundancy

Load balancers are some of the most important load-bearing (pun intended) components in cloud environments. They perform multiple critical tasks: network switching, packet inspection, and of course, routing. Most cloud-based load balancers focus on load balancing within a single zone, but what if you have resources spread across multiple zones?

In this blog , we’ll explain how cross-zone load balancing works, why it’s important to reliability, and how you can enable it in your own cloud deployments.

How to prevent accidental load balancer deletions

Accidental deletions, misconfigurations, and “fat-fingering” are unfortunate truths in the software industry, but there are ways to prevent them.

Oracle Cloud 1 年前

App Migration to 10x Clouds

W Martin W. 3 个月前

Empower Your Business with Cloud Computing: Key IT…

Vishal Mane 2 个月前

In this blog , we’ll tell you how to find critical resources that are at risk of being accidentally deleted, and how to mitigate this risk. Specifically, we’ll focus on the primary way customer traffic reaches your services: through load balancers.

Observability and incident response need resilience testing

Anyone wanting to minimize downtime and deliver reliable, available applications needs to have fully instrumented systems and playbooks so they can respond quickly and effectively to outages or incidents. But there’s another piece to the reliability puzzle: resilience testing.

Read this blog to find out how resilience testing works together with your observability and incident response practices to reduce the amount and severity of incidents, lower your MTTR, and make your system more reliable.

——

??? Office Hours

Upcoming! 5 essential resilience tests for a successful cloud migration

August 8th, 11am PT/2pm ET

Migrating to the cloud usually means faster deployments and easier scalability, but it also means latency. Cloud applications communicate over distributed networks, and while these networks are fast, little bits of latency can quickly add up.

In this Office Hours session , we’ll talk about the latency problem inherent to cloud computing and how it can impact your applications. We’ll discuss the network-centric design of cloud platforms, how to build applications to best use this design, and how to ensure your services are resilient and fault-tolerant.

On-demand: How to run fault injection tests on AWS managed services

Fully-managed SaaS services offer incredible scalability and accessibility, but at a cost: they’re also single points of failure. If your application depends on a SaaS service and the service fails, guess who your customers will blame?

In this Office Hours session , we’ll show you how you can recreate a failure in a managed service provider using Gremlin’s fault injection tools. You’ll learn how to run experiments that replicate SaaS outages in a safe, controlled, reversible way, while only impacting the services you want to test. We’ll also show you how you can easily choose from a pre-populated list of managed services directly in the Gremlin web app.

——

Check out these new releases! Plus: why observability and testing go together

Gremlin

The Reliability Management Platform for high-velocity engineering teams

?? Latest Releases

??How-tos and best practices

领英推荐

??? Office Hours

Gremlin Reliability Newsletter

1,855 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

Why It’s Time to Evolve Network Automation

5 Best Practices to Ensure Cloud Resilience

10 Advantages of Moving On-Prem to the Cloud

How ComputerVault Overcomes Five Key Challenges of Digital Transformation

Facing Downtime During Azure Migration? How Bayshore Minimizes Business Disruption

Defining SLA agreements for the cloud-based solution?

Everyone is focused on Resiliency, but what does it really mean?

Accelerate Your Modernization Journey with G7 CR's Azure Plus Program

Well-Architected Review

Unleash RIM.

?? Latest Releases

??How-tos and best practices

领英推荐

??? Office Hours

Gremlin Reliability Newsletter

1,855 位关注者

?? Tips to help you avoid your worst reliability nightmares

2024年10月21日

Release roundup, customer webinar, office hours, and compliance!

2024年9月26日

AWS tips, new RBAC release, TLS/WR SSL certificate tests, and more!

2024年8月23日

Gremlin for AWS release, migration tips for Kubernetes, and microservice reliability

2024年6月27日

New testing how-tos, CI/CD office hours, and how to deal with layoffs

2024年5月14日

社区洞察

其他会员也浏览了

Why It’s Time to Evolve Network Automation

5 Best Practices to Ensure Cloud Resilience

10 Advantages of Moving On-Prem to the Cloud

How ComputerVault Overcomes Five Key Challenges of Digital Transformation

Facing Downtime During Azure Migration? How Bayshore Minimizes Business Disruption

Defining SLA agreements for the cloud-based solution?

Everyone is focused on Resiliency, but what does it really mean?

Accelerate Your Modernization Journey with G7 CR's Azure Plus Program

Well-Architected Review

Unleash RIM.