#11 The Rise of 10X SRE
Last week I was at a DevOps conference and met a lot of technology professionals in the DevOps space. What surprised me was that not a single attendee identified as a DevOps engineer, while many identified as site reliability engineers (SREs).
How Invention & Innovation Spreads
The SRE role was invented at Google exactly 20 years ago. Any invention at a hyperscaler first gets adopted by (relatively) smaller technology companies and Independent Software Vendors (ISVs). ISVs are interesting in that they have problems similar to those of hyperscalers, but at nowhere near the same scale.
The rubber meets the road at the next level of adoption, when the invention makes it to practitioners. Practitioners are companies whose primary business is not to sell software but to use technology to improve their business operations. Since "software is eating the world", these companies are super-heavy users of technology. They also evaluate every technology purely from a business-value perspective.
The Equity & Leveling of the Playing Field
Organizations work best when each member is empowered and respected. Engineering organizations have always had a huge class divide between developers and mortals. Mortals included QAs, release teams, production support teams, etc.
The problem with being a mortal was that you did not have the right to innovate. You were simply supposed to follow the process. This was demoralizing for the people considered mortals, and it left productivity untapped for the organization.
Site Reliability Engineering and DevOps both leveled the field in their own ways by applying software principles and practices to operations. This not only improved the process but also gave non-core developers an opportunity to become more skilled and to contribute to innovation. Now they too could think outside the box and find ways to be effective rather than merely efficient.
DevOps vs SRE
The simplistic Silicon Valley definition of SREs is that they are the monitoring people. These are the folks who handle issues once workloads have made it to production. In other words, they deal with day 2 problems and make sure that service level agreements (SLAs), service level indicators (SLIs), and service level objectives (SLOs) are met.
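To make the SLI and SLO vocabulary concrete, here is a minimal Python sketch of an availability SLI checked against an SLO target, along with the share of the error budget still unspent. The class and function names, the request counts, and the 99.9% target are all assumptions chosen for illustration, not something a particular monitoring tool prescribes.

```python
from dataclasses import dataclass


@dataclass
class AvailabilityWindow:
    """Request counts for one measurement window (hypothetical structure)."""
    total_requests: int
    good_requests: int


def availability_sli(window: AvailabilityWindow) -> float:
    """SLI: fraction of requests in the window that were served successfully."""
    if window.total_requests == 0:
        return 1.0  # assumption: no traffic counts as fully available
    return window.good_requests / window.total_requests


def slo_met(window: AvailabilityWindow, slo_target: float) -> bool:
    """SLO check: the measured SLI must be at or above the target."""
    return availability_sli(window) >= slo_target


def error_budget_remaining(window: AvailabilityWindow, slo_target: float) -> float:
    """Fraction of the error budget left; negative means the budget is overspent."""
    allowed_bad = (1.0 - slo_target) * window.total_requests
    actual_bad = window.total_requests - window.good_requests
    if allowed_bad == 0:
        return 1.0 if actual_bad == 0 else 0.0
    return 1.0 - actual_bad / allowed_bad


# Illustrative numbers only: one million requests, 500 of them failed.
window = AvailabilityWindow(total_requests=1_000_000, good_requests=999_500)
print(f"SLI: {availability_sli(window):.4%}")
print(f"SLO met at 99.9%: {slo_met(window, 0.999)}")
print(f"Error budget remaining: {error_budget_remaining(window, 0.999):.1%}")
```

An SLA would typically sit on top of this as a contractual commitment, usually with a looser target than the internal SLO.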
DevOps engineering, on the other hand, is about operationalizing the developers' work. DevOps engineers are the CI/CD people. Their productivity is measured by metrics such as change failure rate, lead time for changes, and deployment frequency.
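The delivery metrics named above can be computed from a simple log of deployments. The following Python sketch assumes a hypothetical `Deployment` record with a commit time, a deploy time, and a flag for whether the change caused a failure in production; the sample data and the seven-day period are invented for illustration.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from statistics import median


@dataclass
class Deployment:
    """One production deployment (hypothetical record layout)."""
    committed_at: datetime
    deployed_at: datetime
    caused_failure: bool


def deployment_frequency(deployments: list[Deployment], period_days: int) -> float:
    """Average number of deployments per day over the period."""
    return len(deployments) / period_days


def median_lead_time_hours(deployments: list[Deployment]) -> float:
    """Median time from commit to production, in hours."""
    return median(
        (d.deployed_at - d.committed_at).total_seconds() / 3600
        for d in deployments
    )


def change_failure_rate(deployments: list[Deployment]) -> float:
    """Share of deployments that led to a failure in production."""
    if not deployments:
        return 0.0
    return sum(d.caused_failure for d in deployments) / len(deployments)


# Illustrative data only: three deployments in one week, one of which failed.
start = datetime(2023, 1, 2, 9, 0)
deployments = [
    Deployment(start, start + timedelta(hours=2), caused_failure=False),
    Deployment(start, start + timedelta(hours=4), caused_failure=True),
    Deployment(start, start + timedelta(hours=6), caused_failure=False),
]
print(f"Deployments per day: {deployment_frequency(deployments, 7):.2f}")
print(f"Median lead time (hours): {median_lead_time_hours(deployments):.1f}")
print(f"Change failure rate: {change_failure_rate(deployments):.0%}")
```

In practice the deployment log would come from the CI/CD system, and how a "failure" is defined is a team decision rather than something this sketch settles.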
The question arises when we need to decide whether DevOps is a process or a role. DevOps is better understood as a process, while SRE is a role. Release engineering is an alternative that is both a process and a role, but it is not widely used.
The virtue of social mobility
I have yet to meet a QA who does not want to become a developer. A developer who does not want to become an architect. An architect who does not want to become a principal architect. Upward social mobility is ingrained into our psyche and that's what keeps us driven and motivated.
In 2003, with a very tough job market, for a few months, I worked as a business analyst in Austin. It was essentially a QA role with extra focus on business requirements and making sure there were no gaps. I could clearly see how constrained I was in what I could do and how developers were looked at as a different class.
Summary
The rise of the cloud has worked not only as a great disruptor but also as a great equalizer. While agile methodologies provided a framework for how modern development teams should operate, the cloud made rebuilding the whole technology organization an urgent problem. I see the following convergent patterns: