To measure the success of your team, several frameworks provide metrics indicating team health, which help ensure the team is moving in a positive direction. Psychological safety matters for healthy teams to ensure each software engineer brings their own lived experiences to build better products and that they feel safe to do so. #devops #sre #cloud
Nilesh Nimkar的动态
最相关的动态
-
If you’ve ever wondered about the differences between DevOps, SRE, and Platform Engineering, here’s a breakdown in simple terms: 1. DevOps: DevOps is like the rockstar philosophy of the tech world. It’s all about empowering development teams to take charge of getting their code into production smoothly. Now, how they do it can vary a lot depending on the company. Some people swear by the technical side, while others say it’s more about building a collaborative culture. But at its core, DevOps is about finding smart ways to deploy new features and code faster, whether that means tweaking deployment pipelines, setting KPIs, or just automating everything in sight. 2. SRE (Site Reliability Engineering): Think of SRE as the guardian angel of your production systems. Once your code hits production, SRE kicks in to make sure everything keeps humming along smoothly. It’s all about monitoring, setting goals for how reliable your services should be, and jumping into action when things go wrong. Basically, SRE is like having a dedicated team making sure your services stay healthy and happy. 3. Platform Engineering: Platform Engineering is the backbone that supports both DevOps and SRE. It’s all about building a solid foundation that makes life easier for everyone involved. This might involve coding infrastructure tools like Terraform or Ansible, but the focus isn’t on the nitty-gritty business logic. Instead, it’s about creating a platform that’s flexible and efficient, whether you’re in development, operations, or SRE. And thanks to fancy stuff like IaaS, SaaS, and PaaS, you can often focus on the fun stuff without worrying too much about the underlying infrastructure. #Devops #Platform #SRE #AWS #GCP #Azure #terraform #ansible #ops #cloudcomputing #iaas #paas #saas #CloudOps #Cloudautomation #devospautomation #docker #kubernetes #cicd #jenkins #gitlab #PlatformEngineering #SoftwareDevelopment #SiteReliabilityEngineering #TechCulture #Automation #InfrastructureAsCode #ContinuousDelivery #DeploymentPipeline #Monitoring #ServiceReliability #CloudComputing #DigitalTransformation #TechIndustry #CodeDeployment #DevOpsCulture #TechTrends
要查看或添加评论,请登录
-
Google SRE NYC Tech Talk - May 22nd, 6:30pm Google SRE NYC proudly announces our second Tech Talk event in 2024. The in-person only event will take place on Wednesday, 22nd of May 2024 at 6 PM at our Chelsea Markets office in NYC. The doors will open at 5:30 pm. Spaces are limited, RSVP via our Meetup event page.?https://lnkd.in/ef5fqdrE Agenda: Salvatore Furino - Customer Reliability Engineer (CRE) at Bloomberg “The Hammer Changes the Hand” This talk will briefly explore how to view internal tooling through the lens of product management in not just developing and shipping features, but how those features empower teams to change their understanding of their social-technical systems. Sal is a Customer Reliability Engineer. During his career he’s worked as a TPM, SRE, Developer, Sys Admin, and IT support. While not working he enjoys cooking, gaming, traveling, skiing, and golfing. Sal lives in Queens and has a BS in Applied Mathematics from Marist College. Thiara Ortiz - Staff CDN Reliability Engineer at Netflix? “How we measure Quality of Experience to ensure our members get a world class experience they have come to expect from Netflix ” Any time a Netflix member sits down, reclines in their chair and turns on their TV to Netflix, there's a moment of truth. It's an opportunity to deliver a spectacular service with amazing quality of experience. This talk will go over how we measure the quality of experience for our members and how we work to develop new metrics when we have additional offerings like live streaming and cloud gaming. Thiara has worked at some of the largest internet companies in the world, Meta and Netflix. During her time at Meta. Since Meta, Thiara has been working at Netflix as a Staff CDN Reliability engineer. Her focus is primarily on resilience and quality of experience for members streaming from Open Connect. When incidents occur and Netflix's systems do not behave as expected, Thiara can be found working and engaging the necessary teams to remediate these issues. Mike Scherbakov - Staff Site Reliability Engineer at Google “LLM for SRE / Using LLM in SRE space” LLMs open up an opportunity to automate and scale many operational processes, which couldn't be otherwise solved by conventional methods. Examples include simple summarization of issues and incidents, assisting production on-callers, managing incidents, clustering (creating taxonomy) of issues, scaling SRE via assisted review of development design documents. Therefore LLMs provide a new and unique opportunity to transform the work we do as SREs. Mike works in YouTube Ads SRE as well as co-leading LLM4SRE, LLM center of competency in SRE at Google Our Tech Talks series are focused on professional development and networking: no recruiters, sales or press are allowed.
要查看或添加评论,请登录
-
Google SRE NYC Tech Talk - May 22nd, 6:30pm Google SRE NYC proudly announces our second Tech Talk event in 2024. The in-person only event will take place on Wednesday, 22nd of May 2024 at 6 PM at our Chelsea Markets office in NYC. The doors will open at 5:30 pm. Spaces are limited, RSVP via our Meetup event page. https://lnkd.in/ef5fqdrE Agenda: Salvatore Furino - Customer Reliability Engineer (CRE) at Bloomberg “The Hammer Changes the Hand” This talk will briefly explore how to view internal tooling through the lens of product management in not just developing and shipping features, but how those features empower teams to change their understanding of their social-technical systems. Sal is a Customer Reliability Engineer. During his career he’s worked as a TPM, SRE, Developer, Sys Admin, and IT support. While not working he enjoys cooking, gaming, traveling, skiing, and golfing. Sal lives in Queens and has a BS in Applied Mathematics from Marist College. Thiara Ortiz - Staff CDN Reliability Engineer at Netflix “How we measure Quality of Experience to ensure our members get a world class experience they have come to expect from Netflix ” Any time a Netflix member sits down, reclines in their chair and turns on their TV to Netflix, there's a moment of truth. It's an opportunity to deliver a spectacular service with amazing quality of experience. This talk will go over how we measure the quality of experience for our members and how we work to develop new metrics when we have additional offerings like live streaming and cloud gaming. Thiara has worked at some of the largest internet companies in the world, Meta and Netflix. During her time at Meta. Since Meta, Thiara has been working at Netflix as a Staff CDN Reliability engineer. Her focus is primarily on resilience and quality of experience for members streaming from Open Connect. When incidents occur and Netflix's systems do not behave as expected, Thiara can be found working and engaging the necessary teams to remediate these issues. Mike Scherbakov - Staff Site Reliability Engineer at Google “LLM for SRE / Using LLM in SRE space” LLMs open up an opportunity to automate and scale many operational processes, which couldn't be otherwise solved by conventional methods. Examples include simple summarization of issues and incidents, assisting production on-callers, managing incidents, clustering (creating taxonomy) of issues, scaling SRE via assisted review of development design documents. Therefore LLMs provide a new and unique opportunity to transform the work we do as SREs. Mike works in YouTube Ads SRE as well as co-leading LLM4SRE, LLM center of competency in SRE at Google Our Tech Talks series are focused on professional development and networking: no recruiters, sales or press are allowed.
要查看或添加评论,请登录
-
Google SRE NYC Tech Talk - May 22nd, 6:30pm Google SRE NYC proudly announces our second Tech Talk event in 2024. The in-person only event will take place on Wednesday, 22nd of May 2024 at 6 PM at our Chelsea Markets office in NYC. The doors will open at 5:30 pm. Spaces are limited, RSVP via our Meetup event page. https://lnkd.in/ef5fqdrE Agenda: Salvatore Furino - Customer Reliability Engineer (CRE) at Bloomberg “The Hammer Changes the Hand” This talk will briefly explore how to view internal tooling through the lens of product management in not just developing and shipping features, but how those features empower teams to change their understanding of their social-technical systems. Sal is a Customer Reliability Engineer. During his career he’s worked as a TPM, SRE, Developer, Sys Admin, and IT support. While not working he enjoys cooking, gaming, traveling, skiing, and golfing. Sal lives in Queens and has a BS in Applied Mathematics from Marist College. Thiara Ortiz - Staff CDN Reliability Engineer at Netflix “How we measure Quality of Experience to ensure our members get a world class experience they have come to expect from Netflix ” Any time a Netflix member sits down, reclines in their chair and turns on their TV to Netflix, there's a moment of truth. It's an opportunity to deliver a spectacular service with amazing quality of experience. This talk will go over how we measure the quality of experience for our members and how we work to develop new metrics when we have additional offerings like live streaming and cloud gaming. Thiara has worked at some of the largest internet companies in the world, Meta and Netflix. During her time at Meta. Since Meta, Thiara has been working at Netflix as a Staff CDN Reliability engineer. Her focus is primarily on resilience and quality of experience for members streaming from Open Connect. When incidents occur and Netflix's systems do not behave as expected, Thiara can be found working and engaging the necessary teams to remediate these issues. Mike Scherbakov - Staff Site Reliability Engineer at Google “LLM for SRE / Using LLM in SRE space” LLMs open up an opportunity to automate and scale many operational processes, which couldn't be otherwise solved by conventional methods. Examples include simple summarization of issues and incidents, assisting production on-callers, managing incidents, clustering (creating taxonomy) of issues, scaling SRE via assisted review of development design documents. Therefore LLMs provide a new and unique opportunity to transform the work we do as SREs. Mike works in YouTube Ads SRE as well as co-leading LLM4SRE, LLM center of competency in SRE at Google Our Tech Talks series are focused on professional development and networking: no recruiters, sales or press are allowed.
要查看或添加评论,请登录
-
Google SRE NYC Tech Talk - May 22nd, 6:30pm Google SRE NYC proudly announces our second Tech Talk event in 2024. The in-person only event will take place on Wednesday, 22nd of May 2024 at 6 PM at our Chelsea Markets office in NYC. The doors will open at 5:30 pm. Spaces are limited, RSVP via our Meetup event page. https://lnkd.in/ef5fqdrE Agenda: Salvatore Furino - Customer Reliability Engineer (CRE) at Bloomberg “The Hammer Changes the Hand” This talk will briefly explore how to view internal tooling through the lens of product management in not just developing and shipping features, but how those features empower teams to change their understanding of their social-technical systems. Sal is a Customer Reliability Engineer. During his career he’s worked as a TPM, SRE, Developer, Sys Admin, and IT support. While not working he enjoys cooking, gaming, traveling, skiing, and golfing. Sal lives in Queens and has a BS in Applied Mathematics from Marist College. Thiara Ortiz - Staff CDN Reliability Engineer at Netflix “How we measure Quality of Experience to ensure our members get a world class experience they have come to expect from Netflix ” Any time a Netflix member sits down, reclines in their chair and turns on their TV to Netflix, there's a moment of truth. It's an opportunity to deliver a spectacular service with amazing quality of experience. This talk will go over how we measure the quality of experience for our members and how we work to develop new metrics when we have additional offerings like live streaming and cloud gaming. Thiara has worked at some of the largest internet companies in the world, Meta and Netflix. During her time at Meta. Since Meta, Thiara has been working at Netflix as a Staff CDN Reliability engineer. Her focus is primarily on resilience and quality of experience for members streaming from Open Connect. When incidents occur and Netflix's systems do not behave as expected, Thiara can be found working and engaging the necessary teams to remediate these issues. Mike Scherbakov - Staff Site Reliability Engineer at Google “LLM for SRE / Using LLM in SRE space” LLMs open up an opportunity to automate and scale many operational processes, which couldn't be otherwise solved by conventional methods. Examples include simple summarization of issues and incidents, assisting production on-callers, managing incidents, clustering (creating taxonomy) of issues, scaling SRE via assisted review of development design documents. Therefore LLMs provide a new and unique opportunity to transform the work we do as SREs. Mike works in YouTube Ads SRE as well as co-leading LLM4SRE, LLM center of competency in SRE at Google Our Tech Talks series are focused on professional development and networking: no recruiters, sales or press are allowed.
要查看或添加评论,请登录
-
Google SRE NYC Tech Talk - May 22nd, 6:30pm Google SRE NYC proudly announces our second Tech Talk event in 2024. The in-person only event will take place on Wednesday, 22nd of May 2024 at 6 PM at our Chelsea Markets office in NYC. The doors will open at 5:30 pm. Spaces are limited, RSVP via our Meetup event page. https://lnkd.in/ef5fqdrE Agenda: Salvatore Furino - Customer Reliability Engineer (CRE) at Bloomberg “The Hammer Changes the Hand” This talk will briefly explore how to view internal tooling through the lens of product management in not just developing and shipping features, but how those features empower teams to change their understanding of their social-technical systems. Sal is a Customer Reliability Engineer. During his career he’s worked as a TPM, SRE, Developer, Sys Admin, and IT support. While not working he enjoys cooking, gaming, traveling, skiing, and golfing. Sal lives in Queens and has a BS in Applied Mathematics from Marist College. Thiara Ortiz - Staff CDN Reliability Engineer at Netflix “How we measure Quality of Experience to ensure our members get a world class experience they have come to expect from Netflix ” Any time a Netflix member sits down, reclines in their chair and turns on their TV to Netflix, there's a moment of truth. It's an opportunity to deliver a spectacular service with amazing quality of experience. This talk will go over how we measure the quality of experience for our members and how we work to develop new metrics when we have additional offerings like live streaming and cloud gaming. Thiara has worked at some of the largest internet companies in the world, Meta and Netflix. During her time at Meta. Since Meta, Thiara has been working at Netflix as a Staff CDN Reliability engineer. Her focus is primarily on resilience and quality of experience for members streaming from Open Connect. When incidents occur and Netflix's systems do not behave as expected, Thiara can be found working and engaging the necessary teams to remediate these issues. Mike Scherbakov - Staff Site Reliability Engineer at Google “LLM for SRE / Using LLM in SRE space” LLMs open up an opportunity to automate and scale many operational processes, which couldn't be otherwise solved by conventional methods. Examples include simple summarization of issues and incidents, assisting production on-callers, managing incidents, clustering (creating taxonomy) of issues, scaling SRE via assisted review of development design documents. Therefore LLMs provide a new and unique opportunity to transform the work we do as SREs. Mike works in YouTube Ads SRE as well as co-leading LLM4SRE, LLM center of competency in SRE at Google Our Tech Talks series are focused on professional development and networking: no recruiters, sales or press are allowed.
要查看或添加评论,请登录
-
Starting a career in DevOps and Cloud Engineering can feel overwhelming. With so many tools, technologies, and concepts, where do you even begin? The answer is simple: focus on getting the basics right. Here’s how you can build a strong foundation: 1?? Understand DevOps Principles DevOps is more than just tools—it’s about fostering collaboration between development and operations teams. Learn these key concepts: ? Continuous Integration (CI): Regularly merge code to detect and fix issues early. ? Continuous Delivery (CD): Automate the deployment process for faster, reliable releases. ? Automation and Monitoring: Save time and improve system reliability. 2?? Master Cloud Fundamentals Cloud platforms are essential for DevOps. Start by learning: ? What is Cloud Computing? ? Why use AWS, Azure, or GCP? ? Key services like virtual machines (EC2), storage (S3), and networking basics. 3?? Get Comfortable with Git Version control is the backbone of any DevOps workflow. Learn: ? How to initialize repositories, commit changes, and manage branches. ? Handling conflicts and collaborating effectively in teams. 4?? Learn Automation and Containers ? Start scripting with Bash or Python for basic automation tasks. ? Understand Docker to package applications into containers for easy deployment. 5?? Build Real-World Projects Practice is everything! Start small: ? Create a Git repository for a basic app. ? Dockerize the app. ? Set up a basic CI/CD pipeline with Jenkins or GitHub Actions. My Journey When I began, I focused on understanding why each tool or concept mattered. I learned Git to manage code, Docker to simplify deployments, and Jenkins to automate workflows. Over time, I tackled advanced tools like Kubernetes and Terraform, but the basics always guided me. Pro Tip: Always ask yourself: What problem does this tool solve? The clearer your understanding, the better you’ll perform in real-world scenarios. Call to Action: Are you starting your DevOps journey or curious about specific tools? Let’s connect and learn together—drop your questions or experiences in the comments! #DevOps #CloudEngineering #LearnDevOps #BeginnersGuide #TechCareers
要查看或添加评论,请登录
-
???????????????????? ???? ?????????? ???????? ???????????????????????????? ???? ???????? ???????????????? ??????????? #???????????? ?????? ????????!? ? This blog post explores how ???????????? ?????????????????? like #automation, #infrastructureascode and continuous integration/delivery can address common scalability challenges faced by organizations. ? We also dive into case studies to see how companies like Netflix and Amazon have achieved scalability with DevOps.? ? ?????????? ???? ???????? ???????? #?????????????????????? ???? ?????? ???????? ??????????? ???????? ???????? ???? ?????? ???????? ???? ?????????? ????????! #technology #IT #devopsengineers #devopstools #DevSecOps #development #digitaltransformation #codiant
要查看或添加评论,请登录
-
?? In today’s fast-paced world of #DevOps, being good at fixing problems aka #troubleshooting quickly is just as important as knowing your stuff. Imagine you're at work and something goes wrong with a big project. Here are some real examples: 1?? #Deployment_Fails ?: Picture this ?? Right before a big product launch, something goes wrong with putting it out there. A good DevOps person can quickly figure out what went wrong, like maybe something wasn't set up right, and fix it fast. 2?? #Things_Are_Slow ????: Sometimes, when you're using an app, it feels really slow or even stops working. A skilled DevOps person can look into why it's happening, like maybe there's too much stuff running at once, and make it work better. 3?? #Stuff_Stops_Working ?: Imagine if a big website suddenly goes down. A good troubleshooter can find out why, like maybe there's a problem with the servers, and get it back up and running quickly. 4?? #Security_Problems ??: If someone tries to break into a system, it's a big deal. A good DevOps person can find out what happened, like maybe someone got access they shouldn't have, and make sure it doesn't happen again. ?? By getting good at fixing problems fast, #DevOps people become really important at work. It means things keep running smoothly, problems get sorted out quickly, and everyone's happy.?? ?? Want to get better at fixing things? Share your own stories or tips in the comments! Let's help each other out and make things work better. ?? Feel free to share your own experiences or ideas!?? ? Follow Chhitiz Anand for more informative content ?? #DevOps #ProblemSolving #Cloud #Troubleshooting #Scenarios #RealTimeUseCases
要查看或添加评论,请登录
Marketing Content Manager at ContactLoop | Productivity & Personal Development Hacks
8 个月Nilesh Nimkar Useful post on team health frameworks. How do you implement these?