You're tasked with securing a live distributed system. How do you ensure updates without causing disruptions?

Dive into the art of seamless system updates! Share your strategies for maintaining live systems with zero downtime.

System Architecture

+ 关注

Last updated on 2024年9月11日

You're tasked with securing a live distributed system. How do you ensure updates without causing disruptions?

Dive into the art of seamless system updates! Share your strategies for maintaining live systems with zero downtime.

添加您的观点

32 个回答

Sumit Dahiya

Solution Architect|Enterprise-Scale Applications
举报内容
Rolling Updates: I would rather go with rolling updates instead of taking the complete system down., i.e., keeping things up and running, with downtime as low as possible. Backup and Test: Be as cautious with updates as you can. With any modifications, I would back up everything before doing so. I would then move on to a staging system that behaves exactly the same as what went live and run some tests there—this way, I could know for sure if the updates are going to break something down once they go into production. Monitoring: Once an update is released, I would monitor how the system was responding. And if things went wrong, that was okay—I had a rollback strategy to jump back before being cursed with an empty promise.

已翻译

赞
Oshrie Cohen

Technical Customer Support Manager at Maxlinear Corporation
举报内容
Create a backup: Ensure you have a recent and complete backup of both systems. Test environment: Set up a test environment to replicate your production environment. Plan downtime: If necessary, plan for a maintenance window or use a load balancer. Upgrade slave system: Stop services, apply upgrade, start services, verify. Upgrade master system: Stop services, apply upgrade, start services, verify. Switch roles: Use your system's

已翻译

赞
Khushi Gupta

Computer Science Engineering Student at SRMIST | Proficient in Java, Python, and SQL | Aspiring Software Developer
举报内容
Automated Testing: Write comprehensive unit tests and integration tests to ensure the update doesn't break existing functionality. Staging Environment: Test the update in a staging environment that mirrors the production setup. Rolling Updates: Update servers or nodes sequentially to minimize downtime. Load Balancing: Use load balancers to distribute traffic and ensure no single point of failure. Monitoring: Set up real-time monitoring tools (e.g., Prometheus, Grafana) to detect anomalies during updates. Tools: Kubernetes- container orchestration | Jenkins- automated testing, deployment | Docker- containerisation

已翻译

赞
Ankit Kumar

Engineering Manager @ Docusign | MarTech | Growth | Scalable Systems Builder | Mentor | Navodayan
举报内容
To secure a live distributed system and ensure updates without causing disruptions, adopt a phased approach. First, implement blue-green or canary deployments, allowing gradual updates without affecting the entire system. Use load balancers to route traffic to updated instances while keeping others active. Maintain redundancy through failover mechanisms and active monitoring to detect issues early. Apply rolling updates, updating parts of the system sequentially. Ensure robust automated testing, including security checks, before deployment. Finally, establish a rollback strategy for quick recovery in case of failure, ensuring minimal downtime.

已翻译

赞
Rick M.

Liquid Cooling SME and Immersion Requirements Lead at OCP
举报内容
Deployed in a sandbox first. MOST server infrastructure is virtualized today. I'd snapshot before doing any major changes and then do a rolling upgrade with the least impactful servers performing the updates first and bringing them back into production before moving to the next batch.

已翻译

赞

查看更多回答

System Architecture

+ 关注

给文章评分

我们借助人工智能创建了此文章。您认为这篇文章怎么样？

很棒不太好

举报此文章

查看全部

You're tasked with securing a live distributed system. How do you ensure updates without causing disruptions?

System Architecture

You're tasked with securing a live distributed system. How do you ensure updates without causing disruptions?

System Architecture

给文章评分

感谢您的反馈

更多System Architecture相关文章

更多相关阅读内容

You're tasked with securing a live distributed system. How do you ensure updates without causing disruptions?

System Architecture

You're tasked with securing a live distributed system. How do you ensure updates without causing disruptions?

System Architecture

给文章评分

感谢您的反馈

查看其他技能