Hadoop Multi Data Centre Migration

MapR Cluster Migration Via Multi Data Centre Setup


Task Description: - Migrating the MapR Hadoop cluster from one Data Centre setup to another proved really challenging from the moment I started this work.

Approaches/Options: - To migrate the existing cluster I considered two approaches.

  • Option A: Build a new cluster and mirror the data over the wire using TCP/IP. The pro of this approach is that we can build the new cluster independently and then promote the volumes. The con was the time needed to mirror >2PB of user data from a live, growing cluster: the estimate was around 4-6 months at the available network speed.
  • Option B: Merge the two Data Centre setups and grow the existing cluster, followed by a drain of the old infrastructure. The pros of this approach were that end users would never know we had changed the setup underneath, and a faster turnaround time. The con was that we could not know in advance whether data would flow seamlessly between the two setups.
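
The mirroring estimate for the first approach can be sanity-checked with simple arithmetic. The sketch below is only a back-of-the-envelope model: the sustained link throughput of 1.25 Gbit/s and the daily growth figure are assumptions chosen for illustration, not measurements from the actual cluster, and `mirror_days` is a hypothetical helper name.

```python
SECONDS_PER_DAY = 86_400


def mirror_days(data_bytes, link_bits_per_s, growth_bytes_per_day=0.0):
    """Estimate the days needed to mirror `data_bytes` over a link with the
    given sustained throughput while the source keeps growing."""
    copied_per_day = link_bits_per_s / 8 * SECONDS_PER_DAY
    # The backlog only shrinks by what is copied minus what is newly written.
    net_per_day = copied_per_day - growth_bytes_per_day
    if net_per_day <= 0:
        raise ValueError("source grows at least as fast as the link can copy")
    return data_bytes / net_per_day


TWO_PB = 2e15           # 2 PB in decimal bytes
ASSUMED_LINK = 1.25e9   # assumed sustained throughput in bit/s (illustrative)

print(round(mirror_days(TWO_PB, ASSUMED_LINK), 1))           # static data set
print(round(mirror_days(TWO_PB, ASSUMED_LINK, 1.35e12), 1))  # with assumed daily growth
```

Under these assumed numbers the static case comes out at roughly five months, which is in the same range as the 4-6 month estimate; any growth of the live cluster pushes the figure further out.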

Challenges: - We faced the following challenges during this mammoth project after adopting Option B.

  • The biggest challenge was the different network, subnet and IP range of the new setup.
  • Customizing the nodes' DNS names and domain names, since the new setup uses a different naming convention from the existing HUB.
  • User:Group setup to ensure cluster data remains valid post cutover.
  • Moving the control components (ZooKeeper, ResourceManager, CLDB) without hampering the working of the existing cluster.
  • RPC communication between the new and old infrastructure for the time when they co-exist.
  • DB synchronization for the metadata DBs and OpenTSDB (time series data for metrics).
  • Managing the SSL certificates for a secure cluster.
  • Getting the proxy in place and setting up HA for the various components.
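
While both setups co-exist, it helps to know at a glance which side of the merged cluster each node sits on. The following is a minimal sketch using Python's standard `ipaddress` module; the two subnets and the node names are hypothetical placeholders, not the real ranges of either Data Centre.

```python
import ipaddress

# Hypothetical subnets, for illustration only.
DC_SUBNETS = {
    "DC1": ipaddress.ip_network("10.10.0.0/16"),
    "DC2": ipaddress.ip_network("10.20.0.0/16"),
}


def data_centre_of(ip):
    """Return the name of the data centre whose subnet contains `ip`."""
    addr = ipaddress.ip_address(ip)
    for name, network in DC_SUBNETS.items():
        if addr in network:
            return name
    return "unknown"


def group_nodes(nodes):
    """Group a {hostname: ip} mapping by data centre."""
    groups = {}
    for host, ip in sorted(nodes.items()):
        groups.setdefault(data_centre_of(ip), []).append(host)
    return groups


# Hypothetical inventory.
inventory = {
    "old-node-01": "10.10.3.7",
    "old-node-02": "10.10.3.8",
    "new-node-01": "10.20.1.5",
    "stray-node": "192.168.0.9",
}
print(group_nodes(inventory))
```

A node that lands in the "unknown" group is a hint that its address falls outside both ranges and deserves a closer look before cutover.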

Pictorial Depiction of Solution


[Two diagrams of the solution; images not available in this text version]

Final validation covers volumes, data and users, and should ensure that there is no data loss once we dissolve DC1. I preferred to set the Balancer to rapid for this phase.
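
One way to picture the volume part of this validation is as a comparison of two inventories, one taken before the drain and one after. The sketch below assumes each inventory is a plain mapping of volume name to used bytes; how those numbers are collected from the cluster is left out, and the volume names and sizes are made up for illustration.

```python
def compare_volumes(before, after):
    """Compare two {volume_name: used_bytes} snapshots.

    Returns volumes that disappeared and volumes whose size shrank.
    A shrunk volume is only a candidate for review: on a live cluster
    users may have deleted data legitimately.
    """
    missing = sorted(set(before) - set(after))
    shrunk = sorted(
        name for name in before.keys() & after.keys()
        if after[name] < before[name]
    )
    return {"missing": missing, "shrunk": shrunk}


# Hypothetical snapshots.
before_drain = {"users": 900, "projects": 500, "tmp": 40}
after_drain = {"users": 950, "projects": 480}

report = compare_volumes(before_drain, after_drain)
print(report)
```

The same pattern extends to user and group checks by comparing two sets of user:group entries instead of two size mappings.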

Risks: -

  • The product team doesn't support multi-DC setups.
  • A split-brain scenario can occur during the ZooKeeper migration, leading to cluster failure.
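
The split-brain risk can be illustrated with ZooKeeper's majority rule: an ensemble of n servers needs n // 2 + 1 of them to form a quorum. The sketch below is a simplified model, not the actual migration procedure; the server names are hypothetical, and it assumes the danger window is the period in which old and new servers hold different ensemble membership lists.

```python
def majority(ensemble_size):
    """Smallest number of servers that forms a strict majority."""
    return ensemble_size // 2 + 1


def believes_quorum(reachable, configured):
    """True if the reachable servers form a majority of the ensemble
    as this side has it configured."""
    return len(reachable & configured) >= majority(len(configured))


# Hypothetical ensembles: three old servers, later extended by two new ones.
old_cfg = {"zk1", "zk2", "zk3"}
new_cfg = old_cfg | {"zk4", "zk5"}

# A network partition separates the servers into two sides.
side_a = {"zk1", "zk2"}          # still running with the old membership list
side_b = {"zk3", "zk4", "zk5"}   # already running with the new list

# Inconsistent configuration: both sides believe they have a quorum.
print(believes_quorum(side_a, old_cfg), believes_quorum(side_b, new_cfg))

# Consistent configuration: only one side can hold a majority.
print(believes_quorum(side_a, new_cfg), believes_quorum(side_b, new_cfg))
```

In this model, two sides can each see a majority only while they disagree on the membership list, which is why keeping that window short and the inter-DC link stable matters during the move.
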
Ankita Sen

Currently: Senior Software Developer | Former: Hadoop Admin in Analytics | ERICSSON

4 years ago

Vishal Garg, how was the replication factor of data/volumes taken care of during the migration?
PRAVEEN KT

Product Area Architect - Analytics Platform(Big Data , AI/ML, Cloud AWS)

4 years ago

Good one
