Unlock you Intelligent Data Fabric on Power10 with Cloudera Data Platform 7.1.8!
Traditionally data fabric projects consists of three distinct phases, often using three different sets of applications from different vendors.
The challenge being that there are many different specialized data teams in an organisation having different requirements. This leads to a plethora of different tools fundamentally doing the same thing.
The move from static “Data Clusters” to elastic “Data Services” that are practitioner focussed rather than operator focussed is a journey. Including centralized control and customized environments, we can establish an Intelligent Data Fabric.
To support Data Services using an Intelligent Data Fabric your solution must support:
Here is where Cloudera Data Platform (CDP) Private Cloud Base on IBM Power10 and IBM Elastic Storage System whitepaper comes in play.
The IBM Power10 portfolio of servers enables flexible deployment options for running CDP Private Cloud Base. IBM recommends the IBM Power? S1022 and S1024 servers for CDP Private Cloud Base deployment.
The IBM Power servers provides performance, virtualization, reliability, availability and delivers twice the throughput of Intel? processor-based offerings and is highly economical for elastic data services deployments.
IBM PowerVM? allows for virtualizing the IBM Power Systems server without performance penalties traditionally associated with software-based hypervisors, and this is due to the enablement of single root I/O virtualization (SR-IOV) and dedicated I/O -virtualization options.
?To gain the benefits of the IBM Power and ESS technology stack, an elastic deployment topology is recommended.
领英推荐
?A new option with the Power S1024 server is the traditional data cluster deployment topology. This avoids the high-speed network and ESS for a MVP or non-scalable scenarios. The Power S1024 server can accommodate 16x6.4 TB NVMe persistent storage modules. Each NVMe persistent storage module is independently assigned to a VM, which leaves a large number of high-capacity modules available for hosting worker nodes in the same physical server together with master nodes and gateway nodes.
Using three Power S1024 servers, each DataNode holds a full data replica. Therefore, inter-server communication is not expected to require high bandwidth. This eliminates the 100 GbE network requirement and the ability to deploy the data network on regular 10/25 Gb ports provided in the data center.
This significantly lowers the barrier of adopting Power10 for Intelligent data fabric workloads.
Disaggregated compute or storage with IBM Elastic Storage? System (ESS) reduces the traditional HDFS 3-way data replication overhead of HFDS data by up to 85% using IBM Spectrum? Scale Native RAID.?
Therefore, the IBM Power and ESS elastic deployment topology solution provides both performance advantages as well as potential cost savings from the reduced raw data requirements and simplified DR scenarios.
Especialista en Sistemas en Banco Mercantil | IBM/EMC/PureStorage SAN switch skills, AIX and RHEL or SUSE Linux installation
9 个月On the way .....