Data Lake - Why ?? What & How ??
Sandeep Patel Author, VC networker, Google AI researcher
Principal Data Science,AI and Analytics Architect leader at Google- Data AI-ML advocate
What pushes customers for Data lake today . Majority of CDO expressed their concerns . Thought to share for benefit of all. Few pointers which i came during my engagements with people who matter :
Data lake creation with legacy sources built on traditional application tier concepts.Willingness to make to DR and workaround Hadoop file systems limitations.
Companies faces problems with Interoperate ability and willingness to move to CPA ( Common Hadoop Platform Architecture).As business faces problems with Multiple DB’s , different genre of Platforms .Issues with SLA , performance guarantee and compliance needs become a real business pain. DWO Offload to No –SQL and Hadoop Service Orchestration are no more buzz words.It is the need of the hour.
Some CIO's and CTO's explained the night-mares on creating or attempting a Big Data Project.
Dark Data - with no common Data model is one heck of challenge to manage.
CPE - Customer Centric Platform ( Application owners ) who needs to use data extensively. UX- CX interfaces to build on centralized Data lake.Companies are missing the entire Data realms, Data definitions ,contextualization logic & capability to build a common data model.
World of unstructured : Leave the Buzz created by Data vendors :).Companies face the inability to address unstructured data points, Data silos and data quality requirements.Adding to the laundry list is the Compliance, Growth & Inability to manage.
How do we do it ?? Let me explain the suspense in the next article :).. Good things are worth a wait..Stay tuned .. Keep reading.