Questions from Webinar – Designing a Knowledge Repository

Many of you attended our webinar, Big Data Analytics: Designing the Knowledge Repository (https://www.brillio.com/1115/designing-the-knowledge-repository-webinar), but we ran out of time to answer all of your questions during the session. So we wanted to take the time to answer some of those questions in this blog post.

Q: How does the security governance for the data layer work within this framework?

A: Overall, security governance is part of the data governance function within the data management layer. However, physical/network security, as well as identity and access management, are handled by the foundation layer. Fine-grained security, encryption, and role-based access controls are implemented in the other layers.

While data security is often (and rightly) top of mind for most companies, a fine balance needs to be struck from a governance perspective. A platform that is too tightly controlled will result in data silos (exactly the type of thing we are trying to avoid) and will limit usability and access, which is detrimental to driving high-confidence, insights-based decision-making.
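As a rough illustration only (this is not part of the framework itself, and the role, dataset, and column names are hypothetical), a fine-grained, role-based access rule might look something like this sketch:

```python
# Hypothetical sketch of fine-grained, role-based access rules.
# Role names, datasets, and column names are illustrative only.
POLICIES = {
    "data_scientist": {
        "customer_transactions": {
            "allow_columns": ["txn_id", "amount", "region"],
            "mask_columns": ["customer_email"],  # masked rather than exposed
        },
    },
    "marketing_analyst": {
        "customer_transactions": {
            "allow_columns": ["region", "amount"],
            "mask_columns": [],
        },
    },
}

def authorized_columns(role: str, dataset: str) -> list[str]:
    """Return the columns a role may read from a dataset, or an empty list."""
    policy = POLICIES.get(role, {}).get(dataset)
    return policy["allow_columns"] if policy else []

print(authorized_columns("marketing_analyst", "customer_transactions"))
# ['region', 'amount']
```

The point of keeping the rules this granular, rather than locking whole datasets away, is exactly the balance described above: access is constrained at the column or attribute level without walling off the data entirely.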

Q: Why do you say the discovery layer should be temporary?

A: The discovery layer itself should not be temporary; rather, we are saying that the sandbox environment, which allows organizations to hypothesize, build, and test, should be temporary.

This sandbox environment resides within the discovery layer and is typically provisioned for a period of 4 to 12 weeks, during which time data scientists are able to prove or disprove their ideas. After that point, the sandbox is reclaimed for another rapid experimentation cycle.
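To make the time-boxing concrete, here is a minimal sketch (assuming a simple in-memory registry; the class and helper names are hypothetical, not part of the framework) of how a sandbox could carry an expiry window and be reclaimed once the experiment period has passed:

```python
from dataclasses import dataclass
from datetime import date, timedelta

@dataclass
class Sandbox:
    owner: str
    purpose: str
    created: date
    weeks: int  # typically 4 to 12

    @property
    def expires(self) -> date:
        return self.created + timedelta(weeks=self.weeks)

def still_active(sandboxes: list[Sandbox], today: date) -> list[Sandbox]:
    """Return only the sandboxes still inside their experiment window;
    the rest are reclaimed for the next experimentation cycle."""
    return [s for s in sandboxes if s.expires > today]

registry = [Sandbox("ana", "churn hypothesis", date(2016, 1, 4), weeks=8)]
print(still_active(registry, today=date(2016, 4, 1)))  # [] once the window has passed
```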

Q: With some classes of data, I would imagine there is a "completeness" factor that needs to be tracked, especially if a proven experiment gets "mechanized" into the Enterprise Knowledge Layer. Any best practices for establishing sound "data completeness" controls?

A: Good question. This makes me think about how we need to move from the traditional way of handling data to the new way of working with data. In the past, the core focus of data teams was to bring complete, clean data into the platform. But in today's world all data is important, even data that is incomplete, as it may contain valuable insights. This is where metadata comes in: organizations can tag relevant metadata, bring that data in, and use it for various analyses. This allows us to find wisdom in the entire data set.
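As an illustration only (the field names and scoring approach are assumptions, not part of the framework), incoming data could be tagged with a completeness score per field rather than rejected, so analysts can decide later whether it is fit for their purpose:

```python
def completeness_tags(records: list[dict], required_fields: list[str]) -> dict:
    """Compute a simple per-field completeness score and return it as metadata."""
    total = len(records) or 1
    scores = {
        field: sum(1 for r in records if r.get(field) not in (None, "")) / total
        for field in required_fields
    }
    return {"completeness": scores, "record_count": len(records)}

batch = [{"id": 1, "email": "a@x.com"}, {"id": 2, "email": ""}]
print(completeness_tags(batch, ["id", "email"]))
# {'completeness': {'id': 1.0, 'email': 0.5}, 'record_count': 2}
```

A control like this becomes most valuable at the point a proven experiment is "mechanized": the completeness thresholds the experiment tolerated can then be checked automatically on every new load.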

Q: You mention that data analysts and data scientists trying to set up an experiment can spend a large amount of their time finding, collecting, and preparing the data. Seems to me like automation is key here. How much automation do you bring – after all, you need some human insight? And where do you bring in automation?

A: Automation in the data platform context is the ability to automatically capture metadata at all data lifecycle stages (ingestion, preparation, consolidation, wrangling, etc.). Equally important is exposing that metadata through a searchable data catalogue user interface. Adding a few social features to the data catalogue brings in the human insight factor, allowing analysts to comment on their experience with the underlying data. Together, these allow for automation while also adding a human element and providing additional "context" that can improve the usability of data for future analysis.
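A minimal sketch of what such a catalogue entry might hold, combining automatically captured metadata with analyst comments (the class and field names here are assumptions for illustration, not a reference to any specific catalogue product):

```python
from dataclasses import dataclass, field
from datetime import datetime

@dataclass
class CatalogEntry:
    dataset: str
    source: str
    ingested_at: datetime                               # captured automatically at ingestion
    lineage: list[str] = field(default_factory=list)    # preparation / wrangling steps, also automated
    comments: list[str] = field(default_factory=list)   # the human, social layer

entry = CatalogEntry("web_clickstream", "cdn_logs", datetime(2016, 3, 1, 8, 0))
entry.lineage.append("deduplicated by session_id")
entry.comments.append("Timestamps before 2015 are unreliable -- ana")
```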

Q: How long does it take to deploy this type of platform or deploy this type of architecture?

A: With the right expertise and a head start from prebuilt accelerators, which we have developed for numerous types of analysis and for numerous industry-specific functions, this type of platform can be deployed in 8 to 12 weeks.

Q: The elements or blocks you describe look very linear. Is this correct, or are some elements done simultaneously?

A: Yes, you are right that it looks linear; the goal here was to abstract the complexity and present it as a conceptual framework. In reality, while deploying the architecture, multiple components are implemented together or in parallel. This allows analysts and data scientists to adjust information and data as needed.

Q: Is this type of structure only useful for companies involved in a rapid experimentation approach to big data?

A: This type of structure can be used by any organization, whether they are following an ROI-based data project approach or a rapid experimentation approach. In our view, the current data world is complex and getting more complicated. Oftentimes we wander mostly in the 'unknown unknown' space, and the rapid experimentation approach helps companies move forward on the path to becoming more data driven. So while we are a little biased toward the rapid experimentation approach, this architecture has very broad applicability.

Got more questions on developing a robust data platform that can act as your organization’s knowledge repository, and how you can implement it in your own organization? If so, we’d love to hear from you. Contact us to talk specifics or explore this idea further.
