ArchiMate suited for emerging Data centric processes?
Dr Nicolas Figay, HDR
Let's prepare and build the continuous operational Interoperability supporting end to end digital collaboration
Introduction
ArchiMate not suited for data?The question arises about Archimate (standard enterprise architectural description visual modeling language) being suited for data, when considering emerging data related practices and process related to data quality, data exchange, data sharing, data archiving, data governance or IA.
This is particularly relevant when considering ArchiMate comes only with 4 constructs for representing passive objects of the data viewpoint: business object, data object, artefact and representation?
I attached a figure depicting some emerging processes related to data governance and processes it aims at supporting for a certain trend.
The recent DCAT W3C specification for data catalogs results from this trend, which doesn't necessarily respond to the needs of other data related frameworks, such as the ones related to manufacturing data exchange, sharing and long term retention required for Product Lifecycle Management (I'm thinking about ISO standards such as ISO STEP for models of products and associated processes).
I also don't retrieve all what is related to data warehoused related to data analytics dedicated to decision making and business intelligence.
Indeed, "data" concept is overloaded and comes with some fuzziness since the very beginning of computer science, for which everything is data: a program is a set of data giving instructions for a machine in order to consume and produce data stored in separated datasets (non executable files in you computer). Haven't programs which are also datasets to be managed and govern as well than datasets which are not programs?
So much of the time I'm a little bit puzzled when considering the current trends, being for data governance or knowledge management. The proposed frameworks are disconnected from the enterprise context and they are addressing very narrowed needs without considering an holistic view of what is to be considered within an enterprise, and reconcile these approches with enterprise modeling.
This can't be done without coming back to some basics, with a consistent set of definitions clarifying what we are talking about, and then to "inject" in ArchiMate relying on the extension mechanism of the language as needed, in order to include in the overall picture evolving and heterogeneous needs related to "data" related processes and practices.
This is what I'm exploring in this article.
About DCAT
DCAT stands for Data Catalog Vocabulary. It is an RDF vocabulary designed to facilitate interoperability between data catalogs published on the web. RDF (Resource Description Framework is a standard model for data interchange on the web). DCAT provides a set of classes and properties to describe datasets in catalogs and facilitates the discovery and exchange of metadata about datasets. DCAT is widely used in the context of open data initiatives and helps improve the discoverability and accessibility of datasets published online.
How data catalog differs from data or business object
A data catalog, data, and business object are related concepts but serve different purposes within an organization's data management framework:
In summary, while data catalogs provide metadata about datasets to facilitate their discovery and management, data itself represents the raw information, and business objects represent the structured representations of real-world entities within a business domain. Each plays a distinct role in managing and utilizing data within an organization.
ArchiMate fuzziness
As stated in the introduction, it points out some fuzziness in ArchiMate. ArchiMate provides a set of constructs for modeling different aspects of an enterprise architecture, including data. While ArchiMate provides constructs such as Business Object, Data Object, and Artifact for representing data, the distinction between these constructs may not always be clear, especially when considering aspects like data catalogs, structured data with schemas, and data assets for decision making.
Let's break down these constructs and their potential ambiguities:
领英推荐
Addressing the fuzziness issue
To address the fuzziness and provide clearer representation of data-related concepts, organizations using ArchiMate may need to extend the language or provide additional guidance within their modeling practices. This could involve defining specific conventions or patterns for modeling data catalogs, structured data schemas, data assets, and consolidated data for decision making within the ArchiMate framework. These extensions would help ensure that data-related concepts are accurately represented and understood within the enterprise architecture models.
However, extending the language should not mean to change the specification of the language, providing new modeling constructs. Such an approach is not appropriate, due to quick evolution and versatility of data related practices. So the idea is to use the specialization mechanism of ArchiMate as needed and required by each enterprise, and to rely on the "Representation" construct of ArchiMate. The Representation construct in ArchiMate is indeed intended to capture and document various forms of representations, including those related to data. Using ArchiMate's specialization mechanism to refine the Representation construct rather than introducing entirely new concepts. This approach aligns well with ArchiMate's flexibility and extensibility features. Let's explore how this could work:
Overall, by leveraging ArchiMate's specialization mechanism to refine Representation elements and associating them with existing constructs, we can effectively capture and communicate detailed information about data-related artifacts within the enterprise architecture model. This enhances the clarity, accuracy, and usefulness of the model for stakeholders involved in data management and governance.
Some specification already addressing this need?
While ArchiMate itself provides a flexible framework for modeling enterprise architectures, specific extensions or profiles tailored to data management might not be readily available within the standard ArchiMate specifications. However, there are initiatives and standards outside of ArchiMate that address the need for modeling data-related aspects within enterprise architectures. Some of these include:
When addressing the need for modeling data-related aspects within enterprise architectures, it's essential to consider these external standards, frameworks, and best practices alongside ArchiMate. By integrating relevant concepts and models from these sources, organizations can create more comprehensive and effective enterprise architecture models that accurately capture the complexities of data management and governance.
Conclusion
We've delved into the potential enhancements needed for data modeling within the ArchiMate framework. Although ArchiMate offers constructs like Data Objects and Artifacts, there's a recognized need for more specialized representations, such as data catalogs and datasets, to capture the nuances of data management effectively.
The flexibility of ArchiMate allows for customization and specialization of existing constructs, such as Representation, to better accommodate these data-related concepts. By utilizing ArchiMate's adaptable nature, organizations can tailor their models to suit their specific data management requirements.
Moreover, it's essential to consider external standards and frameworks from the data management domain. Resources like the Data Management Body of Knowledge (DMBOK) and data governance frameworks provide valuable insights and best practices that can inform data modeling efforts within ArchiMate.
By integrating relevant concepts and models from external sources, organizations can develop more comprehensive enterprise architecture models. These models accurately reflect the intricacies of data management and governance within their organization, facilitating better decision-making and alignment with business objectives.
In my case, while dealing with digital end to end collaboration within the manufacturing domain for PLM, MBSE, Virtual Manufacturing and EA, the standards I've been considering are external open standards considered by my enterprise when dealing with Digital Design, Manufacturing and Support. With ArchiMateCG (c.f. many posts and articles publish on this topic), I've been exploring various way to decorate ArchiMate graphs with complementary data and metadata, but without relying on the proposed way to rely on "representation" construct at this stage. This is will part of my future explorations.
Let's follow...
And don't hesitate to react, comment - if you think the article brings value, don't hesitate to like and to share ;)