Let's learn about F.A.I.R data - Episode #6
Source: blog.cuyahogarecycles.org

Let's learn about F.A.I.R data - Episode #6

In the last article, we covered the Interoperability principles and this is all about the Reusability principles.

R1.1: (Meta)data are released with a clear and accessible data usage license

When data is findable, accessible and interoperable, are we allowed to use it for granted ? The answer could be yes and no. The classic example would be the Creative Commons which is a non-profit organisation that helps public to get access to creative work in a legal way. In a different note, if we are using a copyrighted material for our own purpose, then we are bound to face a legal implication on that. This forms the basis for the right to use/re-use any data or metadata and all FAIR enabled data should be accompanied with a proper usage license. The complexity of licensing arises whenever the data is expected to be exchanged at inter-organisation or inter-country level.

So, without proper license, a FAI data will never become FAIR !!

R1.2: (Meta)data are associated with detailed provenance

A week back, while cleaning the loft I found a bottle which was unlabelled but it was filled with some liquid and the bottle was not tampered. I did opened and smelled it to understand what could be it, but I wasn't sure when it bought ? who bought it ? what it is ? Do we have an expiry date to it ? What can we do with it ? With no hesitation, I throwed this away in the bin.

Just relating the same with the above principle, a data asset should have proper metadata to understand the provenance to answer some critical questions to enable data reusability. Who has generated it ? from where and when ? What is the purpose of this data asset ? How often it will be updated ? Who will maintain it ?

A FAIR enabled data needs a clear provenance and associated information to support and facilitate proper usage of the data asset.

Source: Dataedo

R1.3: (Meta)data meet domain-relevant community standards

The data and of course the metadata, needs to be stored in a structured, organized and standardized way to enable the impact of reusability. The formats of the data is one aspect, but it doesn't stop there, using common vocabulary, metadata stored in a standard format will also form the other dimensions of making data - a reusable asset.

Clubbing 100 excel spreadsheets is not the same as clubbing 50 excel spreadsheet, 10 access database object, 10 JSON files and 30 text files. Joining two datasets using a country attribute will fail if one uses alpha-2 code and the other on alpha-3 and finally, a extracted datetime reported on two dataset extraction differs but they have exported at the same time by two users in the system, this possibly could be due to time zone changes but unless it is referred to the data point, it causes confusion which emphasizes the need of enriched metadata information.

All the above mentioned aspect will help a FAI data becoming a FAIR one. With this article, the introduction and principles of FAIR has been discussed. The next episodes will be around specific topics on FAIR implementation and assessments.

Happy reading ! ????




要查看或添加评论,请登录

Ravichander R.的更多文章

  • Ep. 1: Turning Data into Business Value | By The Data Alchemist

    Ep. 1: Turning Data into Business Value | By The Data Alchemist

    Data is everywhere in almost every different form, but how much does it really add value to us and the business ?…

    10 条评论
  • Do you use heuristics to make Decisions ?

    Do you use heuristics to make Decisions ?

    In my recent exploration around informed search algorithms, the concept of "heuristic function" played a very critical…

    4 条评论
  • Are we learning AI the right way?

    Are we learning AI the right way?

    Most AI courses focus primarily on model types, training techniques, model selection, and hyperparameter tuning. These…

  • Plug-and-Play Analytics: The Power of Composable Architecture

    Plug-and-Play Analytics: The Power of Composable Architecture

    In this modern analytics world, the architecture of any application has to meet three critical aspects - "Agility"…

    4 条评论
  • Let's learn about F.A.I.R data - Episode #5

    Let's learn about F.A.I.R data - Episode #5

    In the last article, we covered the Accessibility principles with some real-time examples and this article is about…

  • Let's learn about F.A.I.R data - Episode #4

    Let's learn about F.A.I.R data - Episode #4

    This episode is all about the "A" in FAIR - Accessibility. If you are reading this as your first article, do refer the…

    2 条评论
  • Let's learn about F.A.I.R data - Episode #3

    Let's learn about F.A.I.R data - Episode #3

    The recent article covered the first principle in Findability, and this article will cover the remaining three…

    2 条评论
  • Let's learn about F.A.I.R data - Episode #2

    Let's learn about F.A.I.R data - Episode #2

    If you wonder what FAIR data is all about, feel free to check my introductory article on F.A.

  • Let's learn about F.A.I.R data - Episode #1

    Let's learn about F.A.I.R data - Episode #1

    Ever since I completed my fellowship with FAIRPlus Community, I always wanted to write a simplified but comprehensive…

    5 条评论
  • Small data can do a Big impact !

    Small data can do a Big impact !

    I'm no exception in getting carried away by the term "Big Data" over the past decade or so, made me or us to illuminate…

社区洞察

其他会员也浏览了