Let's learn about F.A.I.R data - Episode #1
Image Source: https://www.nlm.nih.gov

Let's learn about F.A.I.R data - Episode #1

Ever since I completed my fellowship with FAIRPlus Community, I always wanted to write a simplified but comprehensive coverage of F.A.I.R data and its principle. Never too late to start this, so starting a series of articles on FAIR data and this one will be the introduction to FAIR data.

Before we see What is it all about, let's dig into a critical question on "Why we need FAIR data ?"

The answer is simple - What we are going to do with a data that is not findable, accessible, interoperable or reusable ? Yes, FAIRification of data unlocks a whole bunch of benefits including reusability, improve data quality and more importantly generate quality insights out of the data which ultimately leads to better decision making.

Not all organisations or sectors are concerned about making their data FAIR but those in pharmaceutical sector have a critical need to work on this. Let's take one simple case - All pharmaceutical organisations do engaged with multiple Contract Research Organisations (CROs) to outsource some of its work in the research and development space. If we don't have a standard mechanism of capturing and integrate the data from external partners, it will be difficult to utilize them in an expected way - FAIR principles and standards will be a saviour here. Not just external data can be FAIRified, even the data internal to an organisation can be FAIR (We will see more on this as we travel in this learning journey).

Now we understand the need of FAIR data, let's look at "What is a FAIR data ?"

FAIR here stands for Findable, Accessible, Interoperable and Reusable.

Findability comes when we have enough metadata associated with the actual data. For example, if you want to search for a particular version of a document, the last modified datetime, comments etc would help you to get it right. So, adding metadata will enrich the findability aspect of a data asset.

Accessibility is something comes right after we find the data asset, this will address whether we can access the actual data that we found out using the metadata. Continuing the above example, after finding the right version of the document, do we have permission to access it ? Are we authorized to perform specific actions on it ? All this accounts to this accessibility aspect.

Interoperability helps to ensure the accessed data is in a state that can be integrated or consumed by different workflows or applications. Same example, after we get required access to use the data asset, do we have it in a format that is operable in the applications we have in our computer system ? Can it be transformed or saved in a different compatible format ?

Reusability supports the dataset with optimal reuse of data for possible consumption requirements. The above said file getting catalogued into a central repository with proper attributes and description, alongside clear usage licensing policies will help multiple consumers to use this asset for a varied purpose and this will stand as a single source of information.

Let's see a practical example to understand better. Let's google Rajinikanth ??

You would have got 6,23,00,000 hit results (that's the power of Thalaivar !!), but let us take our preferred wiki reference.

https://en.wikipedia.org/wiki/Rajinikanth

Here Rajinikanth is a unique identifier which helps us to find him easily in Wikipedia. F is done..

I could able to access the above link with no problems, so it is using open, free and shared internet protocol. Okay, let me try editing it..Oops, I'm unable to do it and it is asking for login credentials, so here comes the A part.

Wiki API Catalog( https://api.wikimedia.org/wiki/API_catalog ) is under construction but has already got REST APIs with which we can export the required data in a desired format. So, I is clear now.

The page itself is enriched with multiple hyperlinks to other wiki references and external references. There is a version history tab with details of author and contributors to this page. Now, R completes the FAIR criteria for this wiki article.

Image Source: OpIndia

Hope you have enjoyed reading this writing and you found it useful to know about FAIR data. Watch out for the next article on a detailed run through on each of the FAIR components.





Nagalakshmi Maheshkumar

IT Quality and Compliance lead|GxP Validation | Platform Qualification | Vendor Assessments| Change Management | Incident and problem Management | CAPA Management | SOx Compliance

1 å¹´

Simple and great example ??.. very useful

Karthikeyan Parasuraman

Snowflake Certified | DBT | AWS | Data and BI Architect - Power BI / MicroStrategy | Gen AI trained

1 å¹´

Great article to refer in a time when I am looking for a clarity in FAIR. Looking forward to deep dive on this topic. especially the examples are great way to understand the concepts better in this article

Ravit Jain

Founder & Host of "The Ravit Show" | LinkedIn Top Voice | Startups Advisor | Gartner Ambassador | Evangelist | Data & AI Community Builder | Influencer Marketing B2B | Marketing & Media | (Mumbai/San Francisco)

1 å¹´

Love this article, Ravichander! I am pretty sure you will be exploring many more interesting Data topics in the future!

Santosh Karthikeyan Viswanathan, Ph.D.

Data & AI Leader | AstraZeneca Global Ambassador | CXO Incubator | TEDx Speaker | Technology Advisor | Mentor | Author | Linkedin Top AI Voice | World Record Holder-Data & Analytics | ACDM APAC Chair | Board Member

1 å¹´

Nice article Ravi, thanks for taking Thalaivar in your example!

Arvind Sud

Data | Analytics | Architecture | Business | Digital | Leadership | Consultant | Advisor

1 å¹´

Great article Ravichander R.?looking forward to the next episode. Recommended reading??

要查看或添加评论,请登录

Ravichander R.的更多文章

  • Ep. 1: Turning Data into Business Value | By The Data Alchemist

    Ep. 1: Turning Data into Business Value | By The Data Alchemist

    Data is everywhere in almost every different form, but how much does it really add value to us and the business ?…

    10 条评论
  • Do you use heuristics to make Decisions ?

    Do you use heuristics to make Decisions ?

    In my recent exploration around informed search algorithms, the concept of "heuristic function" played a very critical…

    4 条评论
  • Are we learning AI the right way?

    Are we learning AI the right way?

    Most AI courses focus primarily on model types, training techniques, model selection, and hyperparameter tuning. These…

  • Plug-and-Play Analytics: The Power of Composable Architecture

    Plug-and-Play Analytics: The Power of Composable Architecture

    In this modern analytics world, the architecture of any application has to meet three critical aspects - "Agility"…

    4 条评论
  • Let's learn about F.A.I.R data - Episode #6

    Let's learn about F.A.I.R data - Episode #6

    In the last article, we covered the Interoperability principles and this is all about the Reusability principles. R1.

  • Let's learn about F.A.I.R data - Episode #5

    Let's learn about F.A.I.R data - Episode #5

    In the last article, we covered the Accessibility principles with some real-time examples and this article is about…

  • Let's learn about F.A.I.R data - Episode #4

    Let's learn about F.A.I.R data - Episode #4

    This episode is all about the "A" in FAIR - Accessibility. If you are reading this as your first article, do refer the…

    2 条评论
  • Let's learn about F.A.I.R data - Episode #3

    Let's learn about F.A.I.R data - Episode #3

    The recent article covered the first principle in Findability, and this article will cover the remaining three…

    2 条评论
  • Let's learn about F.A.I.R data - Episode #2

    Let's learn about F.A.I.R data - Episode #2

    If you wonder what FAIR data is all about, feel free to check my introductory article on F.A.

  • Small data can do a Big impact !

    Small data can do a Big impact !

    I'm no exception in getting carried away by the term "Big Data" over the past decade or so, made me or us to illuminate…

社区洞察

其他会员也浏览了