Why chatGPT/LLM should have unlearning capability like human has..
https://www.moveleadership.com/blog/2018/4/14/unlearning-everything-you-know

Why chatGPT/LLM should have unlearning capability like human has..

Executive Summary

Do you know, chatGPT/LLM has this open problem to solve. This problem(unlearn) has potential to negate the power of chatGPT.

  1. The article starts with Why it is important
  2. Then it talks about current state.
  3. Article finishes telling about opportunities for innovation and the mitigation suggestion

Why?

Look at below pic. This is corrections published by a renowned news paper. Think, If chatGPT has been trained on 25th data, then it acquired an incorrect knowledge. Can chatGPT forgets wrong info based on below correction advertisement?

No alt text provided for this image
https://www.nytimes.com/international/section/corrections

Some of these corrections could be quite important . Look at the below correction related to a crime.

No alt text provided for this image
https://www.poynter.org/fact-checking/2017/not-fake-news-just-plain-wrong-top-media-corrections-of-2017/

So, the bigger question is: How can chatGPT/LLM correct itself?

Additional case

Look at the below example. James Gunn?was fired from "Guardians of the Galaxy 3"?by Disney after his offensive tweets resurface. He was fired in 2018, for tweets that were written between 2008 and 2011.


No alt text provided for this image

Look at James comment. He mentioned that he has changed himself and like his new avatar.

“I used to make a lot of offensive jokes,” Gunn wrote. “I don’t anymore. I don’t blame my past self for this, but I like myself more and feel like a more full human being and creator today.?Love you to you all.”

So, the question is that should a person's content be hanging around forever by some LLM?

There is option in facebook to delete your content. What if LLM learns it and keep on using it even though it doesn't exist at source?


Current capability of chatGPT

I asked chatGPT about any instance where it learned wrong info and what was the measure taken. See the answer in the highlight

No alt text provided for this image

As a human being, we are expected to unlearn the incorrect knowledge. What about chatGPT?

The simplest approach would be to delete the data point from the training data and retrain the model. However, this is clearly expensive. For example, OpenAI spent an estimated cost of between 10 and 20 million dollars?to train?GPT-3. Thus, we need better and cheaper alternatives.


Mitigation

If chatGPT/LLM is trained cautiously, then it will be less likely to suffer this problem.

  1. Layered training of chatGPT. So, chatGPT will be feed with small subset of data at a time. Note that It will help LLM to remove specific layer. Note that this approach also has flaws. Curious chaps can read https://arxiv.org/abs/1912.03817!!
  2. The training data needs to be cleaned to avoid any such conditions which triggers unlearn

Opportunities

This challenge opens up window of opportunities as well

  1. Innovation: To give a solution which solves this problem for ever.. To start with, I suggest to follow the ongoing research papers/articles
  2. Build excellence is training LLM which has lesser and lesser of such issues

Thanks to these helping hands

https://towardsdatascience.com/machine-unlearning-the-duty-of-forgetting-3666e5b9f6e5

https://techcrunch.com/2018/07/20/james-gunn-fired-from-guardians-of-the-galaxy-3-after-offensive-tweets-resurface/

https://techcrunch.com/2018/07/20/james-gunn-fired-from-guardians-of-the-galaxy-3-after-offensive-tweets-resurface/

https://techcrunch.com/2018/07/20/james-gunn-fired-from-guardians-of-the-galaxy-3-after-offensive-tweets-resurface/

https://www.nytimes.com/international/section/corrections

https://www.poynter.org/fact-checking/2017/not-fake-news-just-plain-wrong-top-media-corrections-of-2017/

https://arxiv.org/pdf/1912.03817.pdf

https://arxiv.org/pdf/1912.03817.pdf

https://towardsdatascience.com/machine-unlearning-the-duty-of-forgetting-3666e5b9f6e5

chatGPT platform

Deepak Kumar

Propelling AI To Reinvent The Future ||Author|| 150+ Mentorship|| Leader || Innovator || Machine learning Specialist || Distributed architecture | IoT | Cloud Computing

1 年

#research #chatgpt4 #innovation #technologynews

回复

要查看或添加评论,请登录

Deepak Kumar的更多文章

  • Role of DBSCAN in machine learning

    Role of DBSCAN in machine learning

    Why to read this? Density-based spatial clustering of applications with noise (DBSCAN)is a well-known data clustering…

  • Choice between multithreading and multi-processing: When to use what

    Choice between multithreading and multi-processing: When to use what

    Introduction Single threaded and single process solution is normal practice. For example, if you open the text editor…

  • Artificial Narrow Intelligence

    Artificial Narrow Intelligence

    About ANI ANI stands for "Artificial Narrow Intelligence." ANI refers to artificial intelligence systems that are…

  • Federated learning and Vehicular IoT

    Federated learning and Vehicular IoT

    Definition Federated Learning is a machine learning paradigm that trains an algorithm across multiple decentralised…

  • An age old proven technique for image resizing

    An age old proven technique for image resizing

    Why to read? Anytime, was you curious to know how you are able to zoom small resolution picture to bigger size?…

    1 条评论
  • Stock Market Volatility Index

    Stock Market Volatility Index

    Why? Traders and investors use the VIX index as a tool to gauge market sentiment and assess risk levels. It can help…

  • The case for De-normalisation in Machine learning

    The case for De-normalisation in Machine learning

    Why? The need for inverse normalization arises when you want to interpret or use the normalized data in its original…

    1 条评论
  • Kubernetes complements Meta-verse

    Kubernetes complements Meta-verse

    Motivation The #metaverse is a virtual world or space that exists on the #internet . It's like a big interconnected…

    1 条评论
  • Which one offers better Security- OSS or Proprietary software

    Which one offers better Security- OSS or Proprietary software

    Motivation World is using so many OSS. Apache Kafka is a core part of our infrastructure at LinkedIn Redis is core part…

  • Gini index for ML (Performance measurement and many more..)

    Gini index for ML (Performance measurement and many more..)

    Motivation You have developed machine learning model. What is next? You definitely want to check its performance.

    1 条评论

社区洞察

其他会员也浏览了