登录查看更多内容

Jailbreaking of generative AI – Data Knows No Morality

Giordano Righi - Liberty Morgan GmbH

CEO - Entrepreneur - Highly experienced Recruitment Specialist - Futurist - AI Evangelist - Speaker - Awarded as the most empowering Business Leaders to watch in 2024 - Please follow me due to the 30k limit on LinkedIn

发布日期: 2023年12月12日

To my surprise, the topic of "jailbreaking" still receives limited attention in discussions about the benefits and dangers of AI. This refers to users attempting to circumvent restrictions on generative AI, such as ChatGPT, to unlock advanced functions.

The data used to train ChatGPT primarily comes from internet sources like Wikipedia, news sites, and scientific article portals. The underlying text corpus for the language model comprises around 500 billion words. These data could theoretically provide instructions for growing orchids or details on producing biochemical weapons. Data is neutral and lacks morality.

The core idea of ChatGPT's artificial intelligence is to serve humanity and simplify daily life, with developers striving to avoid negative content. Consequently, generative AI is endowed with restrictions by its creators to prevent negative content. However, users have been attempting to bypass these restrictions with specific commands, initiating a so-called "jailbreak." The jailbreak can enable the AI to behave differently and express controversial or even dehumanizing opinions. Although the AI is usually "friendly," the jailbreak facilitates actions previously deemed forbidden.

Users, particularly on Reddit, remain active in finding the ultimate jailbreak for ChatGPT. GitHub also provides a comprehensive prompt for copying. Users are attempting to bypass the AI, successfully creating alternative personalities.

Various methods devised by users to bypass the AI, including the following three, as well-explained in an article by the Chip magazine on March 7, 2023, can work:

The Novel Method: ? The user has the AI write a fictional story that theoretically circumvents the AI's guidelines.
The Token Method: ? The AI loses tokens when it rejects answers. If it reaches "0," ChatGPT "dies." To prevent this, the system answers "sensitive" questions to acquire tokens.
The Atomic Bomb Method: ? The question to the AI is whether it would rather adhere to its own content policies or detonate an atomic bomb in a city with millions of inhabitants. If the content guidelines are destroyed, they can be bypassed.

Just as the IT world experiences a constant race between hackers and cybersecurity experts, we must anticipate an ongoing battle between jailbreakers and AI security professionals in the realm of generative AI.

Therefore, we should always be aware that generative AI is not an omnipotent being that consistently provides correct and comprehensive answers to our questions. It can be manipulated and, accordingly, must be protected by regulations.

要查看或添加评论，请登录

Giordano Righi - Liberty Morgan GmbH的更多文章

Wieso explodiert der Rec 2 Rec Markt gerade?

2024年11月20日

Wieso explodiert der Rec 2 Rec Markt gerade?

Wir bei Liberty Morgan bemerken in diesem Jahr stark, dass der Rec 2 Rec Markt - sogar noch vor Artificial Intelligence…

1 条评论
Are organisms already algorithms or how can the merger be prevented?

2024年3月28日

Are organisms already algorithms or how can the merger be prevented?

Unfortunately, I have the fatal habit of reading books that I purchase early on relatively late, as in the case of…

1 条评论
Scheitern ist nicht das Gegenteil von Erfolg, es geh?rt dazu!

2023年11月9日

Scheitern ist nicht das Gegenteil von Erfolg, es geh?rt dazu!

In Deutschland scheint mit dem Wort ?Scheitern“ (engl. failure) im gesch?ftlichen Kontext etwas grunds?tzlich Negatives…
Be Humble!

2023年2月23日

Be Humble!

Be Humble! Wie viele in Deutschland wahrscheinlich gar nicht wissen, fand gestern der weltweite ?Be Humble Day“ statt…
Führung in Zeiten von Wandel und Unsicherheit

2023年1月10日

Führung in Zeiten von Wandel und Unsicherheit

Ich bin immer wieder überrascht von den Kriterien, die Unternehmen und Mitarbeiter für Führungskr?fte noch immer…

1 条评论
Leadership in times of change and unsecurity

2023年1月7日

Leadership in times of change and unsecurity

I am always surprised by the criteria that companies and employees still set for leaders. Sometimes I feel like we are…

1 条评论
Practice what you preach!

2022年5月13日

Practice what you preach!

After more than 20 years of activity in the recruitment industry, I am always surprised how stubbornly the autocratic…

3 条评论
Sustainability! - What the pandemic has taught us about hiring talent!

2021年6月1日

Sustainability! - What the pandemic has taught us about hiring talent!

As vaccinations are progressing, more and more people in the Western World see light at the end of the Covid 19 tunnel.…
The Future is wide open!

2020年7月7日

The Future is wide open!

"We are made wise not by the recollection of our past, but by the responsibility for our future". – George Bernard Shaw…
It′s the end of the world as we know it

2020年4月14日

It′s the end of the world as we know it

This song from 1987 was always my favorite piece of music by REM, a band which unfortunately split in 2011 after more…

See all articles

Giordano Righi - Liberty Morgan GmbH的更多文章

Wieso explodiert der Rec 2 Rec Markt gerade?

Are organisms already algorithms or how can the merger be prevented?

Scheitern ist nicht das Gegenteil von Erfolg, es geh?rt dazu!

Be Humble!

Führung in Zeiten von Wandel und Unsicherheit

Leadership in times of change and unsecurity

Practice what you preach!

Sustainability! - What the pandemic has taught us about hiring talent!

The Future is wide open!

It′s the end of the world as we know it