登录查看更多内容

The Double Edge of ChatGPT's New Memory Function

Nick Potkalitsky, PhD

AI Literacy Consultant, Instructor, Researcher

发布日期: 2024年5月9日

+ 关注

Evolving Capacities; Lingering Questions

Greetings, Dear Readers,

A few weeks ago, upon logging into my ChatGPT 4 account, I was greeted with a notification about a new feature: “memory” across chats. This piqued my curiosity immediately:

Could this be the next step toward AGI—an AI form of human working memory?

Or was it simply another example of the tech industry repurposing cognitive terminology for technological uses?

In search of clarity, I reached out to my Substack network, and numerous individuals, including the insightful Daniel Nest, came to my aid. Daniel gently reminded me that he had covered this update in his newsletter back in February. It seems I had missed that while engrossed in other pursuits.

Memory and Higher-Order Reasoning

During that time in February, I was deeply absorbed in a series of fascinating podcasts by Daniel Bashir at The Gradient. One particular episode set me on an exploratory path into Daniel Kahneman’s seminal work, Thinking, Fast and Slow, for the first time. I know, dear readers, don’t hold this gap in my reading against me for long. I hope to rectify this state of affairs in this article. In this pivotal podcast, Bashir interviews computer scientist Subbarao Kambhampati, discussing topics like planning, reasoning, and interpretability in the age of large language models (LLMs).

The Gradient

Subbarao Kambhampati: Planning, Reasoning, and Interpretability in the Age of LLMs

Together, they outline a prevailing view within the machine learning community: current LLMs are limited in performing higher-order reasoning tasks, partly due to their lack of working memory. Essentially, while our most sophisticated prompting techniques aim to replicate working memory, they notably fall short of fully compensating for this capacity, which is foundational to a wide array of cognitive, emotional, and intellectual processes.

Kahneman: System 1 vs. System 2 Thinking

Throughout their discussion, Bashir and Kambhampati frequently referenced Kahneman’s concepts of “System 1” and “System 2” thinking. While I initially grasped the gist of this distinction from their dialogue, my curiosity led me to delve into Kahneman’s own writings to fully appreciate the resonances of his ideas in their original context. Kahneman’s text is a wild ride in case you haven’t read it. Kahneman prides himself on creating scenarios that his readers can replicate to test out the validity of his ideas.

Source: Neurofied

Back to the theory: System 1 thinking, as Kahneman articulates, operates automatically and quickly, with little or no effort and no sense of voluntary control. It encompasses what we might consider our instinctual responses to stimuli, those immediate, often subconscious reactions that guide much of our daily decision-making. In contrast, System 2 thinking is deliberate, effortful, and orderly. It’s the mode we engage when faced with complex problems or decisions that require focus and analytical thought.

In the lively online discussions I had with others about Kahnman’s distinction and its application to AI, I found that the reception was mixed. Guy Wilson, one of my favorite subscribers, recommended Patrick House's 19 Ways of Looking at Consciousness, as an alternative way of thinking about thinking. My own intellectual journey had previously led me to Bruno Latour's 14 modes of existence, and we cannot forget Wallace Stevens' “13 Ways of Looking at a Blackbird.” Despite the temptation to expand modes or perspectives via multiplicity, I keep returning to the System 1 and System 2 distinction. In particular, I find it thinking compelling to the extent that it highlights working memory as a dynamic target or limit for future AI systems.

What Is Working Memory?

Working memory or long-term memory is a fundamental cognitive function, akin to a mental notepad that temporarily holds information necessary for tasks such as reasoning, comprehension, and learning. This capability is crucial for advanced human activities—it underpins abstract thinking, strategic planning, and complex problem-solving, which are central to innovation and sophisticated reasoning. Moreover, working memory is essential for language processing, aiding in the construction and understanding of complex sentences, and it plays a critical role in social interactions, enabling us to consider others' perspectives and respond appropriately in real-time.

Source: “Episodic, Procedural, and Semantic Memory”

More than just a functional tool, working memory is a core component of human consciousness that significantly enriches our interactions with the world and one another. This cognitive capacity is integral to System 2 thinking, which involves managing and manipulating multiple pieces of information to address complex issues or make informed decisions, embodying deliberate, analytical thought. In contrast, System 1 thinking operates on a more automatic, intuitive basis, largely circumventing the need for the short-term storage functions provided by working memory.

Is GPT Memory Function “Working Memory”?

Gini Dietrich 1 年前

The Mind of a Machine: A Unique Caricature…

Michael Browers 6 个月前

Speaking & Tweaking: A great new way to use ChatGPT

Dave Birss 1 年前

What role does the memory function play in ChatGPT's operations, particularly when compared to the concept of working memory in human cognition? According to Tiernan Ray in a recent exploration of ChatGPT’s memory capabilities, the function strives to emulate the human facility of working memory by retaining information from ongoing interactions. This feature enables the AI to leverage previous exchanges within a session to produce more contextually coherent and relevant responses. As Ray observes, “The memory function in ChatGPT is like a fine-tuning procedure,” yet he also notes the limitations: “Getting the results you want, however, can be frustrating at times.”

Similarly to how human working memory supports intricate cognitive tasks by handling multiple information streams, ChatGPT’s memory aims to augment interaction quality by preserving continuity. Nevertheless, unlike human working memory, which is dynamic and capable of sophisticated adjustments and processing, ChatGPT’s memory function is considerably more static and confined. It predominantly recalls stored data rather than dynamically manipulating or reassessing it in response to new inputs.

Ray poignantly highlights the shortcomings of this system, stating, "The management of the memory entries is primitive and needs more development.” This emphasizes a profound disparity in depth and adaptability between artificial and human cognitive mechanisms, illustrating that while ChatGPT's memory function marks a step towards mimicking human memory processes, it still falls short of the flexible and integrative capacity of the human mind.

Higher-Order Reasoning vs. Our Privacy

In response to my inquiry regarding ChatGPT's memory function, Rachel Harrisdirected me to a poignant commentary by Walter Haydock in the Deploy Securely Substack, which introduces a critical dimension to the discussion of AI memory: the implications for security, privacy, and legality. Haydock articulates a delicate paradox: while the advancement of AI towards higher-order reasoning necessitates the integration of working memory, this progression requires access to increasingly extensive segments of our data. This raises pivotal questions:

Are we prepared to sacrifice our privacy for the sake of AI's enhanced capabilities?
Can the potential gains in workflow efficiency ever justify a corresponding increase in our vulnerability regarding digital autonomy?

As I reflect on these concerns, I am about to conclude with Haydock’s enlightening post. What I find particularly invaluable in his analysis is twofold: firstly, his guidance on how to disable ChatGPT's memory function—a critical option for users at this crossroads—and secondly, his approach of informing readers without dictating a specific course of action. I invite your perspectives on this complex issue. How do you perceive OpenAI's initial foray into crafting working memory for AI? Please share your thoughts and potential responses in the comments below.

Nick Potkalitsky, Ph.D.

Mr. Haydock’s Definitive Post on ChatGPT Memory:

“RIP disable chat history (2023-2024). As of yesterday morning, OpenAI will retain all ChatGPT (except Team and Enterprise) prompts indefinitely. My question:

1?? Stop using ChatGPT

Some have suggested that exit is the best choice here, to make a point to OpenAI. I admire the principled nature of these suggestions, but it’s a bridge too far for me (and probably most people).

2?? Upgrade to Team or Enterprise so you can still disable chat history

I’ve done the former for StackAware, but that is $30/month/user (2 user minimum), which is a little pricey. Enterprise is $108,000/year ($60/month/user, 150 user minimum). That’s way out of reach for most companies.

3?? Use the mobile app

As of today, when using the latest version you could still disable chat history. This might be a legacy feature that will be eliminated with the next update, though.

4?? Use Temporary Chats if and when they become available

Shortly before making this change, OpenAI published a support article describing a feature called “Temporary Chat.” This appears to reproduce the functionality of disabling chat history, but with a major caveat:

“We are rolling out to a small portion of ChatGPT free and Plus users this week to learn how useful it is. We will share plans for a broader roll out soon.”

5?? Use the API

This still has a 30 day retention period. Using it will be more challenging for non-technical users, and maintaining conversation history requires a few more steps.

It also changes the billing model from “all you can eat” for a fixed fee to a metered approach.

6?? Use the Playground

This is a little more user friendly and allows for continuous conversations, including with fine-tuned models and assistants.

I also got OpenAI to confirm that the Playground follows the API retention period (30 days).

?? Bottom line: this is a major change (for the worse, in my opinion) of OpenAI’s privacy and security posture. Adapt accordingly.

Why do you think they did this?”

The Pragmatic AI Educator

1,186 位关注者

Shravan Kumar Chitimilla

Information Technology Manager | I help Client's Solve Their Problems & Save $$$$ by Providing Solutions Through Technology & Automation.

5 个月

Hey, that sounds fascinating! The intersection of AI and human cognition is mind-blowing. Can't wait to dive into your newsletter Nick Potkalitsky, PhD

1 次回应

Walter Haydock

I help AI-powered companies get ISO 42001 certified to manage cybersecurity, compliance, and privacy risk | NIST AI RMF and EU AI Act expert | Harvard MBA | Marine veteran

5 个月

Thanks for citing me! As usual, things have moved quickly in this space. Here's the latest on data retention: https://www.dhirubhai.net/posts/walter-haydock_disable-chatgpt-history-is-dead-long-live-activity-7191526797174280192-HMcm?utm_source=share&utm_medium=member_desktop

3 次回应

查看更多评论

要查看或添加评论，请登录

查看全部

The Double Edge of ChatGPT's New Memory Function

Nick Potkalitsky, PhD

AI Literacy Consultant, Instructor, Researcher

Evolving Capacities; Lingering Questions

Memory and Higher-Order Reasoning

Kahneman: System 1 vs. System 2 Thinking

What Is Working Memory?

Is GPT Memory Function “Working Memory”?

领英推荐

Higher-Order Reasoning vs. Our Privacy

Mr. Haydock’s Definitive Post on ChatGPT Memory:

The Pragmatic AI Educator

1,186 位关注者

更多精彩文章

社区洞察

其他会员也浏览了

ChatGPT Makes Us Human

ChatGPT: The good, the bad and the future

ChatGPT 101 (what to know in proposals) & Never do this in a Q&A

The Quest for Conversational Gold: I Discovered the BEST ChatGPT Prompt Formula (It's Not What You Think!)

Uses of ChatGPT in Daily Life

Maximize Your AI Potential with These 15 ChatGPT-4o Prompts

ChatGPT Won’t Replace You

The ChatGPT Observer

AI Personas: Exploring the Positive and the Pernicious

Google’s ChatGPT Killer Gemini is Live – But Fake. Should You Use It? // Future Work Digest #65

Evolving Capacities; Lingering Questions

Memory and Higher-Order Reasoning

Kahneman: System 1 vs. System 2 Thinking

What Is Working Memory?

Is GPT Memory Function “Working Memory”?

领英推荐

Higher-Order Reasoning vs. Our Privacy

Mr. Haydock’s Definitive Post on ChatGPT Memory:

The Pragmatic AI Educator

1,186 位关注者

Exploring Authorial Voice in the Age of AI

2024年10月9日

Does AI Homogenize Writing Toward Western Styles and Diminish Cultural Nuance?

2024年10月2日

From Calcification to Creativity: Expert Prompting for the AI-Infused Classroom

2024年9月26日

"Embracing AI in Education: One School’s Journey," Guest Post by Brenda Brusegard

2024年9月19日

Is AI Harmful or Helpful for Students? The Tale of Two Studies

2024年9月11日

When Students Trust You with Their AI Secrets, Real Learning Begins

2024年9月5日

UPGRADING the Letter Grading System in Middle and High School (Guest Post by Terry Underwood)

2024年8月28日

Teachers Need Time, Space, and Support to Harness AI's Potential

2024年8月22日

Does Gen AI's Have an Implicit Writing Pedagogy?

2024年8月14日

AI Is Still Searching for Its Voice: New Concepts and Trainings

2024年8月7日

社区洞察

其他会员也浏览了

ChatGPT Makes Us Human

ChatGPT: The good, the bad and the future

ChatGPT 101 (what to know in proposals) & Never do this in a Q&A

The Quest for Conversational Gold: I Discovered the BEST ChatGPT Prompt Formula (It's Not What You Think!)

Uses of ChatGPT in Daily Life

Maximize Your AI Potential with These 15 ChatGPT-4o Prompts

ChatGPT Won’t Replace You

The ChatGPT Observer

AI Personas: Exploring the Positive and the Pernicious

Google’s ChatGPT Killer Gemini is Live – But Fake. Should You Use It? // Future Work Digest #65