Colab Notebooks: Neural Language Processing and GPT-2
Vladimir Alexeev
Author, researcher, artist, speaker, AI consultant (generative AI). Digital Experience Specialist @ DB Schenker. OpenAI Community Ambassador. Digital Resident. I explore creative collaboration between human and machine.
GPT-2.
This language model, released by OpenAI in 2019, was trained on 40 GB of text from various sources. There are several GPT-2 Colab notebooks, and they work in a similar way: you enter the beginning of a sentence, and GPT-2 continues it (or you ask questions about a provided text). The transformer-based model relies on self-attention, weighing how relevant every other token in the context is to the current one, which lets it generate coherent stories instead of gibberish.
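To illustrate the core idea behind self-attention, here is a minimal sketch (single head, no learned Q/K/V projections, no causal mask; a real GPT-2 block has all three):

```python
import numpy as np

def self_attention(X):
    """Scaled dot-product self-attention over a sequence of token embeddings.

    X: (seq_len, d_model) array, one embedding per token.
    Each output row is a weighted mix of all input rows, where the weights
    come from pairwise similarity between tokens.
    """
    d = X.shape[-1]
    scores = (X @ X.T) / np.sqrt(d)                 # token-to-token similarity
    scores -= scores.max(axis=-1, keepdims=True)    # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over each row
    return weights @ X

tokens = np.random.randn(5, 8)          # 5 tokens, 8-dimensional embeddings
print(self_attention(tokens).shape)     # (5, 8)
```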
I prefer two GPT-2 notebooks:
Max Woolf’s Notebook allows you:
- to generate various texts with GPT-2
- to fine-tune GPT-2 on your own texts (up to the 355M model); a condensed code sketch follows below
I fine-tuned it on texts in three languages:
- English (on “Alice in Wonderland”)
- German (on “Faust I” by Goethe)
- Russian (on early poetry by Pushkin)
As you can see, it works to some degree for all languages, even though GPT-2 was trained on English sources. For other languages you would need further fine-tuning and language-specific resources, but this proof of concept was convincing for me. It came with some interesting observations:
- The longer I trained the German model on Faust, the closer the generated texts came to the original. The reason is probably the small dataset (a single text), which makes the model overfit. If you want to train on your own texts, provide a larger corpus.
- The Russian texts are not really comprehensible, but you can nevertheless recognize the style and even the form of Pushkin's poetry. And the coinages and neologisms are perfect; any literary avant-gardist would be proud of such inventions.
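For reference, here is a condensed sketch of the fine-tuning workflow from the notebook, using the gpt-2-simple library it is built on. The file name "alice.txt" is a placeholder for your own training text, and the parameter values are assumptions, not the notebook's defaults:

```python
# Condensed fine-tuning workflow with gpt-2-simple (pip install gpt-2-simple).
import gpt_2_simple as gpt2

gpt2.download_gpt2(model_name="355M")   # largest size the notebook fine-tunes

sess = gpt2.start_tf_sess()
gpt2.finetune(sess,
              dataset="alice.txt",      # placeholder: your plain-text corpus
              model_name="355M",
              steps=1000)               # more steps = tighter fit to the corpus

# Generate from the fine-tuned model, seeded with a prompt.
gpt2.generate(sess, prefix="Alice said", length=200)
```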
The “GPT-2 with Javascript Interface” notebook allows:
Text generation, nothing more, nothing less. But you can control the text length (which is a very relevant factor) and the sampling behavior:
- With Temperature and top_k you can adjust the randomness, repetitiveness, and “weirdness” of the text (see the sketch after this list).
- With Generate how much you can produce longer texts (I use a value of 1000).
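To make these knobs concrete, here is a minimal sketch of what temperature and top_k do to the model's next-token distribution (the names mirror the notebook's controls; the logits below are random stand-ins for real model output):

```python
import numpy as np

def sample_next_token(logits, temperature=1.0, top_k=40):
    """Pick the next token id from raw logits.

    temperature > 1 flattens the distribution (more random, "weirder" text);
    temperature < 1 sharpens it (safer, more repetitive text).
    top_k discards all but the k most likely tokens before sampling.
    """
    logits = np.asarray(logits, dtype=np.float64) / temperature
    if top_k > 0:
        kth_largest = np.sort(logits)[-top_k]
        logits[logits < kth_largest] = -np.inf   # drop everything below top k
    probs = np.exp(logits - logits.max())
    probs /= probs.sum()
    return np.random.choice(len(probs), p=probs)

fake_logits = np.random.randn(50257)             # GPT-2 vocabulary size
print(sample_next_token(fake_logits, temperature=0.7, top_k=40))
```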
Links:
- First OpenAI post about GPT-2
- GPT-2: 1.5B Release
- Max Woolf’s Blog
- Colab Notebook by Max Woolf
- GPT-2 with Javascript Interface
You can also use the web implementation of GPT-2 by Adam King, TalkToTransformer.com:
I asked this application about the meaning of life. The answer was very wise and mature.
Wisely, indeed! (Screenshot of TalkToTransformer.com by Merzmensch)
Read also:
- Lakrobuchi! (my trilogy about GPT-2)
Index of Series "Google Colab Notebook".
Full essay "12 Colab Notebooks that matter"