Email Spam Detection using Pre-Trained BERT Model : Part 1 - Introduction and Tokenization

Recently I have been looking into transformer-based machine learning models for natural language tasks. The field of NLP has changed tremendously in the last few years, and I have been fascinated by the new architectures and tools that are coming out. The transformer is one such architecture.

As the frameworks and tools for building transformer models keep evolving, documentation often becomes stale and blog posts are often confusing. For any one topic, you may find multiple approaches, which can confuse beginners.

So as I learn these models, I plan to document the steps for a few of the important tasks in the simplest way possible. This should help beginners like me pick up transformer models.

In this two-part series, I will discuss how to train a simple model for email spam classification using the pre-trained transformer model BERT. This is the first post in the series, where I will discuss transformer models and prepare our data. You can read all the posts in the series here.

Transformer Models

The transformer is a neural network architecture first introduced by Google in 2017. This architecture has proven extremely effective at learning various tasks. Some of the popular models based on the transformer architecture are BERT, DistilBERT, GPT-3, and ChatGPT.

You can read more about transformer models at the link below:

https://huggingface.co/course/chapter1/4

Pre-Trained Language Model and Transfer Learning

A pre-trained language model is a transformer model that has been trained on a large amount of language data for specific tasks.

The idea behind using a pre-trained model is that the model already has a good understanding of language, which we can borrow as-is for our NLP task, so we only need to train the part that is unique to our task. This is called transfer learning. You can read more about transfer learning at the link below:

https://huggingface.co/course/chapter1/4#transfer-learning
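As a minimal sketch of this idea using the Hugging Face transformers library: we load a pre-trained BERT checkpoint (here `bert-base-uncased`, a common choice; our series may use a different one) and attach a fresh two-label classification head for spam vs. not spam. Only that head starts untrained; the language understanding comes for free.

```python
from transformers import AutoTokenizer, AutoModelForSequenceClassification

# Load the pre-trained tokenizer and encoder weights.
# "bert-base-uncased" is an illustrative choice of checkpoint.
tokenizer = AutoTokenizer.from_pretrained("bert-base-uncased")
model = AutoModelForSequenceClassification.from_pretrained(
    "bert-base-uncased",
    num_labels=2,  # spam vs. not spam
)

# The pre-trained tokenizer handles sub-word splitting and the
# special [CLS]/[SEP] tokens for us.
inputs = tokenizer("Win a free prize now!", return_tensors="pt")
```

During fine-tuning, the pre-trained encoder weights are only lightly adjusted while the new classification head learns the spam task, which is why this needs far less data than training from scratch.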

Google Colab

Google Colab is a hosted Jupyter Python notebook environment with access to a GPU runtime. As transformer models perform much better on a GPU, we are going to use Google Colab for our examples. You can use the free community version by signing in with your Google credentials.
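A quick way to confirm the GPU runtime is active is the check below, assuming PyTorch (which comes pre-installed on Colab); it picks the GPU when available and falls back to the CPU otherwise.

```python
import torch

# Select the GPU if Colab has assigned one, otherwise the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")
print(f"Using device: {device}")
```

If this prints `cpu` on Colab, switch the runtime via Runtime → Change runtime type → GPU.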


