Text Classification with BERT and .Net
Transformer-based models are currently the state of the art for text classification and other natural-language machine learning tasks. One popular model type is BERT from Google - described in this paper. Google has released the weights of several pre-trained BERT models, which has enabled researchers and practitioners to easily build models that perform very well on a range of text/language tasks. I have read that BERT-based models are now an integral part of Google search.
Training language models is expensive and time-consuming. Training a 'base' BERT model from scratch can take several weeks, with a compute cost of around $500 on the Google cloud. The availability of pre-trained weights is therefore a huge boon for the rest of us: we can "fine-tune" BERT (or other language models) to perform new text/language tasks - often quite easily.
It is still a challenge to tightly integrate such models into the applications that need them. Often the model is deployed as a service, and the consuming application has to make remote calls to access its functionality. This gives rise to concerns around cost, latency, security, integration complexity and reliability.
What if we need to embed this functionality into the application itself? To address this need for .Net applications, I have constructed a notebook that outlines how a BERT model may be defined, trained for text classification and consumed from .Net. The code is pure .Net/F# (no Python needed).
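To make the idea concrete, here is a minimal sketch of what consuming such a classifier from F# could look like. It assumes the fine-tuned model has been exported to ONNX and is scored with the Microsoft.ML.OnnxRuntime package; the notebook itself may take a different route, and the file name, input names and `tokenize` helper below are placeholders.

```fsharp
// In a .Net Interactive notebook or .fsx script:
#r "nuget: Microsoft.ML.OnnxRuntime"

open System
open Microsoft.ML.OnnxRuntime
open Microsoft.ML.OnnxRuntime.Tensors

// Placeholder: a BERT-compatible WordPiece tokenizer that turns raw text into
// token ids (with [CLS]/[SEP] added). Any .Net implementation can be plugged in.
let tokenize (text: string) : int64[] =
    failwith "supply a BERT WordPiece tokenizer"

// Load a fine-tuned BERT classifier previously exported to ONNX
// ("bert-classifier.onnx" is an assumed file name).
let session = new InferenceSession("bert-classifier.onnx")

/// Score one piece of text and return the index of the highest-scoring class.
let classify (text: string) : int =
    let ids  = tokenize text
    let dims = [| 1; ids.Length |]                    // batch of 1, seq_len tokens

    let inputIds = DenseTensor<int64>(Memory<int64>(ids), ReadOnlySpan<int>(dims))
    let ones     = Array.create ids.Length 1L
    let attnMask = DenseTensor<int64>(Memory<int64>(ones), ReadOnlySpan<int>(dims))

    // Input names must match those used when the model was exported;
    // some exports also expect "token_type_ids".
    let inputs =
        [ NamedOnnxValue.CreateFromTensor("input_ids", inputIds)
          NamedOnnxValue.CreateFromTensor("attention_mask", attnMask) ]

    use results = session.Run(inputs)
    let logits  = (Seq.head results).AsTensor<float32>()   // shape [1; num_classes]

    [ 0 .. int logits.Length - 1 ]
    |> List.maxBy (fun i -> logits.GetValue i)
```

Whatever the exact scoring route, the tokenizer vocabulary and maximum sequence length must match those used when the model was fine-tuned.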
The code and notebook output can be viewed here. The code was developed using the new .Net Interactive notebooks now available in VS Code.