BERT as a service
Jayant Kumar
Principal ML Scientist at Adobe | Technical Advisor at Preffect | Multimodal AI | Large language models and Knowledge Graph applications
There are multiple ways to leverage the open-source BERT model in your NLP work, for example via Hugging Face Transformers or spaCy's transformer pipelines. I recently came across a post about running BERT as a service, and it was quite easy to set up too.
If your pipeline requires efficient extraction of BERT features, this simple setup may save you dev/test time.
Create a conda environment if needed (TensorFlow 1.15 requires Python 3.7 or earlier) and activate it:
conda create --name bert python=3.7
conda activate bert
Install TensorFlow (CPU or GPU build); the version must be before 2.0:
pip3 install tensorflow==1.15
or, for GPU:
pip3 install tensorflow-gpu==1.15
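To confirm the right version landed in the environment, a quick sanity check (it should print 1.15.x):
python3 -c "import tensorflow as tf; print(tf.__version__)"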
Install the bert-serving server and client packages:
pip3 install -U bert-serving-server bert-serving-client
Start the model server, pointing -model_dir at a downloaded pretrained BERT checkpoint (the pretrained models are available from the google-research/bert GitHub repository):
bert-serving-start -model_dir /path_to_the_model/ -num_worker=2
For example, I used the BERT-Large whole-word-masking model:
bert-serving-start -model_dir ../BERT_models/wwm_uncased_L-24_H-1024_A-16/ -num_worker=4
The -num_worker flag sets the number of workers, i.e., how many requests the server can process concurrently.
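If you don't already have that checkpoint locally, here is a minimal sketch of fetching it; the zip URL below is the whole-word-masking release listed in the google-research/bert README at the time of writing, so verify it is still current:
wget https://storage.googleapis.com/bert_models/2019_05_30/wwm_uncased_L-24_H-1024_A-16.zip
unzip wwm_uncased_L-24_H-1024_A-16.zip -d ../BERT_models/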
In Python 3, set up a client and start encoding sentences:
from bert_serving.client import BertClient

client = BertClient()
vector = client.encode(['lovely portrait'])
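As a quick end-to-end check, here is a minimal sketch, assuming the server above is running on localhost with its default ports, that encodes a few sentences and compares them by cosine similarity; encode returns a numpy array with one fixed-length row per sentence (1024 dimensions for this BERT-Large model), and the cosine helper below is just illustrative:

from bert_serving.client import BertClient
import numpy as np

client = BertClient()  # connects to localhost on the default ports

sentences = ['lovely portrait', 'beautiful painting', 'tax return deadline']
vectors = client.encode(sentences)  # shape (3, 1024) for this model
print(vectors.shape)

# Cosine similarity between the first sentence and the other two
def cosine(a, b):
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

print(cosine(vectors[0], vectors[1]))  # semantically close: higher score
print(cosine(vectors[0], vectors[2]))  # unrelated: lower score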
References:
- https://towardsdatascience.com/word-embedding-using-bert-in-python-dd5a86c00342
- https://github.com/hanxiao/bert-as-service