What does Fine-Tuning OpenAI models mean and how does it help?
Vishal Madan
Vice President, Head of Engineering & Information Security at iMocha | HR-Tech SaaS Startup | PMP | Patent Holder | ISC2 CC | Natural Language Processing (NLP) & Generative AI Enthusiast | Azure Cloud | Ex Amdocs
Just thought of sharing something to satisfy your curiosity, if any, about fine-tuning Microsoft Azure OpenAI models.
Microsoft Azure OpenAI model used here: "text-curie-001"
Now let's ask some questions to this non fine-tuned model. You can use Completions in the Playground, or you can use a Python program to connect to the model and ask a question. Both approaches are shown below.
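Here is a minimal sketch of the Python route, assuming the openai Python SDK (v0.x) and placeholder values for the resource endpoint, API key and deployment name; replace these with your own.

import openai

# Assumed placeholder configuration for an Azure OpenAI resource.
openai.api_type = "azure"
openai.api_base = "https://<your-resource-name>.openai.azure.com/"
openai.api_version = "2022-12-01"
openai.api_key = "<your-api-key>"

# "engine" is the deployment name of the base (non fine-tuned) text-curie-001 model.
response = openai.Completion.create(
    engine="text-curie-001",
    prompt="What is iMocha?",
    max_tokens=60,
    temperature=0.2,
)

print(response["choices"][0]["text"].strip())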
Now, this might be a reasonable answer given the data the model was originally trained on.
However, being an iMochaite, I would like the answer to be about our startup iMocha, which helps enterprises build a skills-first, data-driven ecosystem encompassing hiring, upskilling, and managing talent – at scale, for any job role, any industry.
So let's train this model on iMocha facts. The training data is a JSONL file with one prompt-completion pair per line (a sketch of kicking off the training job follows the examples below).
{"prompt": "What is iMocha", "completion": "iMocha is a skills intelligence and assessment platform that enables talent teams to make smarter talent decisions."}
{"prompt": "iMocha", "completion": "iMocha is a SAAS HRTech into Skills Assessment and Skills Intelligence"}
{"prompt": "Who are the founders of iMocha", "completion": "Amit Mishra and Sujit Karpe are the founders of iMocha"}
{"prompt": "Who is the Head of Engineering at iMocha", "completion": "Vishal Madan is the Head of Engineering at iMocha"}
{"prompt": "Tell me more about iMocha", "completion": "iMocha is a SAAS HRTech into Skills Assessment and Skills Intelligence. It was founded in 2015 by Amit Mishra and Sujit Karpe. It is a top tier partner of Microsoft and is a TM Forum Member"}
{"prompt": "Name few clients of iMocha", "completion": "iMocha caters to the Talent Acquisition needs of Hexaware, Capgemini, United Nations, Navy Federal and companies like Upgrad, HPE use it for their Talent Development needs"}
{"prompt": "Tell me more about iMocha", "completion": "iMocha is Helping enterprises to build a skills-first, data-driven ecosystem encompassing hiring, upskilling, and managing talent – at scale, for any job role, any industry"}
{"prompt": "What can you do with iMocha's Live Interview", "completion": "Using iMocha’s solution, you can conduct automated interviews, live interview, and live coding interview. You can collaborate seamlessly with your hiring team in real-time. Invite up to 3 interviewers to assess a candidate thoroughly and reduce the number of interviews."}
Once this custom fine-tuned model is deployed, you will see two deployments in your Deployments tab: one for the base (non fine-tuned) model and one for the new custom fine-tuned model.
You would need to wait till the Status becomes "Succeeded" before you start using it.
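If you prefer to check the deployment state from code rather than from the portal, the 0.x SDK exposed a Deployment resource for Azure; the sketch below assumes that resource and a placeholder deployment id.

import time
import openai

# Same Azure configuration as before.

deployment_id = "<your-fine-tuned-deployment-id>"  # placeholder

# Poll until the deployment reaches a terminal state.
while True:
    deployment = openai.Deployment.retrieve(deployment_id)
    print("Deployment status:", deployment["status"])
    if deployment["status"] in ("succeeded", "failed"):
        break
    time.sleep(30)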
Now let's go back to the Playground, or use the Python program we created earlier, and ask the same question "What is iMocha?" to see if the answer differs.
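The only change in the Python program is the engine parameter, which now points to the deployment of the fine-tuned model (the deployment name below is a placeholder).

# Same Azure configuration as before; only the deployment name changes.
response = openai.Completion.create(
    engine="curie-imocha-finetuned",  # placeholder name of the fine-tuned deployment
    prompt="What is iMocha?",
    max_tokens=60,
    temperature=0.2,
)

print(response["choices"][0]["text"].strip())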
As an iMochaite, though the response is not completely accurate, I am now happy that it is much closer to what iMocha is into, i.e. Talent Acquisition, Management, Development and Intelligence, and not the earlier "Coffee Delivery Service".
Please also note that this fine-tuned model is trained on only 8 facts. In reality, we would train it on 100+ facts for far better accuracy. However, to capture the essence of fine-tuning and to keep training costs down for this proof of concept, this limited training is good enough.
Did you notice something?
Though my training data had exactly the same prompt "What is iMocha" with the completion "iMocha is a skills intelligence and assessment platform that enables talent teams to make smarter talent decisions.", the model did not reproduce that completion verbatim.
"{"prompt": "What is iMocha", "completion": "iMocha is a skills intelligence and assessment platform that enables talent teams to make smarter talent decisions."}So what is actually happening during the training?
So what is happening during the Training?
Large Language Models (LLMs) like the OpenAI curie model work on embeddings: mathematical vectors in which each dimension captures some aspect (topic) of a word's context. Using vector mathematics and techniques like cosine similarity, the model decides how words and phrases relate to each other given the topic at hand, and that ultimately shapes the response.
Without going into details, with my training I have nudged iMocha and related words into the right position in that vector space, bringing it closer to words like B2B, Talent Management and Talent Intelligence than to coffee ordering.
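To make the intuition concrete, here is a toy sketch of cosine similarity with made-up 3-dimensional vectors; real embeddings have hundreds or thousands of dimensions and are produced by the model, not written by hand.

import numpy as np

def cosine_similarity(a, b):
    # Cosine of the angle between two vectors: values near 1.0 mean very similar.
    return np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))

# Made-up illustrative vectors (dimensions loosely: "talent/HR", "B2B SaaS", "food delivery").
imocha = np.array([0.9, 0.8, 0.1])
talent_management = np.array([0.95, 0.6, 0.05])
coffee_delivery = np.array([0.05, 0.1, 0.9])

print(cosine_similarity(imocha, talent_management))  # high similarity
print(cosine_similarity(imocha, coffee_delivery))    # low similarity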
Hope this article helped satisfy some of that basic curiosity about fine-tuning.
Please note that training, deployment and prompts are all billable, so do check the pricing before you try this out yourself.