My Experience Fine-Tuning a Model with InstructLab

Timothy Lam

Director of Strategic Business Development at Red Hat | CISA | CISM | CRISC | PMP | PMI-ACP | TOGAF |MACS (Snr) CP

Published: July 25, 2024

Introduction

InstructLab is an exciting open-source project that makes it easier for anyone to improve and customize large language models (LLMs), which are used in AI applications to generate human-like text.

Working with these advanced AI models usually requires a lot of specialized skills, high-quality data, and powerful computers, making it a challenging and expensive task. However, InstructLab, a joint effort by Red Hat and IBM, changes this by allowing people from all backgrounds to contribute to and enhance these AI models, regardless of their expertise in machine learning.

This initiative enables developers to add specific knowledge and skills to the AI models, tailoring them to suit their business or industry needs using their own data. InstructLab embodies the true spirit of open-source innovation, ensuring that the latest AI advancements are accessible and cost-effective for everyone.

Fine-Tuning AI Models for Health and Fitness Trainers

Recently, I used InstructLab to fine-tune a model for health and fitness trainers who work with older adults. The process let me add specific skills and knowledge to the model, which shows how flexible InstructLab is. Here’s a simplified overview of my experience:

Process Overview:

1. Data Preparation: Generated 100 samples on my laptop (NVIDIA GeForce RTX 2080, 8 GB) and split them into train and test sets at a 66/34 ratio.

I created several folders following the directory structure that InstructLab expects. The dataset is created by adding new skills or knowledge for the LLM being fine-tuned.

1.1 Create a new skill and knowledge for the model.

In the context of InstructLab, a skill is a capability domain submitted by a contributor intending to train the AI model on the submitted information. In other words, when you submit a skill, you teach the AI model how to do something.

InstructLab skills are broken down into two main categories:

Compositional skills. Compositional or performative skills allow AI models to perform specific tasks or functions. With InstructLab, there are two types of compositional skills:
Freeform compositional skills are performative skills that do not require additional context. For example, to train an AI model to write a poem, you would provide examples of poems.
Grounded compositional skills are performative skills that require additional context. One example is how an AI model reads the value of a cell in a table layout. To create the grounded skill to read a table formatted in Markdown, the additional context might be an example table layout.
Foundational skills. Foundational skills are skills like math, reasoning, and coding. Note: Foundational skills are not currently being accepted.
Skills are written in a YAML file and submitted to the InstructLab upstream project for review.
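To make the YAML format more concrete, here is a minimal Python sketch that renders a skill definition as YAML text. The field names (task_description, created_by, seed_examples, question, context, answer) are assumptions modelled on the InstructLab taxonomy layout, and the real schema may require additional fields, so check the upstream project before submitting anything. The example questions and answers are purely illustrative and are not taken from my dataset.

```python
import json


def render_skill_yaml(task_description, created_by, seed_examples):
    """Render a skill definition as YAML text.

    Field names are assumptions based on the InstructLab taxonomy layout;
    verify them against the current upstream schema.
    """
    # json.dumps emits double-quoted strings, which are also valid YAML scalars.
    lines = [
        f"task_description: {json.dumps(task_description)}",
        f"created_by: {json.dumps(created_by)}",
        "seed_examples:",
    ]
    for example in seed_examples:
        lines.append(f"  - question: {json.dumps(example['question'])}")
        # Grounded skills carry extra context; freeform skills omit it.
        if "context" in example:
            lines.append(f"    context: {json.dumps(example['context'])}")
        lines.append(f"    answer: {json.dumps(example['answer'])}")
    return "\n".join(lines) + "\n"


# Hypothetical seed examples for a fitness-trainer skill.
examples = [
    {
        "question": "Suggest a gentle warm-up for an older beginner.",
        "answer": "Start with five minutes of slow walking, then add shoulder rolls.",
    },
    {
        "question": "How many sessions are planned on Monday?",
        "context": "| Day | Sessions |\n| Monday | 2 |",
        "answer": "Two sessions are planned on Monday.",
    },
]

print(render_skill_yaml("Advise trainers who work with older adults", "example-user", examples))
```

The first example is freeform (no context), while the second is grounded, because the answer depends on the table supplied in the context field.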

1.2 Data Generation

To generate the dataset, I simply ran the command ilab data generate.

1.3 Data Validation

To validate the taxonomy changes behind the dataset, I ran the command ilab taxonomy diff, which lists the new or changed YAML files and checks that they are valid.
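If you prefer to script these steps, the following is a minimal Python sketch that wraps the two commands mentioned above. Only the command names come from this article; the helper run_step, the step labels, and the dry_run flag are hypothetical names introduced for illustration, and no extra ilab flags are assumed.

```python
import shutil
import subprocess

# The two commands named in the article, as argument lists.
ILAB_STEPS = {
    "generate": ["ilab", "data", "generate"],
    "validate": ["ilab", "taxonomy", "diff"],
}


def run_step(name, dry_run=True):
    """Run one ilab step, or only return its command line when dry_run is set."""
    command = ILAB_STEPS[name]
    if dry_run:
        return " ".join(command)
    if shutil.which(command[0]) is None:
        raise RuntimeError("ilab was not found on PATH")
    # check=True raises CalledProcessError if the command exits non-zero.
    subprocess.run(command, check=True)
    return " ".join(command)


if __name__ == "__main__":
    # Dry run by default: print the commands without executing them.
    for step in ("generate", "validate"):
        print(run_step(step))
```

Calling run_step("generate", dry_run=False) would execute the command for real, provided ilab is installed and initialized in the current environment.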

2. Unzipping the generated dataset: Uploaded and unzipped the dataset on Colab.

The generated dataset was stored in the generated dataset folder, then split and stored in the taxonomy_dataset folder.
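The 66/34 split can be sketched in a few lines of Python. This is an illustration rather than the exact procedure I used: the shuffling, the fixed seed, and the function name split_samples are assumptions, and the sample records are placeholders for the 100 generated samples.

```python
import random


def split_samples(samples, train_fraction=0.66, seed=0):
    """Shuffle samples reproducibly and split them into train and test lists."""
    shuffled = list(samples)
    # A seeded Random instance keeps the split identical across runs.
    random.Random(seed).shuffle(shuffled)
    cut = round(len(shuffled) * train_fraction)
    return shuffled[:cut], shuffled[cut:]


# Placeholder records standing in for the 100 generated samples.
samples = [{"id": i} for i in range(100)]
train, test = split_samples(samples)
print(len(train), len(test))  # prints "66 34"
```

Shuffling before the cut avoids a test set made up only of the last samples generated, which could all come from the same skill.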

3. Base Model Loading: Loaded the base model on Colab.

4. Training: Fine-tuned the model on an A100 GPU for 10-15 minutes, completing 35 iterations.

5. Monitoring: Adjusted and evaluated the model using the test dataset with promising results.

ilab workflow diagram: [image not available]

Conclusion

InstructLab makes it easy for anyone, including non-technical users, to contribute to AI by using a simple YAML format and a supportive community. This approach facilitates continuous model improvement and customization by incorporating diverse, high-quality data through an effective synthetic data generation and quality assurance process.

