Beyond Prompts: Fine-Tuning Your LLM
Hari Galla
WHY FINE-TUNING?
While both prompt engineering and fine-tuning aim to enhance the capabilities of large language models (LLMs), they tackle different challenges. Here's a breakdown of some key limitations addressed by fine-tuning but not by prompt engineering, along with illustrative examples:
Prompt Challenge 1: Knowledge Gap
Example: Imagine asking an LLM to diagnose an illness. A well-crafted prompt can guide it through symptoms, but without medical knowledge, the LLM might miss crucial details.
Solution: Fine-tuning exposes the LLM to a vast dataset of labeled medical cases, equipping it with the knowledge needed for accurate diagnoses.
Prompt Challenge 2: Limited Control
Example: You ask an LLM to write a persuasive essay. While a prompt can outline the arguments, the LLM may still struggle to maintain a coherent flow or address counter-arguments effectively.
Solution: Fine-tuning can train the LLM on specific reasoning patterns and argument structures, enabling it to construct logical arguments and build a compelling case.
Fine-Tuning LLMs for Real-World Tasks: A Step-by-Step Approach
Data Acquisition:
# The instruction dataset to use
dataset_name = "mlabonne/guanaco-llama2-1k"
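The line above only names the dataset; the dataset object handed to the trainer later still has to be loaded. A minimal sketch using the Hugging Face datasets library (assuming the train split of the instruction dataset named above):
# Load the instruction dataset from the Hugging Face hub
from datasets import load_dataset
dataset = load_dataset(dataset_name, split="train")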
Model Selection:
# The model that you want to train from the Hugging Face hub
model_name = "NousResearch/Llama-2-7b-chat-hf"
# Fine-tuned model name
new_model = "Llama-2-7b-chat-finetune"
Specify Fine-Tuning Parameters
################################################################################
# QLoRA parameters
################################################################################
# LoRA attention dimension
lora_r = 64
# Alpha parameter for LoRA scaling
lora_alpha = 16
# Dropout probability for LoRA layers
lora_dropout = 0.1
################################################################################
# bitsandbytes parameters
################################################################################
# Activate 4-bit precision base model loading
use_4bit = True
# Compute dtype for 4-bit base models
bnb_4bit_compute_dtype = "float16"
# Quantization type (fp4 or nf4)
bnb_4bit_quant_type = "nf4"
# Activate nested quantization for 4-bit base models (double quantization)
use_nested_quant = False
################################################################################
# SFT parameters
################################################################################
# Maximum sequence length to use
max_seq_length = None
# Pack multiple short examples in the same input sequence to increase efficiency
packing = False
# Load the entire model on GPU 0
device_map = {"": 0}
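The TrainingArguments call in the next section references a number of training hyperparameters (output_dir, learning_rate, and so on) that are not defined in the snippets above. The values below are one plausible starting point for a short QLoRA run on a single GPU; treat them as illustrative defaults rather than the author's original settings:
################################################################################
# TrainingArguments parameters (illustrative values - tune for your hardware)
################################################################################
# Output directory where checkpoints and logs are stored
output_dir = "./results"
# Number of training epochs
num_train_epochs = 1
# Enable fp16/bf16 training (set bf16 = True on Ampere or newer GPUs)
fp16 = False
bf16 = False
# Batch size per GPU for training
per_device_train_batch_size = 4
# Number of update steps to accumulate gradients for
gradient_accumulation_steps = 1
# Maximum gradient norm (gradient clipping)
max_grad_norm = 0.3
# Initial learning rate (AdamW optimizer)
learning_rate = 2e-4
# Weight decay applied to all layers except bias/LayerNorm weights
weight_decay = 0.001
# Optimizer to use
optim = "paged_adamw_32bit"
# Learning rate schedule
lr_scheduler_type = "cosine"
# Number of training steps (-1 means derive from num_train_epochs)
max_steps = -1
# Fraction of steps used for linear warmup
warmup_ratio = 0.03
# Group sequences of similar length into batches (saves memory, speeds up training)
group_by_length = True
# Save a checkpoint every X update steps
save_steps = 25
# Log training metrics every X update steps
logging_steps = 25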
Fine-Tuning Configuration
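Before configuring the tokenizer and LoRA adapters, the quantized base model itself has to be loaded - the model variable passed to SFTTrainer below is otherwise undefined. A minimal sketch using the bitsandbytes parameters from the previous section (it assumes the transformers, peft, trl, bitsandbytes and accelerate packages are installed, and also pulls in the imports used by the rest of this walkthrough):
import torch
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    BitsAndBytesConfig,
    TrainingArguments,
)
from peft import LoraConfig
from trl import SFTTrainer
# Build the 4-bit quantization config from the parameters defined earlier
compute_dtype = getattr(torch, bnb_4bit_compute_dtype)
bnb_config = BitsAndBytesConfig(
    load_in_4bit=use_4bit,
    bnb_4bit_quant_type=bnb_4bit_quant_type,
    bnb_4bit_compute_dtype=compute_dtype,
    bnb_4bit_use_double_quant=use_nested_quant,
)
# Load the base model in 4-bit precision on GPU 0
model = AutoModelForCausalLM.from_pretrained(
    model_name,
    quantization_config=bnb_config,
    device_map=device_map,
)
model.config.use_cache = False
model.config.pretraining_tp = 1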
# Load LLaMA tokenizer
tokenizer = AutoTokenizer.from_pretrained(model_name, trust_remote_code=True)
tokenizer.pad_token = tokenizer.eos_token
tokenizer.padding_side = "right" # Fix weird overflow issue with fp16 training
# Load LoRA configuration
peft_config = LoraConfig(
    lora_alpha=lora_alpha,
    lora_dropout=lora_dropout,
    r=lora_r,
    bias="none",
    task_type="CAUSAL_LM",
)
# Set training parameters
training_arguments = TrainingArguments(
    output_dir=output_dir,
    num_train_epochs=num_train_epochs,
    per_device_train_batch_size=per_device_train_batch_size,
    gradient_accumulation_steps=gradient_accumulation_steps,
    optim=optim,
    save_steps=save_steps,
    logging_steps=logging_steps,
    learning_rate=learning_rate,
    weight_decay=weight_decay,
    fp16=fp16,
    bf16=bf16,
    max_grad_norm=max_grad_norm,
    max_steps=max_steps,
    warmup_ratio=warmup_ratio,
    group_by_length=group_by_length,
    lr_scheduler_type=lr_scheduler_type,
    report_to="tensorboard",
)
# Set supervised fine-tuning parameters
trainer = SFTTrainer(
    model=model,
    train_dataset=dataset,
    peft_config=peft_config,
    dataset_text_field="text",
    max_seq_length=max_seq_length,
    tokenizer=tokenizer,
    args=training_arguments,
    packing=packing,
)
Model Training & Saving
# Train model
trainer.train()
# Save trained model
trainer.model.save_pretrained(new_model)
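Note that calling save_pretrained on a PEFT-wrapped model stores only the LoRA adapter weights, so the original base model is still needed to reload it. A quick sanity check with the transformers text-generation pipeline confirms the fine-tuned model responds in the expected format (the prompt below is purely illustrative):
from transformers import pipeline
# Generate a short completion with the fine-tuned model (Llama 2 chat prompt format)
prompt = "What is a large language model?"
pipe = pipeline(task="text-generation", model=model, tokenizer=tokenizer, max_length=200)
result = pipe(f"<s>[INST] {prompt} [/INST]")
print(result[0]["generated_text"])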
Conclusion: Remember, choosing the right technique depends on your needs. Prompting offers flexibility, while fine-tuning empowers the LLM with deeper knowledge and stronger control - like choosing the perfect tools for the job!