
Kohya brought massive improvements to FLUX LoRA (as low as 4 GB GPUs) and DreamBooth / Fine-Tuning (as low as 6 GB GPUs) training

Furkan Gözükara

PhD. Computer Engineer. Produces Content For FLUX, LoRA, Fine Tuning, Stable Diffusion, SDXL, Training, DreamBooth Training, Deep Fake, Voice Cloning, Text To Speech, Text To Image, Text To Video, Generative AI, LLMs

Published: November 17, 2024

You can download all configs and full instructions

> https://www.patreon.com/posts/112099700 - Fine Tuning post

> https://www.patreon.com/posts/110879657 - LoRA post

Kohya has brought massive improvements to FLUX LoRA training (minimum 4 GB GPU) and DreamBooth / Fine-Tuning training (minimum 6 GB GPU).

GPUs with as little as 4 GB of VRAM can now train a FLUX LoRA with decent quality, and GPUs with 24 GB or less got a huge speed boost for full DreamBooth / Fine-Tuning training.

You need a minimum of 4 GB of VRAM for FLUX LoRA training and a minimum of 6 GB for FLUX DreamBooth / Full Fine-Tuning training. It is just mind-blowing.

You can download all configs and full instructions > https://www.patreon.com/posts/112099700

The above post also has 1-click installers and downloaders for Windows, RunPod and Massed Compute

The model downloader scripts have also been updated, and downloading 30+ GB of models now takes 1 minute in total on Massed Compute.

You can read the recent updates here : https://github.com/kohya-ss/sd-scripts/tree/sd3?tab=readme-ov-file#recent-updates

This is the Kohya GUI branch : https://github.com/bmaltais/kohya_ss/tree/sd3-flux.1

The key to reducing VRAM usage is block swapping.

Kohya implemented OneTrainer's logic to significantly improve block swapping speed, and block swapping is now supported for LoRA training as well.
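To make the idea of block swapping concrete, here is a minimal toy sketch in plain Python. It is not Kohya's or OneTrainer's actual code: the function name `simulate_forward`, the block sizes, and the simple oldest-first eviction rule are all assumptions made for illustration. It only tracks how much block weight sits on a simulated GPU while the blocks run one after another.

```python
from collections import deque


def simulate_forward(block_sizes_gb, max_resident):
    """Run blocks in order while keeping at most `max_resident` on the GPU.

    Returns (peak_gb, transfers): the largest amount of block weights that
    was resident at once, and the number of CPU-to-GPU copies performed.
    This is a hypothetical model of block swapping, not a real trainer.
    """
    if max_resident < 1:
        raise ValueError("max_resident must be at least 1")

    resident = deque()  # indices of blocks currently on the GPU, oldest first
    resident_gb = 0.0
    peak_gb = 0.0
    transfers = 0

    for index, size_gb in enumerate(block_sizes_gb):
        # Evict the oldest blocks back to CPU RAM until there is a free slot.
        while len(resident) >= max_resident:
            evicted = resident.popleft()
            resident_gb -= block_sizes_gb[evicted]

        # Copy the next block to the GPU (counted as one transfer).
        resident.append(index)
        resident_gb += size_gb
        transfers += 1
        peak_gb = max(peak_gb, resident_gb)
        # The block's forward computation would run here.

    return peak_gb, transfers


if __name__ == "__main__":
    sizes = [0.5] * 8  # assumption: eight equal blocks of 0.5 GB each
    print("no swapping :", simulate_forward(sizes, max_resident=8))
    print("2 resident  :", simulate_forward(sizes, max_resident=2))
```

In this toy model the peak drops from the full model size to just the size of the blocks kept resident, at the cost of moving weights between CPU and GPU. A real implementation would also need to overlap those copies with computation to stay fast, which is the part the speed improvement is about.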

You can now do FP16 LoRA training on GPUs with 24 GB or less.

You can now train a FLUX LoRA on a 4 GB GPU. The key is FP8, block swapping, and training only certain layers (remember single-layer LoRA training).
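A rough back-of-the-envelope calculation shows why precision matters so much here. The sketch below assumes the FLUX transformer has about 12 billion parameters (an assumption, not stated in this post) and counts only the weights themselves. Activations, gradients, optimizer state, the text encoders, and the VAE are all ignored, so real VRAM usage will differ.

```python
# Bytes needed to store one parameter in each precision.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2, "fp8": 1}

# Assumption for illustration: roughly 12 billion transformer parameters.
FLUX_PARAMS = 12_000_000_000


def weight_memory_gib(num_params, dtype):
    """Return the memory needed for the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 1024**3


if __name__ == "__main__":
    for dtype in ("fp16", "fp8"):
        gib = weight_memory_gib(FLUX_PARAMS, dtype)
        print(f"{dtype}: {gib:.2f} GiB of weights")
```

Under these assumptions, FP8 halves the weight footprint compared to FP16, and even the FP8 weights alone would not fit on a 4 GB card. That is why FP8 has to be combined with block swapping and with training only a few layers.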

It took me more than a day to test all the newer configs, their VRAM demands, and their relative step speeds, and to prepare the configs :)


Pablo Montero

Motion Designer, GenAI Researcher

3 months ago

Ey Furkan, you usually say you train all images at 1024x1024, do you have any research or post supporting this premise? Thank you


Joe Skopek

Creative Technologist - AI Pragmatist

3 months ago

The VRAM improvements to FLUX LoRA look amazing. Do you think these improvements will make it more approachable for lower-end users? Thanks so much for sharing. It looks like it was a ton of work!

