Unsloth AI

科技、信息和网络

Sans Fransisco，California 10,580 位关注者

Making AI accessible for everyone! ??

查看职位关注

查看全部 4 位员工

关于我们

Easily finetune & train LLMs. Get faster with unsloth.

网站: https://unsloth.ai
Unsloth AI的外部链接
所属行业: 科技、信息和网络
规模: 2-10 人
总部: Sans Fransisco，California
类型: 私人持股
创立: 2023
领域: artificial intelligence、ai、llms、language models和finetuning

地点

主要

US，California，Sans Fransisco，94107

获取路线

Unsloth AI员工

查看全部员工

动态

Unsloth AI转发了
Daniel Han

unsloth.ai - open-source AI training
3 天前已编辑
举报此动态
We teamed up with Hugging Face to release a free GRPO notebook that fine-tunes Gemma 3 into a powerful reasoning model! Using Unsloth AI, OpenAI’s math dataset and custom reward functions, we fine-tune Google’s Gemma 3 (1B) to generate chain-of-thought reasoning. Free Colab Notebook: https://lnkd.in/e94SKJz4 Summary of what you'll learn: ? Implement chain-of-thought reasoning in Google's Gemma 3 (1B) using 16-bit LoRA ? Make tiny LLMs benefit from GRPO ? Understand reward functions ? Prepare your data + evaluate your LLM Join HF's Course: https://lnkd.in/e_PhX4tc Thank you Ben Burtenshaw for being patient and working with us on this collab! ??
69 条评论

赞评论分享
Unsloth AI转发了
Ben Burtenshaw

Machine Learning Advocacy @ ?? Hugging Face
1 周
举报此动态
The unit we’re all waiting for is here! Unsloth AI + Hugging Face on GRPO in the reasoning course. ?? https://lnkd.in/enr3adQ5 In this unit, you’ll build on the earlier units by implementing GRPO in Unsloth, this time we’re also levelling things up: - run on limited hardware with unsloth optimizations - expand GRPO reward functions to format and beyond - explore a wider range of model sizes up to 7B This should help way more students without serious hardware. Can’t wait to hear how it goes. Follow the org to join in: https://lnkd.in/enr3adQ5
12 条评论

赞评论分享
Unsloth AI

10,580 位关注者
2 周
举报此动态
Unsloth now works on Windows! ?? Fine-tune LLMs locally on Windows without Linux or WSL. Just install prerequisites & run our pip command. Tutorial: https://lnkd.in/gWC4AcMV
14 条评论

赞评论分享
Unsloth AI

10,580 位关注者
3 周已编辑
举报此动态
Tutorial: Train your own Reasoning LLM for free! Transform Llama 3.1 (8B) to have chain-of-thought using DeepSeek's GRPO algorithm. Unsloth makes GRPO use 90% less VRAM: https://docs.unsloth.ai/ You'll learn about: ? Reward Functions + dataset prep ? GRPO Basics + tips & tricks ? Training on free Colab GPUs ? Running + evaluating + saving your model Tutorial Link: https://lnkd.in/gxYGrFhd
12 条评论

赞评论分享
Unsloth AI

10,580 位关注者
1 个月
举报此动态
Today, we’re launching new algorithms that enable 10x longer context lengths & 90% less VRAM for training Reasoning Models (GRPO). Using Unsloth, you can now train your own reasoning model with just 5GB VRAM for Qwen2.5-1.5B with no accuracy loss. Blog: https://lnkd.in/gnvEjxMm Free Colab Notebook for Llama 3.1 (8B) GRPO: https://lnkd.in/g7deg5Uw For our benchmarks, a standard GRPO QLoRA setup (TRL + FA2) for Llama 3.1 (8B) at 20K context required 510.8GB VRAM. Unsloth’s GRPO algorithms reduces this to just 54.3GB. The 5GB VRAM requirement for Qwen2.5 (1.5B) is down from 7GB in our previous GRPO release two weeks ago!
34 条评论

赞评论分享
Unsloth AI

10,580 位关注者
1 个月
举报此动态
You can now reproduce DeepSeek-R1's reasoning on your own local device! Introducing reasoning in Unsloth. You'll just need 7GB VRAM to experience your own "Aha" moment 100% locally or free on Colab. Unsloth makes GRPO RL use 80% less memory. With 15GB VRAM, you can convert Llama 3.1 (8B), Phi-4 (14B), Mistral (7B), or any model up to 15B parameters into reasoning models. Guide + Blog: https://lnkd.in/gdzMDsYF
19 条评论

赞评论分享
Unsloth AI转发了
Unsloth AI

10,580 位关注者
1 个月已编辑
举报此动态
Introducing 1.58bit DeepSeek-R1 GGUFs! ?? R1 can now run in 1.58-bit, while being fully functional. We shrank the 671B parameter model from 720GB to just 131GB - a 80% size reduction. Naively quantizing all layers breaks the model entirely, causing endless loops & gibberish outputs. Our dynamic quants solve this. The 1.58-bit quant fits in 160GB VRAM (2x H100 80GB) for fast inference at ~140 tokens/sec for throughput. By studying DeepSeek AI's R1 architecture, we selectively quantized certain layers to higher bits (like 4-bit), and leave most MoE layers to 1.5-bit. Benchmarks + Blog: https://lnkd.in/g5uA3855 Dynamic GGUFs (131GB–212GB) on Hugging Face: https://lnkd.in/gP7ysgfe
19 条评论

赞评论分享
Unsloth AI

10,580 位关注者
1 个月已编辑
举报此动态
Introducing 1.58bit DeepSeek-R1 GGUFs! ?? R1 can now run in 1.58-bit, while being fully functional. We shrank the 671B parameter model from 720GB to just 131GB - a 80% size reduction. Naively quantizing all layers breaks the model entirely, causing endless loops & gibberish outputs. Our dynamic quants solve this. The 1.58-bit quant fits in 160GB VRAM (2x H100 80GB) for fast inference at ~140 tokens/sec for throughput. By studying DeepSeek AI's R1 architecture, we selectively quantized certain layers to higher bits (like 4-bit), and leave most MoE layers to 1.5-bit. Benchmarks + Blog: https://lnkd.in/g5uA3855 Dynamic GGUFs (131GB–212GB) on Hugging Face: https://lnkd.in/gP7ysgfe
19 条评论

赞评论分享
Unsloth AI转发了
Vaibhav Srivastav

GPU poor @ Hugging Face
2 个月
举报此动态
running Phi 4 w/?Ollama &?Unsloth AI?on Mac, 100% local and fully private! ?? ollama run hf. co/unsloth/phi-4-GGUF:Q8_0 that's it! ??

43 条评论

赞评论分享
Unsloth AI转发了
Unsloth AI

10,580 位关注者
2 个月
举报此动态
You can now finetune Phi-4 for free on Google Colab! Unsloth makes Phi-4 finetuning 2x faster, use 70% less memory, and enables >128K context lengths with no accuracy loss. That's 12x longer than Hugging Face + FA2’s 12K on an 48GB GPU. We've also fixed 4 bugs in Phi-4, greatly increasing the model’s accuracy. View our new documentation: https://docs.unsloth.ai/ Read more details about Microsoft's Phi-4 + our bug fixes: https://lnkd.in/gZHfvtTz Phi-4 Fine-tuning Colab notebook: https://lnkd.in/gqPjRij8

Google Colab

colab.research.google.com

20 条评论

赞评论分享

相似主页

查看职位

融资

Unsloth AI 共 4 轮

上一轮

种子轮 2024年10月30日

在 Crunchbase 上查看更多信息

登录看看您认识Unsloth AI的哪些人

Unsloth AI

科技、信息和网络

Sans Fransisco，California 10,580 位关注者

Making AI accessible for everyone! ??

关于我们

地点

Unsloth AI员工

Daniel Han

unsloth.ai - open-source AI training

Michael Han (Unsloth)

Currently building Unsloth AI. ??

动态

立即加入，查看您错过的职场动态

相似主页

Moonshot AI

Melty

David AI

Anara (YC S24)

Void (YC S24)

Haystack Software

Storia AI

Undermind

autarc (YC S24)

Argil (YC S24)

查看职位

工程师职位

科学家职位

软件工程师职位

机器学习工程师职位

经理职位

分析师职位

分析主管职位

实习生职位

数据科学职位

高级数据分析师职位

指挥员职位

高级科学家职位

数据工程师职位

数据科学家职位

用户体验设计师职位

创意总监职位

统计员职位

融资