instruct fine tuning setup

This commit is contained in:
Your Name
2025-08-28 22:41:56 +00:00
parent d49b4ff2d5
commit 78d7063ce5
58 changed files with 2510234 additions and 98 deletions
+4 -4
View File
@@ -1,5 +1,5 @@
---
base_model: unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
base_model: unsloth/llama-3.3-70b-instruct-bnb-4bit
library_name: transformers
model_name: outputs
tags:
@@ -12,7 +12,7 @@ licence: license
# Model Card for outputs
This model is a fine-tuned version of [unsloth/meta-llama-3.1-8b-instruct-bnb-4bit](https://huggingface.co/unsloth/meta-llama-3.1-8b-instruct-bnb-4bit).
This model is a fine-tuned version of [unsloth/llama-3.3-70b-instruct-bnb-4bit](https://huggingface.co/unsloth/llama-3.3-70b-instruct-bnb-4bit).
It has been trained using [TRL](https://github.com/huggingface/trl).
## Quick start
@@ -36,8 +36,8 @@ This model was trained with SFT.
### Framework versions
- TRL: 0.21.0
- Transformers: 4.55.1
- Pytorch: 2.7.1
- Transformers: 4.55.4
- Pytorch: 2.8.0
- Datasets: 3.6.0
- Tokenizers: 0.21.4