instruct fine tuning setup

2025-08-28 22:41:56 +00:00
parent d49b4ff2d5
commit 78d7063ce5
58 changed files with 2510234 additions and 98 deletions
@@ -1,5 +1,5 @@
 ---
-base_model: unsloth/meta-llama-3.1-8b-instruct-bnb-4bit
+base_model: unsloth/llama-3.3-70b-instruct-bnb-4bit
 library_name: transformers
 model_name: outputs
 tags:
@@ -12,7 +12,7 @@ licence: license

 # Model Card for outputs

-This model is a fine-tuned version of [unsloth/meta-llama-3.1-8b-instruct-bnb-4bit](https://huggingface.co/unsloth/meta-llama-3.1-8b-instruct-bnb-4bit).
+This model is a fine-tuned version of [unsloth/llama-3.3-70b-instruct-bnb-4bit](https://huggingface.co/unsloth/llama-3.3-70b-instruct-bnb-4bit).
 It has been trained using [TRL](https://github.com/huggingface/trl).

 ## Quick start
@@ -36,8 +36,8 @@ This model was trained with SFT.
 ### Framework versions

 - TRL: 0.21.0
- Transformers: 4.55.1
- Pytorch: 2.7.1
+- Transformers: 4.55.4
+- Pytorch: 2.8.0
 - Datasets: 3.6.0
 - Tokenizers: 0.21.4