Anthropic/hh-rlhf
Viewer • Updated • 169k • 38.4k • 1.74k
How to use kubernetes-bad/Ligma-L2-13b with PEFT:
from peft import PeftModel
from transformers import AutoModelForCausalLM
base_model = AutoModelForCausalLM.from_pretrained("NousResearch/Llama-2-13b-hf")
model = PeftModel.from_pretrained(base_model, "kubernetes-bad/Ligma-L2-13b")Ligma Is "Great" for Model Alignment
WARNING: This model is published for scientific purposes only. It may and most likely will produce toxic content.
Trained on the rejected column of Anthropic's hh-rlhf dataset.
Use at your own risk.
License: just comply with llama2 license and you should be ok.
Base model
NousResearch/Llama-2-13b-hf