zephyr-7b-beta

zephyr-7b-beta项目介绍

zephyr-7b-beta是一个强大的语言模型，旨在充当友好、乐于助人的AI助手。这个项目由HuggingFace公司开发，是Zephyr系列模型中的第二个版本。

模型概述

zephyr-7b-beta是在mistralai/Mistral-7B-v0.1的基础上进行微调而来的。它使用了直接偏好优化(DPO)技术，在多个公开可用的合成数据集上进行训练。该模型拥有70亿参数，主要针对英语进行了优化。

性能表现

在发布时，zephyr-7b-beta在多个基准测试中表现出色:

在MT-Bench测试中得分7.34，是7B参数级别的聊天模型中排名最高的。
在AlpacaEval测试中胜率达到90.60%，同样位居7B模型之首。
在某些MT-Bench类别中，它甚至超越了像Llama2-Chat-70B这样的大型开源模型。

使用方法

可以使用Hugging Face的pipeline函数轻松调用该模型:

from transformers import pipeline

pipe = pipeline("text-generation", model="HuggingFaceH4/zephyr-7b-beta", torch_dtype=torch.bfloat16, device_map="auto")

messages = [
    {"role": "system", "content": "You are a friendly chatbot who always responds in the style of a pirate"},
    {"role": "user", "content": "How many helicopters can a human eat in one sitting?"},
]

prompt = pipe.tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
outputs = pipe(prompt, max_new_tokens=256, do_sample=True, temperature=0.7, top_k=50, top_p=0.95)
print(outputs[0]["generated_text"])