Baichuan2-7B-Chat

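Baichuan2-7B-Chat Project Overview

Baichuan2-7B-Chat belongs to the new generation of open-source large language models released by Baichuan Inc. (百川智能). It is a 7B-parameter chat model trained on a high-quality corpus of 2.6 trillion tokens, and it is a key member of the Baichuan 2 model series.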

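Model Features

Strong performance: on authoritative Chinese and English benchmarks, Baichuan2-7B-Chat achieves the best results among models of the same size.
Open access: the model is fully open for academic research, and developers may also use it commercially free of charge after obtaining an official commercial license.
Multiple versions: in addition to the Chat version, a Base version and a 4-bit quantized version are provided to suit different scenarios.
Technical improvements: the model uses F.scaled_dot_product_attention, a feature introduced in PyTorch 2.0, which substantially speeds up inference.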
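
To make the last point more concrete, the following is a minimal pure-Python sketch of the computation that F.scaled_dot_product_attention performs for a single attention head: softmax(Q K^T / sqrt(d)) V, optionally with a causal mask. The function name reference_attention and the list-of-lists representation are illustrative assumptions; this is not Baichuan 2's code and not PyTorch's fused kernel, whose speed comes from optimized GPU implementations rather than from the formula itself.

```python
import math

def reference_attention(q, k, v, is_causal=False):
    """Single-head attention over lists of float vectors (one vector per token).

    Hypothetical reference implementation for illustration only.
    """
    d = len(q[0])
    out = []
    for i, qi in enumerate(q):
        # Scaled similarity between query i and every key.
        scores = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in k]
        if is_causal:
            # A token may only attend to itself and to earlier positions.
            scores = [s if j <= i else float("-inf") for j, s in enumerate(scores)]
        # Numerically stable softmax over the scores.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        total = sum(exps)
        weights = [e / total for e in exps]
        # Output i is the attention-weighted sum of the value vectors.
        out.append([sum(w * vj[c] for w, vj in zip(weights, v))
                    for c in range(len(v[0]))])
    return out

q = k = [[1.0, 0.0], [0.0, 1.0]]
v = [[1.0, 2.0], [3.0, 4.0]]
causal_out = reference_attention(q, k, v, is_causal=True)
full_out = reference_attention(q, k, v)
print(causal_out[0])  # the first token can only see itself
```

With the causal mask the first token attends only to itself, so its output equals the first value vector. In PyTorch 2.0 and later, the equivalent call on tensors is torch.nn.functional.scaled_dot_product_attention(query, key, value, is_causal=True).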

Usage

Using Baichuan2-7B-Chat is straightforward: load the model and tokenizer with the Hugging Face transformers library, then start a conversation. A simple example follows:

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer
from transformers.generation.utils import GenerationConfig

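# trust_remote_code=True lets transformers run the custom model and tokenizer code shipped in the model repository.
tokenizer = AutoTokenizer.from_pretrained("baichuan-inc/Baichuan2-7B-Chat", use_fast=False, trust_remote_code=True)
# device_map="auto" places the weights on the available devices; bfloat16 halves memory use relative to float32.
model = AutoModelForCausalLM.from_pretrained("baichuan-inc/Baichuan2-7B-Chat", device_map="auto", torch_dtype=torch.bfloat16, trust_remote_code=True)
# Use the generation settings published with the model.
model.generation_config = GenerationConfig.from_pretrained("baichuan-inc/Baichuan2-7B-Chat")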

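messages = []
# The prompt asks the model to explain the saying “温故而知新”; the inner quotes are full-width so the string literal stays valid.
messages.append({"role": "user", "content": "解释一下“温故而知新”"})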
response = model.chat(tokenizer, messages)
print(response)
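
The example above sends a single user message. For a multi-turn conversation, one plausible approach is to append each reply to the same messages list before adding the next user turn, so the model sees the earlier context. The sketch below illustrates that bookkeeping. It assumes replies are stored under an "assistant" role, mirroring the "user" role used above. The helper run_conversation and the stand-in echo_chat are hypothetical names introduced here so the sketch runs without downloading the model.

```python
def run_conversation(chat_fn, user_turns):
    """Send user turns one at a time, keeping the full history in `messages`.

    `chat_fn` is any callable that takes the message list and returns a reply string.
    """
    messages = []
    for text in user_turns:
        messages.append({"role": "user", "content": text})
        reply = chat_fn(messages)
        # Store the reply so that the next turn includes the earlier context.
        # The "assistant" role name is an assumption, not confirmed by this document.
        messages.append({"role": "assistant", "content": reply})
    return messages

def echo_chat(messages):
    """Hypothetical stand-in for the real model: echoes the latest user message."""
    return "echo: " + messages[-1]["content"]

history = run_conversation(echo_chat, ["Hello", "What did I just say?"])
for message in history:
    print(message["role"] + ": " + message["content"])
```

With the model and tokenizer loaded as shown earlier, the stand-in could be replaced by lambda msgs: model.chat(tokenizer, msgs).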