dd6fe347创建于 4月9日历史提交

文件	最后提交记录	最后更新时间
evaluation	!6460 修改ChatGLM2-6B问题单 Merge pull request !6460 from xiongliangcheng/master	1 年前
fix	[自研][PyTorch]ChatGLM2-6B添加fix文件夹	2 年前
model	!5990 【ChatGLM2-6B】添加适配torch2.X的FA代码并更新README * 添加适配torch2.X的代码并更新readme	2 年前
ptuning	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
resources	ChatGLM2-6B first commit	2 年前
FAQ.md	ChatGLM2-6B first commit	2 年前
MODEL_LICENSE	ChatGLM2-6B first commit	2 年前
README.md	文档整改，gitee->gitcode Co-authored-by: Lighters_c<zyh13227@163.com> # message auto-generated for no-merge-commit merge: !7469 merge ffffix into master 文档整改，gitee->gitcode Created-by: addsubmuldiv Commit-by: Lighters_c Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7469	5 个月前
README_EN.md	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
api.py	[自研][PyTorch]ChatGLM2-6B带license提交	2 年前
cli_demo.py	[自研][PyTorch]ChatGLM2-6B带license提交	2 年前
openai_api.py	[自研][PyTorch]ChatGLM2-6B带license提交	2 年前
public_address_statement.md	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
requirements.txt	!6460 修改ChatGLM2-6B问题单 Merge pull request !6460 from xiongliangcheng/master	1 年前
utils.py	[自研][PyTorch]ChatGLM2-6B带license提交	2 年前
web_demo.py	[自研][PyTorch]ChatGLM2-6B带license提交	2 年前
web_demo2.py	[自研][PyTorch]ChatGLM2-6B带license提交	2 年前

当前模型脚本已不随版本演进，如使用此模型可跳转至该地址

ChatGLM2-6B

概述

简介

ChatGLM2-6B 是开源中英双语对话模型 ChatGLM-6B 的第二代版本，在保留了初代模型对话流畅、部署门槛较低等众多优秀特性的基础之上，ChatGLM2-6B 拥有更强大的性能和更长的上下文以及更高效的推理。

参考实现：

url=https://github.com/THUDM/ChatGLM2-6B
commitID=921d7e9adc69020a19169d1ba4f76c2675a2dd29

适配昇腾 AI 处理器的实现：

url=https://gitcode.com/ascend/ModelZoo-PyTorch.git
code_path=PyTorch/built-in/foundation

准备训练环境

准备环境

当前模型支持的 PyTorch 版本和已知三方库依赖如下表所示。

表 1 版本支持表

Torch_Version 三方库依赖版本

PyTorch 1.11 transformers == 4.29.0

PyTorch 2.1 transformers == 4.29.0
环境准备指导。

请参考《Pytorch框架训练环境准备》。

Torch_Version	三方库依赖版本
PyTorch 1.11	transformers == 4.29.0
PyTorch 2.1	transformers == 4.29.0

安装依赖。

在模型源码包根目录下执行命令，安装模型对应PyTorch版本需要的依赖。

pip install -r requirements.txt

# transformers适配
pip show transformers
# 复制 Location 路径，然后在`/${Location}/transformers/training_args.py`文件的1334行将`!=`改为`==`。

准备数据集

获取数据集。

用户可以从这里下载数据集，并将其放在ptuning路径下的AdvertiseGen文件夹内，该文件夹内容包括：

├── AdvertiseGen
      ├──train.json
      ├──dev.json

数据转换修改数据转换脚本ptuning/preprocess.sh

# modify the script according to your own  ascend-toolkit path
source env_npu.sh

# for preprocess training datasets
--do_train \
--max_source_length 4096 \ #for example 
--max_target_length 4096 \

# for preprocess predict datasets
--do_predict \
--max_source_length 256 \
--max_target_length 256

执行下面代码转换数据集

  # process datasets                              
  bash preprocess.sh

准备预训练权重

用户可以从这里下载预训练权重和配置文件，然后将这些文件放在 "model"文件夹中，不要覆盖 modeling_chatglm.py文件。 model文件夹内容如下：

  ├── model
      ├──config.json
      ├──configuration_chatglm.py
      ├──pytorch_model-00001-of-00007.bin
      ├──pytorch_model-00002-of-00007.bin
      ├──pytorch_model-00003-of-00007.bin
      ├──pytorch_model-00004-of-00007.bin
      ├──pytorch_model-00005-of-00007.bin
      ├──pytorch_model-00006-of-00007.bin
      ├──pytorch_model-00007-of-00007.bin
      ├──pytorch_model.bin.index.json
      ├──quantization.py
      ├──tokenization_chatglm.py
      ├──tokenizer_config.json
      ├──tokenizer.model
      ├──modeling_chatglm.py

修改modeling_chatglm.py文件:

  USE_Flash=True

PS: 设置为True能提高性能

开始训练

进入解压后的源码包根目录。
```
cd /${模型文件夹名称} 
```
启动训练

该模型P-Tuning v2支持单机单卡，全参数fintune支持单机8卡。

全参数finetune 配置ChatGLM2-6B训练脚本: ptuning/ds_train_finetune.sh

# modify the script according to your own  ascend-toolkit path
source env_npu.sh

# modify script according to your own needs
--model_name_or_path ../model/ \  #model path
--max_source_length 4096 \
--max_target_length 4096 \  #should align with the processed dataset

注意：--max_source_length与--max_target_length参数配置应该与启动preprocess.sh脚本转换数据集时一致，如果需要更改这两个参数，需要重新转换对应长度的数据集。

启动8卡微调

bash ds_train_finetune.sh

Lora 启动Lora微调
```
bash ds_train_lora.sh
```

全参数finetune验证

运行以下命令

cd /${模型文件夹名称}/ptuning
bash evaluate_fintune.sh

模型训练脚本部分参数说明如下。

--model_name_or_path                   // 模型路径
--output_dir                           // 模型输出路径
--gradient_accumulation_steps          // 梯度累计步长
--learning_rate                        // 学习率

训练结果展示

表 2 训练结果展示表

Device	Torch_version	total Iterations	throughput rate (samples/s)	throughput rate (tokens/s/p)	single-step time (s/step)	floating point operation (TFLOPs/s)
8p-NPU	1.11	1000	1.79	1833	4.46	65.72
8p-NPU	2.1	1000	2.03	2078	3.94	74.62
8p-竞品	2.1	1000	1.76	1802	4.54	64.64

推理

推理环境搭建

推理环境搭建参考上述训练环境搭建。

推理脚本

1）执行vim infer.py创建推理脚本，然后将下面代码写入infer.py文件中，然后按Esc键输入:wq退出并保存文件。

from transformers import AutoTokenizer, AutoModel

# 修改CHECKPOINT路径
CHECKPOINT="./model_weight"
tokenizer = AutoTokenizer.from_pretrained(CHECKPOINT, trust_remote_code=True)
model = AutoModel.from_pretrained(CHECKPOINT, trust_remote_code=True, device='npu')
model = model.eval()

print("请输入对话")
_input=input(">>")

while _input:
    response, history = model.chat(tokenizer, _input, history=[])
    print(response)
    _input=input(">>")

2）运行下面命令执行推理任务

 python infer.py

推理结果展示

请输入对话
>>你好
你好👋！我是人工智能助手 ChatGLM2-6B，很高兴见到你，欢迎问我任何问题。
>>晚上睡不着应该怎么办
以下时是一些有助于晚上睡觉的技巧:

1. 创建一个规律的睡眠时间表:每天在相同的时间上床并起床可以帮助身体建立一个规律的睡眠时间表。

2. 创建一个舒适的睡眠环境:在安静、黑暗、凉爽、舒适的房间里睡觉可以帮助放松身心,更容易入睡。

3. 避免使用电子设备:在睡觉前一两个小时内避免使用电子设备,如手机、电脑、平板电脑等,以免干扰睡眠。

4. 放松身心:在睡觉前做些轻松的活动,如阅读、听轻柔的音乐、洗个热水澡、做些瑜伽、冥想等,有助于放松身心,减轻压力。

5. 避免咖啡因和酒精:在睡觉前几个小时内避免摄入咖啡因和酒精,以免影响睡眠。

6. 远离刺激:在睡觉前远离刺激,如避免摄入咖啡因、饮酒、吸烟等,以免影响睡眠。

7. 远离压力:避免在睡觉前进行紧张的活动,如激烈的运动,以免影响睡眠。

如果这些技巧不能解决你的问题,你可以尝试寻求医生的帮助,找到更好的解决方案。

评估

准备数据集任务

用户可以从 Tsinghua Cloud 下载处理好的 C-Eval 数据集，解压到 evaluation 目录下。

运行评估任务

1）首先修改评估脚本evaluation/evaluate_ceval.py。

# 修改 CHECKPOINT 路径和数据集任务路径
CHECKPOINT= "../model"
DATA_PATH="./CEval/test/**/*.jsonl"

2）然后运行下面代码执行评估任务。

cd evaluation
python evaluate_ceval.py

评估结果展示

表 3 评估结果展示表

任务	验证集	模型	昇腾值	参考值	社区值
CEval	val	ChatGLM2-6B	0.548	0.552	--

FAQ

加载参数阶段有卡死现象

删除root下的cache目录，重新运行

提示so文件错误

提示so文件找不到
# 若遇到该报错
全局搜索so的位置，然后导入环境变量
export LD_LIBRARY_PATH=/usr/:$LD_LIBRARY_PATH

eval提示scaledsoftmax报错

# 若遇到该报错
搜索output文件夹生成的modeling_chatglm.py文件，
self.scale_mask_softmax 设置为false

微调时出现AttributeError或RuntimeError

module 'torch_npu' has no attribute 'npu_rotary_mul' 或

RuntimeError:Error!, The last dimension of input tensor shoule be within the range of [32,2048] and be divisible by32

# 修改modeling_chatglm.py文件:
USE_Flash=False
USE_SCALED_SOFTMAX=False

如果cann不支持flash_attention

报错提示为module 'torch_npu' has no attribute 'npu_flash_attention'或RuntimeError: call aclnnFlashAttentionScore failed，请：

# 修改modeling_chatglm.py文件:
USE_FLASH=False

任务评估过程中出现XXXX does not appear to have a file named tokenization_chatglm.py/ configuration_chatglm.py / modeling_chatglm.py时，将model路径下的tokenization_chatglm.py/ configuration_chatglm.py / modeling_chatglm.py拷贝到XXXX路径下。
如果出现报错：ValueError: FP16 Mixed precision training with AMP or APEX (--fp16) and FP16 half precision evaluation (--fp16_full_eval) can only be used on CUDA devices. 请在transformers/training_args.py文件的1334行将!=改为==。

公网地址说明

代码涉及公网地址参考 public_address_statement.md