dd6fe347创建于 4月9日历史提交

文件	最后提交记录	最后更新时间
dataset	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
models	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
python_scripts	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
scripts	!6881 【bugfix】ckpt_epoch保存方式修改 Merge pull request !6881 from 小乔/master	1 年前
tasks	!6881 【bugfix】ckpt_epoch保存方式修改 Merge pull request !6881 from 小乔/master	1 年前
utils	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
.gitignore	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
README.md	文档整改，gitee->gitcode Co-authored-by: Lighters_c<zyh13227@163.com> # message auto-generated for no-merge-commit merge: !7469 merge ffffix into master 文档整改，gitee->gitcode Created-by: addsubmuldiv Commit-by: Lighters_c Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7469	5 个月前
README_en.md	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
public_address_statement.md	!7376 optimize public_address_statement.md Merge pull request !7376 from 王凯宇/master	8 个月前
requirements.no_torch.txt	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
requirements.torch.txt	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
requirements.txt	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前

PLLaVA for PyTorch

简介

模型介绍

PLLaVA是一种新颖的端到端训练的大型多模态模型，它结合了视觉编码器和Vicuna，用于通用的视觉和语言理解，实现了令人印象深刻的聊天能力，在科学问答（Science QA）上达到了新的高度。

支持任务列表

本仓已经支持以下模型任务类型：

模型	任务列表	是否支持
LLaVA 1.6 7B	训练	✔
LLaVA 1.6 7B	推理	✔

代码实现

参考实现：

url=https://github.com/magic-research/PLLaVA
commit_id=6f49fd2

适配昇腾AI处理器的实现：

url=https://gitcode.com/ascend/ModelZoo-PyTorch.git
code_path=PyTorch/built-in/mm/PLLaVA

准备训练环境

安装模型环境

下载代码：

git clone https://gitcode.com/ascend/ModelZoo-PyTorch.git
cd PyTorch/built-in/mm/PLLaVA

创建Python环境并且安装Python三方包：

conda create -n pllava python=3.10 -y
conda activate pllava
pip install --upgrade pip  # enable PEP 660 support
pip3 install torch==2.1.0+cpu  --index-url https://download.pytorch.org/whl/cpu  #For X86
pip3 install torch==2.1.0  #For Aarch64
pip install -r requirements.txt

安装昇腾环境

请参考昇腾社区中《Pytorch框架训练环境准备》文档搭建昇腾环境，本仓已支持表4中软件版本。

表 4 昇腾软件版本支持表

软件类型	支持版本
FrameworkPTAdapter	在研版本
CANN	在研版本
昇腾NPU固件	在研版本
昇腾NPU驱动	在研版本

准备数据集

json文件下载路径参考： (https://huggingface.co/datasets/OpenGVLab/VideoChat2-IT)。
视频文件下载参考：（https://github.com/magic-research/PLLaVA/blob/main/README.md 中的数据准备章节）。

数据集结构如下所示：

 dataset/VideoChat2-IT/video/reasoning/clever_qa
   ├── train.json
 
 dataset/video_all
   ├── xxx.mp4

在训练脚本中（train_pllava_single_npu.sh（单卡）、 train_pllava_multi_npu.sh（单机多卡）、train_pllava_npu_multi_node.sh（多机多卡））通过指定train_corpus的value，在 tasks/train/instruction_data.py中获取具体的json路径和视频路径。

获取预训练模型

联网情况下，预训练模型会自动下载。
无网络时，用户可访问huggingface官网自行下载，文件namespace如下：参考 https://github.com/magic-research/PLLaVA/blob/main/README.md 中的模型下载准备章节。在训练脚本中，需要指定模型存储的绝对路径。

快速开始

模型训练

训练脚本位置位于scripts目录，提供了train_pllava_single_npu.sh（单卡）、 train_pllava_multi_npu.sh（单机多卡）、train_pllava_npu_multi_node.sh（多机多卡）三个脚本。需要根据真实值配置cann的set_env.sh路径、数据集路径、权重的路径。
运行训练脚本，下面以单机单卡示例：
```
bash scripts/train_pllava_single_npu.sh 
```
训练完成后，权重文件保存在参数--output_dir路径下。

结果展示

表 2 训练结果展示：

芯片	卡数	second per step	batch_size	AMP_Type	Torch_Version
竞品A	8p	0.9352s	1	bf16	2.1
Atlas 200T A2 Box16	8p	0.8411s	1	bf16	2.1
竞品A	8p	1.0760s	1	fp32	2.1
Atlas 200T A2 Box16	8p	0.9347s	1	fp32	2.1

模型推理

训练脚本位置位于scripts目录下，提供了eval_single.sh脚本，其中的cann的set_env.sh路径、视频文件路径、模型文件路径、权重文件路径等，按照实际填写。

bash scripts/eval_single.sh

脚本执行中，会让用户输入问题，再根据问题返回答案。

公网地址说明

代码涉及公网地址参考 public_address_statement.md

变更说明

2024.08.09: 首次发布。

FAQ

无