dd6fe347创建于 4月9日历史提交

文件	最后提交记录	最后更新时间
assets	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
configs	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
docs	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
eval	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
gradio	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
opensora	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
scripts	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
test	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
tests	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
tools	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
.isort.cfg	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
LICENSE	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
README.md	文档整改，gitee->gitcode Co-authored-by: Lighters_c<zyh13227@163.com> # message auto-generated for no-merge-commit merge: !7469 merge ffffix into master 文档整改，gitee->gitcode Created-by: addsubmuldiv Commit-by: Lighters_c Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7469	5 个月前
public_address_statement.md	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
requirements.txt	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
requirements_npu.txt	!6864 【bugfix】huggingface hub版本修改 Merge pull request !6864 from J石页/master	1 年前
setup.py	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前

OpenSora1.1 for PyTorch

注意：本仓库OpenSora-1.1模型将不再进行维护，请使用MindSpeed-MM

简介

模型介绍

OpenSora是HPC AI Tech开发的开源高效复现类Sora视频生成方案。OpenSora不仅实现了先进视频生成技术的低成本普及，还提供了一个精简且用户友好的方案，简化了视频制作的复杂性。本仓库主要将OpenSora1.1的STDiT2模型的任务迁移到了昇腾NPU上，并进行极致性能优化。

支持任务列表

本仓已经支持以下模型任务类型

模型	任务列表	是否支持
STDiT2-XL/2	在线训练	✔
STDiT2-XL/2	在线推理	✔

代码实现

参考实现：

url=https://github.com/hpcaitech/Open-Sora
commit_id=74b645350b0f7a0ed802f87243c23edd1504c26d

适配昇腾 AI 处理器的实现：

url=https://gitcode.com/ascend/ModelZoo-PyTorch.git
code_path=PyTorch/built-in/mm/

STDiT2（在研版本）

准备训练环境

安装模型环境

表 3 三方库版本支持表

三方库	支持版本
PyTorch	2.1.0
TorchVision	0.16.0

在模型根目录下执行以下命令，安装模型对应PyTorch版本需要的依赖。

source ${cann_install_path}/ascend-toolkit/set_env.sh              # 激活cann环境
cd OpenSora1.1
pip install -v -e .                                                # 安装本地代码仓，同时自动安装依赖

安装mindspeed：

git clone https://gitcode.com/ascend/MindSpeed.git
pip install -e MindSpeed

获取 Megatron-LM 并指定 commit id:

git clone https://github.com/NVIDIA/Megatron-LM.git
cd Megatron-LM
git checkout core_r0.6.0

安装昇腾环境

请参考昇腾社区中《Pytorch框架训练环境准备》文档搭建昇腾环境，本仓已支持表4中软件版本。

表 4 昇腾软件版本支持表

软件类型	支持版本
FrameworkPTAdapter	在研版本
CANN	在研版本
昇腾NPU固件	在研版本
昇腾NPU驱动	在研版本

准备数据集

训练数据集准备

数据集准备请参考官网，链接如下： https://github.com/hpcaitech/Open-Sora?tab=readme-ov-file#data-processing

获取预训练模型

联网情况下，预训练模型会自动下载。

无网络时，用户可访问huggingface官网自行下载，文件namespace如下：

PixArt-alpha/PixArt-alpha   # PixArt-XL-2-512x512模型(训练用)
stabilityai/sd-vae-ft-ema   # vae模型
DeepFloyd/t5-v1_1-xxl       # t5模型
hpcai-tech/OpenSora-STDiT-v2-stage2        # 预训练权重(推理用)
hpcai-tech/OpenSora-STDiT-v2-stage3        # 预训练权重(推理用)

说明：
在线推理时，对hpcai-tech/OpenSora-STDiT-v2-stage3和hpcai-tech/OpenSora-STDiT-v2-stage3模型需做一些离线转换，转换成.pth格式。提供参考用例：
import os
import torch
import safetensors
data = safetensors.torch.load_file('./hpcai-tech/OpenSora-STDiT-v2-stage2/model.safetensors')
data["state_dict"] = data
torch.save(data, os.path.splitext('./hpcai-tech/OpenSora-STDiT-v2-stage2/model.safetensors')[0]+'.pth')

获取对应的预训练模型后，在以下配置文件中将model、vae的from_pretrained参数设置为本地预训练模型绝对路径。

configs/opensora-v1-1/inference/sample.py
configs/opensora-v1-1/train/stage1.py
configs/opensora-v1-1/train/stage2.py
configs/opensora-v1-1/train/stage3.py

将下载好的t5模型放在本工程目录下的DeepFloyd目录下，组织结构如下：

$OpenSora1.1
├── DeepFloyd
├── ├── t5-v1_1-xxl
├── ├── ├── config.json
├── ├── ├── pytorch_model-00001-of-00002.bin
├── ├── ├── ...
└── ...

快速开始

训练任务

本任务主要以预训练模型为主，展示训练任务，包含单机单卡和单机多卡的训练。

开始训练

进入解压后的源码包根目录。
```
cd /${模型文件夹名称} 
```
准备训练数据。按照官网流程，准备对应数据集，处理数据并得到包含数据信息的csv文件，放在模型文件夹下，如图：
```
$OpenSora1.1
├── train_data.csv
└── ...
```

运行训练脚本。

用户可以按照自己训练需要进行参数配置，以下给出单卡和多卡的一种训练示例。

bash test/train_full_1p_opensorav1_1.sh --data_path=train_data.csv
# 混合精度BF16，单卡训练，stage1

bash test/train_full_8p_opensorav1_1.sh --data_path=train_data.csv
# 混合精度BF16，八卡训练，stage1

对于本模型，可以采用绑核优化，以绑核方式启动。绑核方法参考：https://gitcode.com/ascend/att/tree/master/profiler/affinity_cpu_bind 本模型使用示例如下：

python3 bind_core.py \
-app="bash test/train_full_18p_opensorav1_1.sh --data_path=train_data.csv"

推理任务

本任务主要以预训练模型为主，展示推理任务，包括单卡在线推理。

开始推理

进入解压后的源码包根目录。
```
cd /${模型文件夹名称} 
```
运行推理的脚本。

单机单卡推理

bash test/infer_full_1p_opensorav1_1.sh --ckpt_path=/path/to/OpenSora-STDiT-v2-stage3/model.pth  # 混精bf16 在线推理

推理脚本参数说明如下

test/infer_full_1p_opensorav1_1.sh
--batch_size                         //设置batch_size
--ckpt_path                          //推理加载的模型地址
--prompt                             //测试用的prompt
--num_frames                         //生成视频的总帧数
--img_h                              //生成视频的宽
--img_w                              //生成视频的高
 
scripts/inference.py
config                               //配置文件路径
--seed                               //随机种子
--ckpt-path                          //推理加载的模型文件路径    
--batch-size                         //设置batch_size
--prompt-path                        //推理使用的prompt文件路径
--prompt                             //测试用的prompt
--num-frames                         //生成视频的总帧数
--image-size                         //生成视频的分辨率
--fps                                //生成视频的帧率
--save-dir                           //输出视频的路径
--num-sampling-steps                 //推理的采样步数
--cfg-scale                          //无分类器引导的权重系数

公网地址说明

代码涉及公网地址参考 public_address_statement.md

变更说明

变更

2024.04.29：OpenSora1.1 STDiT2 bf16训练和推理任务首次发布。