dd6fe347创建于 4月9日历史提交

文件	最后提交记录	最后更新时间
IndexKits	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
app	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
comfyui-hydit	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
controlnet	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
dataset	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
diffusers	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
hydit	!7365 【特性】Performance statistics of the DeepSpeed framework for HunyuanDiT Merge pull request !7365 from liushengyuan/master	8 个月前
lite	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
lora	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
mllm	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
test	!7365 【特性】Performance statistics of the DeepSpeed framework for HunyuanDiT Merge pull request !7365 from liushengyuan/master	8 个月前
trt	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
utils	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
LICENSE.txt	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
Notice	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
README.md	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
README_RAW.md	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
environment.yml	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
example_prompts.txt	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
public_address_statement.md	fix link validity Co-authored-by: frozenleaves<914814442@qq.com> # message auto-generated for no-merge-commit merge: !7517 merge master into master fix link validity Created-by: frozenn Commit-by: frozenleaves Merged-by: ascend-robot Description: ## Motivation Please describe the motivation of this PR and the goal you want to achieve through this PR. ## Modification Please briefly describe what modification is made in this PR. ## Self-test (Optional) If modifications to this PR may cause/fix function/accuracy/performance DTSs/issues, a self-inspection record needs to be attached. ## BC-breaking (Optional) If there are compatibility issues, such as dependencies on cann/torch_npu versions, they need to be explained in the PR. ## Checklist Before PR: - [ ] The new code needs to comply with the Clean Code specification. - [ ] The PR content is self-checked, and the expression can be clear and the writing standardized After PR: - [ ] CLA has been signed and all committers have signed the CLA in this PR. - [ ] The ci-pipeline is passed, Code Check is passed. See merge request: Ascend/ModelZoo-PyTorch!7517	1 个月前
requirements.txt	!6895 【资料】Torchvision版本更新 Merge pull request !6895 from J石页/master	1 年前
sample_controlnet.py	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前
sample_t2i.py	!6719 [built-in][Pytorch] 调整多模态模型存放目录 Merge pull request !6719 from zhangjunyi08/master	1 年前

HunyuanDiT for PyTorch

简介

模型介绍

HunyuanDiT是由腾讯开发并开源的一款先进的文生图（文本到图像）模型。该模型支持中英文双语输入，特别针对中文进行了优化，能够深刻理解中文语境和文化元素，生成高质量且富有中国文化特色的图像。HunyuanDiT经过大规模中文数据集的训练，涵盖了广泛的类别和艺术风格，能够根据文本提示生成细腻逼真的图像。本仓库主要将HunyuanDiT模型的任务迁移到了昇腾NPU上，并进行极致性能优化。

支持任务列表

本仓已经支持以下模型任务类型

模型	任务列表	是否支持
DiT-g/2	在线训练	✔
DiT-g/2	在线推理	✔

代码实现

参考实现：

url=https://github.com/Tencent/HunyuanDiT
commit_id=3bb80e1dedba5bf9728e7c9566c4b5c665bbfbd2

适配昇腾 AI 处理器的实现：

url=https://gitcode.com/ascend/ModelZoo-PyTorch.git
code_path=PyTorch/built-in/mlm/HunyuanDiT

HunyuanDiT（在研版本）

准备训练环境

安装模型环境

表 3 三方库版本支持表

三方库	支持版本
PyTorch	2.1.0
TorchVision	0.16.0
deepspeed	0.14.4
diffusers	0.21.2
transformers	4.39.1
accelerate	0.27.2

在模型根目录下执行以下命令，安装模型对应PyTorch版本需要的依赖。

source ${cann_install_path}/ascend-toolkit/set_env.sh              # 激活cann环境
cd HunyuanDiT
pip install -r requirements.txt                                    #安装其它依赖

安装昇腾环境

请参考昇腾社区中《Pytorch框架训练环境准备》文档搭建昇腾环境，本仓已支持表4中软件版本。

表 4 昇腾软件版本支持表

软件类型	支持版本
FrameworkPTAdapter	在研版本
CANN	在研版本
昇腾NPU固件	在研版本
昇腾NPU驱动	在研版本

准备数据集

训练数据集准备

数据集准备请参考官网，链接如下： https://github.com/Tencent/HunyuanDiT

获取预训练模型

联网情况下，预训练模型会自动下载。
无网络时，用户可访问huggingface官网自行下载(https://huggingface.co/Tencent-Hunyuan/HunyuanDiT/tree/main)

将下载好的t5模型放在本工程目录下的ckpts目录下，组织结构如下：

$HunyuanDiT
├── ckpts
├── ├── t2i
├── ├── ├── clip_text_encoder
├── ├── ├── model
├── ├── ├── mt5
├── ├── ├── sdxl-vae-fp16-fix
├── ├── ├── tokenizer
└── ...

快速开始

训练任务

本任务主要以全参微调为主，展示训练任务，包含单机单卡和单机多卡的训练。

开始训练

进入解压后的源码包根目录。
```
cd /${模型文件夹名称} 
```

准备训练数据。按照官网流程，准备对应数据集，放在模型文件夹下，如图：

dataset
 ├──porcelain
 │  ├──images/  (image files)
 │  │  ├──0.png
 │  │  ├──1.png
 │  │  ├──......
 │  ├──csvfile/  (csv files containing text-image pairs)
 │  │  ├──image_text.csv
 │  ├──arrows/  (arrow files containing all necessary training data)
 │  │  ├──00000.arrow
 │  │  ├──00001.arrow
 │  │  ├──......
 │  ├──jsons/  (final training data index files which read data from arrow files during training)
 │  │  ├──porcelain.json
 │  │  ├──porcelain_mt.json

运行训练脚本。

用户可以按照自己训练需要进行参数配置，以下给出多卡的一种训练示例。

【如需长步数训练】需修改epochs默认步数1400步为所需步数 (Total Optimzation Steps为Epochs * len(Data Loader) // Gradient Accumulation Steps)

vim test/train_full_8p_bf16.sh

# 修改epochs为所需步数
--reso-step 64 \
--epochs 1400 \ 
--max-training-steps ${max_train_steps} \

vim test/train_full_8p_bf16.sh

# 修改epochs为所需步数
--reso-step 64 \
--epochs 1400 \ 
--max-training-steps ${max_train_steps} \

【zero3】配置可在/hydit/modules/models.py文件里修改层数等，如修改depth层为90

运行zero2配置脚本

bash test/train_full_8p_bf16.sh
# 混合精度BF16，8卡训练

运行zero3配置脚本

bash test/train_full_8p_bf16_zero3.sh
# 混合精度BF16，8卡训练

性能展示

性能 (zero2)

芯片	卡数	单步迭代耗时（ms/step）	batch_size	AMP_dtype
GPU	8p	1059.3	1	BF16
Atlas A2	8p	1011.7	1	BF16

推理任务

本任务主要以全参微调为主，展示推理任务，包括单卡在线推理。

开始推理

进入解压后的源码包根目录。
```
cd /${模型文件夹名称} 
```
运行推理的脚本（待补充）。

单机单卡推理

bash test/inference_full_1p_fp16.sh  # 混精fp16 在线推理

推理脚本参数说明如下

test/inference_full_1p_fp16.sh
--prompt                         //测试用的prompt

公网地址说明

代码涉及公网地址参考 public_address_statement.md

变更说明

变更

2024.08.22：HunyuanDiT bf16训练和fp16推理任务首次发布。