{
 "cells": [
  {
   "cell_type": "markdown",
   "metadata": {},
   "source": [
    "# 1 章节介绍\n",
    "\n",
    "本教程《Sana-Video 推理优化实践》以 `Sana-Video` 为例,展示如何在昇腾 NPU 上完成模型跑通、Profiling 分析,并在整网中接入 `RMSNorm` 融合算子验证性能收益。\n",
    "\n",
    "---\n",
    "\n",
    "## 学习目标\n",
    "- 完成 `Sana-Video` Baseline 跑通。\n",
    "- 使用 `torch_npu.profiler` 采集 Baseline 性能数据。\n",
    "- 在 `Sana-Video` 中接入 `torch_npu.npu_rms_norm` 并验证整网收益。\n",
    "- 对比优化前后的整网时延变化。\n",
    "\n",
    "## 章节安排\n",
    "- [1 章节介绍](./01_chapter_intro.ipynb)\n",
    "- [2 Baseline 跑通](./02_baseline_run.ipynb)\n",
    "- [3 Profiling 分析](./03_profiling_analysis.ipynb)\n",
    "- [4 RMSNorm 融合接入与收益验证](./04_rmsnorm_fusion_optimization.ipynb)\n",
    "\n",
    "## 环境说明\n",
    "- 本教程依赖上游 `Sana` 仓库代码,并在运行目录中拉取固定 commit。\n",
    "- 在线体验请直接在 GitCode Notebook 环境中执行;Notebook 默认复用环境中已预装的 `torch`、`torch_npu`、`torchvision` 与 `torchaudio`。本地运行前请先准备兼容版本的上述依赖,并配置 CANN 与 `torch_npu`。\n",
    "- 运行本教程时,主机内存需至少 16GB。Notebook 计算类型建议选择 NPU 910B、CPU 32GB,容器镜像建议选择 ubuntu22-cann8.5-py3.11-jupyter-notebook。\n"
   ]
  }
 ],
 "metadata": {
  "kernelspec": {
   "display_name": "Python 3 (ipykernel)",
   "language": "python",
   "name": "python3"
  },
  "language_info": {
   "codemirror_mode": {
    "name": "ipython",
    "version": 3
   },
   "file_extension": ".py",
   "mimetype": "text/x-python",
   "name": "python",
   "nbconvert_exporter": "python",
   "pygments_lexer": "ipython3",
   "version": "3.10"
  }
 },
 "nbformat": 4,
 "nbformat_minor": 4
}