triton-ascend/python/test · wxlong_ustc/triton-ascend - AtomGit

File	Last commit	Last updated
backend	Remove `test_xpu_backend.py` (#4616) This test case is never run, so removing it. Signed-off-by: Whitney Tsang <whitney.tsang@intel.com>	1 year ago
kernel_comparison	[CI] Update kernels names (#2093) Co-authored-by: Philippe Tillet <phil@openai.com>	2 years ago
regression	[TEST] Reenable mixed precision dot tests (#4965) And remove the outdated performance tests. We can also add various float8 types and move `scaled_dot` tests here.	1 year ago
unit	feat(runtime/autotune): add AsyncCompileMode to support parallel compilation in autotuning. Co-authored-by: Xuan Peng <pengxuan9@huawei.com>. Merge request Ascend/triton-ascend!1268: merge feat/async-compile-0210 into main. Created-by: HinPeng. Commit-by: Xuan Peng. Merged-by: ascend-robot. PR description: 1. Introduce async compile mode from triton v3.5.1 (with minor modifications for compatibility with the current branch and torch-2.7.1). 2. Refactor the autotuner to compile Triton kernels in parallel. Notice: 1. Introduce the `MLIR_DISABLE_MULTITHREADING` environment variable early, from triton v3.5.1. 2. Add `TRITON_AUTOTUNE_PARALLEL_COMPILE` to control whether the autotuner compiles kernels in parallel; defaults to '1'.	3 months ago
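
The last commit message above describes an environment-variable switch for parallel compilation in the autotuner. The sketch below only illustrates that general pattern and is not the triton-ascend implementation: the helper names `parallel_compile_enabled` and `compile_all`, the `max_workers` parameter, and the rule that only the value '1' enables parallel mode are all assumptions. The variable name `TRITON_AUTOTUNE_PARALLEL_COMPILE` and its default of '1' come from the commit message.

```python
import os
from concurrent.futures import ThreadPoolExecutor


def parallel_compile_enabled(env=None):
    """Return True when the (assumed) flag value is '1'; the default is '1'."""
    env = os.environ if env is None else env
    return env.get("TRITON_AUTOTUNE_PARALLEL_COMPILE", "1") == "1"


def compile_all(configs, compile_fn, env=None, max_workers=4):
    """Apply compile_fn to every config, in parallel when the flag allows it.

    compile_fn is a hypothetical stand-in for whatever compiles one
    autotune configuration; results are returned in input order.
    """
    if parallel_compile_enabled(env):
        with ThreadPoolExecutor(max_workers=max_workers) as pool:
            return list(pool.map(compile_fn, configs))
    # Sequential fallback when parallel compilation is switched off.
    return [compile_fn(cfg) for cfg in configs]
```

Passing an explicit `env` mapping keeps the helper easy to test without touching the process environment; in real use the flag would typically be set in the shell before launching Python.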