文件最后提交记录最后更新时间
Nvidia CI with `torch 2.11` (#45243) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>1 个月前
fix: hf-doc-builder insallation was failing (#45225) * the dev extra now installs hf-doc-builder as well * use main1 个月前
🚨🚨🚨 Fully remove Tensorflow and Jax support library-wide (#40760) * setup * start the purge * continue the purge * more and more * more * continue the quest: remove loading tf/jax checkpoints * style * fix configs * oups forgot conflict * continue * still grinding * always more * in tje zone * never stop * should fix doc * fic * fix * fix * fix tests * still tests * fix non-deterministic * style * remove last rebase issues * onnx configs * still on the grind * always more references * nearly the end * could it really be the end? * small fix * add converters back * post rebase * latest qwen * add back all converters * explicitly add functions in converters * re-add8 个月前
Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel (#41567) * fully deprecate autogptq * remove use_cuda and use_exllama toggles are fully deprecated in gptqmodel * format * add `act_group_aware` property * fix QUANT_TYPE assert Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * format * mod awq import * remove autoawq fuse support * remove remove autoawq.config fuse * cleanup * remove awq fuse test * fix import * use gptqmodel * cleanup * remove get_modules_to_fuse * mod require_auto_awq -> require_gptqmodel * convert vertion to checkpoint_format * check is_gptqmodel_available * revert modules_to_not_convert * pass bits, sym, desc_act * fix awqconfig init * fix wrong args * fix ipex * mod ipex version check * cleanup * fix awq_linear * remove self.exllama_config = exllama_config * cleanuo * Revert "cleanuo" This reverts commit 90019c6fc4f7a617ed9db482a42ecd1cd07f9108. * update is_trainable * cleanup * remove fused * call hf_select_quant_linear_v2() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Remove the "version" field from AwqConfig Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Add torch_fused inferencefix test_gptq test Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_awq Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_awq Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix AwqConfig Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * call hf_select_quant_linear_v2() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * remove auto_awq Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix typo Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Compatible with legacy field: checkpoint_format Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Compatible with legacy field: checkpoint_format Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * format Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * CLEANUP Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * update test_awq Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix get_modules_to_not_convert() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_awq.py::AwqTest::test_quantized_model_exllama Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Apply style fixes * test_awq.py added EXPECTED_OUTPUT Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * update test_gptq.py Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_awq.py::AwqTest::test_save_pretrained Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * use assertEqual() instead of assertTrue() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_quantized_layers_class() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * remove ExllamaV1 Test Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * format Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix get_modules_to_not_convert() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * added EXPECTED_OUTPUT Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * remove ExllamaV1 Test Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * add AwqBackend.AUTO_TRAINABLE Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Update docs/source/zh/llm_tutorial.md Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> * revert temporarily fix Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> --------- Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> Co-authored-by: ZX-ModelCloud <zx@modelcloud.ai> Co-authored-by: LRL2-ModelCloud <lrl2@modelcloud.ai> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: ZX-ModelCloud <165115237+ZX-ModelCloud@users.noreply.github.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>5 个月前
[CI] AMD docker: bump to ROCm 7.2.2 / PyTorch 2.10 + prebuilt FA wheel (#45913) * new image & FA whls * update some expectations * fix more expectations * fix style15 天前
fix `deeepspeed` in AMD docker file (#42025) fix deeepspeed in AMD docker Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>6 个月前
Fix `torch+deepspeed` docker file (#41985) * fix * delete --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>6 个月前
unpin `torchcodec==0.5.0` and use `torch 2.8` on daily CI (#40072) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>9 个月前
Fix docker files (#43946) fix Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>3 个月前
Rename master to main for notebooks links and leftovers (#16397) 4 年前
Fully deprecate AutoGPTQ and AutoAWQ for GPT-QModel (#41567) * fully deprecate autogptq * remove use_cuda and use_exllama toggles are fully deprecated in gptqmodel * format * add `act_group_aware` property * fix QUANT_TYPE assert Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * format * mod awq import * remove autoawq fuse support * remove remove autoawq.config fuse * cleanup * remove awq fuse test * fix import * use gptqmodel * cleanup * remove get_modules_to_fuse * mod require_auto_awq -> require_gptqmodel * convert vertion to checkpoint_format * check is_gptqmodel_available * revert modules_to_not_convert * pass bits, sym, desc_act * fix awqconfig init * fix wrong args * fix ipex * mod ipex version check * cleanup * fix awq_linear * remove self.exllama_config = exllama_config * cleanuo * Revert "cleanuo" This reverts commit 90019c6fc4f7a617ed9db482a42ecd1cd07f9108. * update is_trainable * cleanup * remove fused * call hf_select_quant_linear_v2() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Remove the "version" field from AwqConfig Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Add torch_fused inferencefix test_gptq test Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_awq Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_awq Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix AwqConfig Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * call hf_select_quant_linear_v2() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * remove auto_awq Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix typo Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Compatible with legacy field: checkpoint_format Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Compatible with legacy field: checkpoint_format Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * format Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * CLEANUP Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * update test_awq Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix get_modules_to_not_convert() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_awq.py::AwqTest::test_quantized_model_exllama Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Apply style fixes * test_awq.py added EXPECTED_OUTPUT Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * update test_gptq.py Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_awq.py::AwqTest::test_save_pretrained Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * use assertEqual() instead of assertTrue() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix test_quantized_layers_class() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * remove ExllamaV1 Test Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * format Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * fix get_modules_to_not_convert() Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * added EXPECTED_OUTPUT Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * remove ExllamaV1 Test Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * add AwqBackend.AUTO_TRAINABLE Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> * Update docs/source/zh/llm_tutorial.md Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com> * revert temporarily fix Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> --------- Signed-off-by: ZX-ModelCloud <zx@modelcloud.ai> Co-authored-by: ZX-ModelCloud <zx@modelcloud.ai> Co-authored-by: LRL2-ModelCloud <lrl2@modelcloud.ai> Co-authored-by: github-actions[bot] <github-actions[bot]@users.noreply.github.com> Co-authored-by: ZX-ModelCloud <165115237+ZX-ModelCloud@users.noreply.github.com> Co-authored-by: Marc Sun <57196510+SunMarc@users.noreply.github.com> Co-authored-by: Mohamed Mekkouri <93391238+MekkCyber@users.noreply.github.com>5 个月前
Bump torchao >=0.15 and fix quantization CI (#44604) * Fix * update expected * fix fp_quant * fix metal * fix mxfp4 * fix * fix bnb * style * bump torchao to 0.15.0 min to have safetensor support by default * style * fix torchao tests * Fix ! * style * update docs * update dockerfile * build quant docker * maybe this ? * uncomment2 个月前
Updated docker files to use `uv` for installing packages (#36957) * Updated docker files to use uv pip install as uv is blazingly fast. * Removed -y flag for uv pip uninstall. * Passed --no-build-isolation flag --------- Co-authored-by: Yih-Dar <2521628+ydshieh@users.noreply.github.com>1 年前
CircleCI with torch 2.11 (#45633) * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>1 个月前
CircleCI with torch 2.11 (#45633) * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>1 个月前
CircleCI with torch 2.11 (#45633) * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>1 个月前
CircleCI with torch 2.11 (#45633) * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>1 个月前
CircleCI with torch 2.11 (#45633) * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>1 个月前
No serving in quality docker image (#45677) no serving Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>29 天前
CircleCI with torch 2.11 (#45633) * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 * circleci with torch 2.11 --------- Co-authored-by: ydshieh <ydshieh@users.noreply.github.com>1 个月前
README.md

Dockers for transformers

In this folder you will find various docker files, and some subfolders.

  • dockerfiles (ex: consistency.dockerfile) present under ~/docker are used for our "fast" CIs. You should be able to use them for tasks that only need CPU. For example torch-light is a very light weights container (703MiB).
  • subfolders contain dockerfiles used for our slow CIs, which can be used for GPU tasks, but they are BIG as they were not specifically designed for a single model / single task. Thus the ~/docker/transformers-pytorch-gpu includes additional dependencies to allow us to run ALL model tests (say librosa or tesseract, which you do not need to run LLMs)

Note that in both case, you need to run uv pip install -e ., which should take around 5 seconds. We do it outside the dockerfile for the need of our CI: we checkout a new branch each time, and the transformers code is thus updated.

We are open to contribution, and invite the community to create dockerfiles with potential arguments that properly choose extras depending on the model's dependencies! 🤗