msmodelslim/lab_practice/default/default-w8a8.yaml - Code Preview - MindStudio-ModelSlim: a model compression tool for the Ascend ecosystem - AtomGit

ascend-robot [msmodelslim][feature] Support outlier suppression for default structures

apiversion: modelslim_v1
metadata:
  config_id: default-w8a8
  score: 50
  verified_model_types: [ ]
  label:
    w_bit: 8  # weight bit width
    a_bit: 8  # activation bit width
    is_sparse: False
    kv_cache: False
spec:
  process:
    - type: "iter_smooth"  # outlier suppression, runs before quantization
      include:
        - "*"
    - type: "linear_quant"
      qconfig:
        act:
          scope: "per_token"
          dtype: "int8"
          symmetric: True
          method: "minmax"
        weight:
          scope: "per_channel"
          dtype: "int8"
          symmetric: True
          method: "minmax"
      include: [ "*" ]
  runner: "model_wise"
  save:
    - type: "ascendv1_saver"
      part_file_size: 4
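
The `linear_quant` step above requests symmetric `minmax` quantization to `int8`, with one scale per token for activations and one scale per channel for weights. The following is a minimal pure-Python sketch of that general technique, not msmodelslim's actual implementation. The function names `quantize_row` and `quantize_rows` are hypothetical, and the layout is an assumption: activations are taken as `[tokens, hidden]` and weights as `[out_channels, in_features]`, so that both `per_token` and `per_channel` reduce to one scale per row. Clamping to the range [-127, 127] is also an assumption; some implementations use -128 as the lower bound.

```python
def quantize_row(values, n_bits=8):
    """Symmetric min-max quantization of one row (one token or one channel).

    Returns (quantized_ints, scale) such that value ~= quantized_int * scale.
    """
    qmax = 2 ** (n_bits - 1) - 1          # 127 for int8
    max_abs = max(abs(v) for v in values)  # "minmax": range taken from observed extremes
    # Symmetric: zero-point is fixed at 0, so only a scale is needed.
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return quantized, scale


def quantize_rows(matrix, n_bits=8):
    """Apply quantize_row independently to each row.

    With activations shaped [tokens, hidden] this is per-token quantization;
    with weights shaped [out_channels, in_features] it is per-channel.
    """
    return [quantize_row(row, n_bits) for row in matrix]


def dequantize_row(quantized, scale):
    """Map integers back to approximate real values."""
    return [q * scale for q in quantized]


if __name__ == "__main__":
    weights = [
        [254.0, -128.0, 0.0, 100.0],  # channel 0
        [0.5, -0.25, 0.125, 0.0],     # channel 1: much smaller range, gets its own scale
    ]
    for q, s in quantize_rows(weights):
        print(q, s)
```

Because each row gets its own scale, a row with small magnitudes is not forced onto the coarse grid dictated by a row with large magnitudes, which is the usual motivation for choosing `per_token` and `per_channel` scopes over a single per-tensor scale.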