模型支持列表

NLP

masked_language_modeling

text_generation

模型
model
模型规格
type
数据集
dataset
评估指标
metric
评估得分
score
配置
config
llama2 llama2_7b
llama2_13b
llama2_7b_lora
llama2_13b_lora
llama2_70b
alpaca PPL / EM / F1 6.58 / 39.6 / 60.5
6.14 / 27.91 / 44.23
-
-
-
configs
glm2 glm2_6b
glm2_6b_lora
ADGEN BLEU-4 / Rouge-1 / Rouge-2 / Rouge-l 7.47 / 30.78 / 7.07 / 24.77
7.23 / 31.06 / 7.18 / 24.23
configs
glm3 glm3_6b ADGEN - - configs
gpt2 gpt2_small
gpt2_13b
wikitext-2 - - configs
baichuan2 baichuan2_7b
baichuan2_13b
baichuan2_7b_lora
baichuan2_13b_lora
belle - - configs
codegeex2 codegeex2_6b CodeAlpaca - - configs
codellama codellama_34b CodeAlpaca - - configs
deepseek deepseek_33b - - - configs
internlm internlm_7b
internlm_20b
alpaca - - configs
mixtral mixtral-8x7b wikitext-2 - - configs
qwen qwen_7b
qwen_14b
qwen_7b_lora
qwen_14b_lora
alpaca C-Eval 63.3
72.13
-
-
configs
qwen1.5 qwen1.5-14b
qwen1.5-72b
- - - configs
wizardcoder wizardcoder_15b CodeAlpaca MBPP Pass@1 50.8 configs
yi yi_6b
yi_34b
alpaca_gpt4_data_zh - - configs

LLM大模型能力支持一览

模型 \ 特性 低参微调 边训边评 Flash Attention 并行推理 流式推理 Chat 多轮对话
Llama2-7B/13B/70B Lora PPL dp/mp
GLM2-6B Lora/P-TuningV2 PPL/Bleu/Rouge dp/mp
GLM3-6B Lora × dp/mp
CodeGeex2-6B × PPL/Bleu/Rouge dp/mp
CodeLlama-34B Lora pass@1 dp/mp ×
GPT2-128m/13B Lora PPL dp/mp × ×
BaiChuan2-7B/13B Lora PPL dp/mp
Qwen-7B/14B × dp/mp
Qwen1.5-14B × × × dp/mp
Qwen1.5-72B Lora × dp/mp
InternLM-7B/20B Lora PPL dp/mp
Wizardcoder-15B × PPL × dp/mp
Deepseek-33B × × × × × ×
Mixtral-8×7B × × × × ×
Yi-6B Lora × dp/mp
Yi-34B × × × dp/mp × ×