0

0

LLin-Bertfix docs.

2f568c76创建于 2023年11月8日历史提交

模型支持列表

NLP

masked_language_modeling

模型 model	模型规格 type	数据集 dataset	评估指标 metric	评估得分 score	配置 config
bert	bert_base_uncased	wiki	-	-	configs

text_classification

模型 model	模型规格 type	数据集 dataset	评估指标 metric	评估得分 score	配置 config
txtcls_bert	txtcls_bert_base_uncased txtcls_bert_base_uncased_mnli	Mnli Mnli	Entity F1 Entity F1	- 84.80%	configs

token_classification

模型 model	模型规格 type	数据集 dataset	评估指标 metric	评估得分 score	配置 config
tokcls_bert	tokcls_bert_base_chinese tokcls_bert_base_chinese_cluener	CLUENER CLUENER	Entity F1 Entity F1	- 0.7905	configs

question_answering

模型 model	模型规格 type	数据集 dataset	评估指标 metric	评估得分 score	配置 config
qa_bert	qa_bert_base_uncased qa_bert_base_chinese_uncased	SQuAD v1.1 SQuAD v1.1	EM / F1 EM / F1	80.74 / 88.33 -	configs

translation

模型 model	模型规格 type	数据集 dataset	评估指标 metric	评估得分 score	配置 config
t5	t5_small	WMT16	-	-	configs

text_generation

模型 model	模型规格 type	数据集 dataset	评估指标 metric	评估得分 score	配置 config
baichuan2	baichuan2_7b baichuan2_13b baichuan2_7b_lora baichuan2_13b_lora	- belle -	- -	- -	configs
llama2	llama2_7b llama2_13b llama2_7b_lora llama2_13b_lora llama2_70b	alpaca alpaca alpaca -	- - - -	- - - -	configs
glm2	glm2_6b glm2_6b_lora	ADGEN ADGEN	BLEU-4 / Rouge-1 / Rouge-2 / Rouge-l -	7.47 / 30.78 / 7.07 / 24.77 7.23 / 31.06 / 7.18 / 24.23	configs
llama	llama_7b llama_13b llama_7b_lora	alpaca alpaca alpaca -	- - - -	- - - -	configs
internlm	Internlm_7b	wikitext-2 alpaca	- -	- -	configs
ziya	ziya_13b	- -	- -	- -	configs
baichuan	baichuan_7b baichuan_13b	- -	- -	- -	configs
bloom	bloom_560m bloom_7.1b	alpaca alpaca alpaca alpaca	- - - -	- - - -	configs
glm	glm_6b glm_6b_lora	ADGEN ADGEN	BLEU-4 / Rouge-1 / Rouge-2 / Rouge-l -	8.42 / 31.75 / 7.98 / 25.28 -	configs
gpt2	gpt2_small gpt2_13b	wikitext-2 wikitext-2 wikitext-2	- - -	- - -	configs
pangualpha	pangualpha_2_6_b pangualpha_13b	悟道数据集悟道数据集	- -	- -	configs

CV

masked_image_modeling

模型 model	模型规格 type	数据集 dataset	评估指标 metric	评估得分 score	配置 config
mae	mae_vit_base_p16	ImageNet-1k	-	-	configs

image_classification

模型 model	模型规格 type	数据集 dataset	评估指标 metric	评估得分 score	配置 config
vit	vit_base_p16	ImageNet-1k	Accuracy	83.71%	configs
swin	swin_base_p4w7	ImageNet-1k	Accuracy	83.44%	configs

zero_shot_image_classification (by contrastive_language_image_pretrain)

模型 model	模型规格 type	数据集 dataset	评估指标 metric	评估得分 score	配置 config
clip	clip_vit_b_32 clip_vit_b_16 clip_vit_l_14 clip_vit_l_14@336	Cifar100 Cifar100 Cifar100 Cifar100	Accuracy Accuracy Accuracy Accuracy	57.24% 61.41% 69.67% 68.19%	configs
blip2	blip2_vit_g	- flickr30k -	- ITM -	- - -	configs

LLM大模型能力支持一览

模型 \ 特性	低参微调	边训边评	并行推理	流式推理	Chat	多轮对话	Lite推理
BaiChuan2-7B/13B	Lora	PPL	dp/mp	√	√	√	√
Llama2-7B/13B/70B	Lora	PPL	dp/mp	√	√	√	√
GLM2-6B	Lora/P-TuningV2	Bleu/Rouge	dp/mp	√	√	√	√
BILP2	×	×	dp/mp	√	×	×	×
BaiChuan-7B/13B	×	PPL	dp/mp	√	√	√	√
Llama-7B/13B	Lora	PPL	dp/mp	√	×	×	√
InternLM-7B	Lora	PPL	dp/mp	√	√	√	×
ZiYa-13B	×	PPL	dp/mp	√	×	×	×
Bloom-560m/7.1B	×	PPL	dp/mp	√	√	√	√
GLM-6B	Lora	Bleu/Rouge	dp/mp	√	√	√	√
GPT2-128m/13B	Lora	PPL	dp/mp	√	×	×	√
PanGuAlpha-2.6B/13B	×	PPL	dp/mp	√	×	×	×