加速库ReshapeAndCacheOperation C++ Demo

介绍

该目录下为加速库ReshapeAndCacheOperation C++调用示例。

示例中生成的数据不代表实际场景，如需数据生成参考请查看根目录下的python用例目录： tests/apitest/opstest/python/operations/reshape_and_cache/

本op在Atlas A2/A3系列和Atlas 推理系列产品上实现有所区别

提供demo分别对应不同产品的基础场景，编译运行时需要对应更改build脚本：

Atlas A2/A3：

参数设置：

成员名称	取值
compressType	COMPRESS_TYPE_UNDEFINED
kvCacheCfg	K_CACHE_V_CACHE

以下demo仅支持在Atlas A2/A3系列上运行。

reshape_and_cache_demo.cpp

tensor名字	数据类型	数据格式	维度信息
`key`	float16	nd	[2, 32, 128]
`value`	float16	nd	[2, 32, 128]
`keyCache`	float16	nd	[512, 128, 32, 128]
`valueCache`	float16	nd	[512, 128, 32, 128]
`slotMapping`	int32	nd	[2]
`keyCacheOut`	float16	nd	[512, 128, 32, 128]
`valueCacheOut`	float16	nd	[512, 128, 32, 128]

reshape_and_cache_demo_ds1.cpp

tensor名字	数据类型	数据格式	维度信息
`key`	bf16	nd	[5, 1, 128]
`value`	bf16	nd	[5, 1, 128]
`keyCache`	bf16	nd	[9, 128, 1, 128]
`valueCache`	bf16	nd	[9, 128, 1, 128]
`slotMapping`	int32	nd	[5]
`keyCacheOut`	bf16	nd	[9, 128, 1, 128]
`valueCacheOut`	bf16	nd	[9, 128, 1, 128]

reshape_and_cache_demo_ds2.cpp

tensor名字	数据类型	数据格式	维度信息
`key`	bf16	nd	[1024, 1, 128]
`value`	bf16	nd	[1024, 1, 128]
`keyCache`	bf16	nd	[9, 128, 1, 128]
`valueCache`	bf16	nd	[9, 128, 1, 128]
`slotMapping`	int32	nd	[1024]
`keyCacheOut`	bf16	nd	[9, 128, 1, 128]
`valueCacheOut`	bf16	nd	[9, 128, 1, 128]

reshape_and_cache_demo_ds3.cpp

tensor名字	数据类型	数据格式	维度信息
`key`	bf16	nd	[1, 1, 128]
`value`	bf16	nd	[1, 1, 128]
`keyCache`	bf16	nd	[9, 128, 1, 128]
`valueCache`	bf16	nd	[9, 128, 1, 128]
`slotMapping`	int32	nd	[1]
`keyCacheOut`	bf16	nd	[9, 128, 1, 128]
`valueCacheOut`	bf16	nd	[9, 128, 1, 128]

Atlas推理系列产品： reshape_and_cache_inference_demo.cpp

相较于A2/A3的demo，本示例主要有以下修改点：

参数设置：

成员名称	取值
compressType	COMPRESS_TYPE_UNDEFINED
kvCacheCfg	K_CACHE_V_CACHE

tensor名字	数据类型	数据格式	维度信息
`key`	bf16	nd	[3, 4, 128]
`value`	bf16	nd	[3, 4, 128]
`keyCache`	bf16	nd	[512, 32, 128, 16]
`valueCache`	bf16	nd	[512, 32, 128, 16]
`slotMapping`	int32	nd	[3]
`keyCacheOut`	bf16	nd	[512, 32, 128, 16]
`valueCacheOut`	bf16	nd	[512, 32, 128, 16]

更改编译脚本为： g++ -D_GLIBCXX_USE_CXX11_ABI=$cxx_abi -I "${ATB_HOME_PATH}/include" -I "${ASCEND_HOME_PATH}/include" -L "${ATB_HOME_PATH}/lib" -L "${ASCEND_HOME_PATH}/lib64" reshape_and_cache_inference_demo.cpp demo_util.h -l atb -l ascendcl -o reshape_and_cache_inference_demo
运行时调用： ./reshape_and_cache_inference_demo
该demo仅支持在Atlas 推理系列产品上运行