File | Last commit | Last updated
add bazel build | 2 months ago
embeddedclient mmap | 3 months ago
Hi all, this is yuanrong datasystem | 7 months ago
Hi all, this is yuanrong datasystem | 7 months ago
Hi all, this is yuanrong datasystem | 7 months ago
Hi all, this is yuanrong datasystem | 7 months ago
feat: Support three-step pipeline rh2d
Three-step pipeline:
1. worker2 -> worker1: urma write
2. worker1 -> kvclient: shared memory queue
3. kvclient -> gpu: cuda
The main difference from the two-step pipeline lies in sending chunk info from worker1 to kvclient through a shared memory queue.
Changes:
1. enlarge chunk max size from 16M to 4G
2. pin cuda memory in the connecting stage
3. support cancelling the pipeline on timeout in kvclient and worker
4. make client request id atomic
5. add vlog for pipeline debugging
6. auto find os-transport lib without installing rpm
7. support gpu id and gpu uuid as connection args
8. rename H2DParam to PiplnRh2dParam
9. worker no longer needs the cuda lib
10. clean up worker rpc channel when a worker is lost
Signed-off-by: mszjaas <mszjaas@qq.com> | 7 days ago
feat: Support three-step pipeline rh2d
Three-step pipeline:
1. worker2 -> worker1: urma write
2. worker1 -> kvclient: shared memory queue
3. kvclient -> gpu: cuda
The main difference from the two-step pipeline lies in sending chunk info from worker1 to kvclient through a shared memory queue.
Changes:
1. enlarge chunk max size from 16M to 4G
2. pin cuda memory in the connecting stage
3. support cancelling the pipeline on timeout in kvclient and worker
4. make client request id atomic
5. add vlog for pipeline debugging
6. auto find os-transport lib without installing rpm
7. support gpu id and gpu uuid as connection args
8. rename H2DParam to PiplnRh2dParam
9. worker no longer needs the cuda lib
10. clean up worker rpc channel when a worker is lost
Signed-off-by: mszjaas <mszjaas@qq.com> | 7 days ago