Introduction to Kunpeng TensorFlow

Latest Updates

  • [2025.09.30]: Added the TensorFlow ANNC for graph compilation optimization feature, providing optimizations including computational graph optimization, and generation and integration of high-performance fused operators.
  • [2025.06.30]: Released the TensorFlow Serving thread scheduling optimization feature for the first time.

Project Introduction

Kunpeng TensorFlow is a high-performance inference acceleration extension based on open-source TensorFlow. It focuses on efficient execution in search, recommendation, and advertising inference scenarios. It significantly improves throughput and cuts latency for model inference through in-depth enhancements in graph optimization, operators, and runtime, providing top performance for AI applications based on Kunpeng CPUs.

Pixie provides the following features:

  • ANNC for graph compilation optimization: It leverages Kunpeng hardware affinity operators and TensorFlow graph fusion technologies to accelerate model inference.
  • Thread scheduling optimization: It uses Kunpeng affinity operator scheduling and thread pool management technologies to optimize TensorFlow operator scheduling in high-concurrency scenarios.

Figure 1 Project architecture project-architecture

  • Executor layer: runtime optimization
  • Kernel layer: custom operators, which provide high-performance DNN operators based on KDNN.
  • XLA layer: provides the Kunpeng graph compiler based on ANNC.

Feature Description

Feature

Description

Thread scheduling optimization

Refines operator scheduling algorithms and provides other thread management optimizations, delivering throughput improvements for concurrent model inference.

ANNC for graph compilation optimization

ANNC is a compiler dedicated to accelerating neural network computing. It focuses on technologies including computational graph optimization, generation and integration of high-performance fused operators, and efficient code generation and optimization. These capabilities significantly improve inference performance in recommendation scenarios.

For details about the features of Kunpeng TensorFlow, see Feature Introduction.

Release Notes

For details about the version updates of Kunpeng TensorFlow, see Release Notes.

Directory Structure

tensorflow
├── 0001-tensorflow_2.15.0-optimize.patch // TensorFlow patch file
├── LICENSE                                   // License file
├── README_en.md                                 // Open-source repository introduction
└── docs                                      // Documentation

Documents

Resource Type

Resource Name

Resource Description

Document

Release Notes

Provides basic information and feature updates of each Kunpeng TensorFlow release.

Document

Feature Introduction

Describes the Kunpeng TensorFlow features.

Document

Quick Start

Provides guidance for getting started with Kunpeng TensorFlow.

Document

Installation Guide

Describes how to compile and install Kunpeng TensorFlow.

Document

API Reference

Describes how to use Kunpeng TensorFlow APIs.

Disclaimer

This code repository contributes to the TensorFlow community. It strictly adheres to the coding style and methods, as well as security design of the native open-source software. Any vulnerability and security issues of the software shall be resolved by the corresponding upstream communities according to their response mechanisms. Please pay attention to the notifications and version updates released by the upstream communities. The Kunpeng computing community does not assume any responsibility for software vulnerabilities and security issues.

License

This project is licensed under Apache License 2.0. For details, see the LICENSE file.

This project document is licensed under CC-BY 4.0. For details, see the LICENSE file.

Contribution Statement

We welcome your contributions to the community. If you have any questions/suggestions or want to provide feedback on feature requirements and bug reports, you can submit issues. For details, see Contribution Guideline. You are also welcome to share insights in the Discussions. Thank you for your support.

Acknowledgments

Kunpeng TensorFlow is jointly developed by the following Huawei department:

  • Kunpeng Computing BoostKit Development Dept

Thank you for every PR from the community. We welcome your contributions to Kunpeng TensorFlow!