JJoseph Huber[libc] Fix invalid format specifier in benchmark
| 文件 | 最后提交记录 | 最后更新时间 |
|---|---|---|
[libc] Add Multithreaded GPU Benchmarks (#98964) This PR runs benchmarks on a 32 threads (A single warp on NVPTX) by default, adding the option for single threaded benchmarks. We can specify that a benchmark should be run on a single thread using the SINGLE_THREADED_BENCHMARK() macro. I chose to use a flag here so that other options could be added in the future. | 1 年前 | |
[libc] Add AMDGPU Timing to CMake (#99603) libc/benchmarks/gpu/timing/CMakeLists.txt did not correctly build amdgpu utils. This PR fixes that issue by adding amdgpu to the loop that adds the correct sub directories. | 1 年前 | |
| 1 年前 | ||
| 1 年前 | ||
[libc] Improve Benchmark UI (#99796) This PR changes the output to resemble Google Benchmark. e.g. Running Suite: LlvmLibcIsAlNumGpuBenchmark Benchmark | Cycles | Min | Max | Iterations | Time (ns) | Stddev | Threads | ----------------------------------------------------------------------------------------------------- IsAlnum | 92 | 76 | 482 | 23 | 86500 | 76 | 64 | IsAlnumSingleThread | 87 | 76 | 302 | 20 | 72000 | 49 | 1 | IsAlnumSingleWave | 87 | 76 | 302 | 20 | 72000 | 49 | 32 | IsAlnumCapital | 89 | 76 | 299 | 17 | 78500 | 52 | 64 | IsAlnumNotAlnum | 87 | 76 | 303 | 20 | 76000 | 49 | 64 | | 1 年前 | |
[libc] Fix invalid format specifier in benchmark Summary: This value is a uint32_t but is printed as a uint64_t, leading to invalid offsets when done on AMDGPU due to its packed format extending past the buffer. | 1 年前 | |
[libc] Add N Threads Benchmark Helper (#99834) This PR adds a BENCHMARK_N_THREADS() helper to register benchmarks with a specific number of threads. This PR replaces the flags used originally to allow any amount of threads. | 1 年前 | |
[libc] NVPTX Profiling (#92009) PR for adding microbenchmarking infrastructure for NVPTX. nvlink cannot perform LTO, so we cannot inline libc functions and this function call overhead is not adjusted for during microbenchmarking. | 1 年前 |