1# Bionic Benchmarks 2 3[TOC] 4 5## libc benchmarks (bionic-benchmarks) 6 7`bionic-benchmarks` is a command line tool for measuring the runtimes of libc functions. It is built 8on top of [Google Benchmark](https://github.com/google/benchmark) with some additions to organize 9tests into suites. 10 11### Device benchmarks 12 13 $ mmma bionic/benchmarks 14 $ adb root 15 $ adb sync data 16 $ adb shell /data/benchmarktest/bionic-benchmarks/bionic-benchmarks 17 $ adb shell /data/benchmarktest64/bionic-benchmarks/bionic-benchmarks 18 19By default, `bionic-benchmarks` runs all of the benchmarks in alphabetical order. Pass 20`--benchmark_filter=getpid` to run just the benchmarks with "getpid" in their name. 21 22### Host benchmarks 23 24See the `benchmarks/run-on-host.sh` script. The host benchmarks can be run with 32-bit or 64-bit 25Bionic, or the host glibc. 26 27### XML suites 28 29Suites are stored in the `suites/` directory and can be chosen with the command line flag 30`--bionic_xml`. 31 32To choose a specific XML file, use the `--bionic_xml=FILE.XML` option. By default, this option 33searches for the XML file in the `suites/` directory. If it doesn't exist in that directory, then 34the file will be found as relative to the current directory. If the option specifies the full path 35to an XML file such as `/data/nativetest/suites/example.xml`, it will be used as-is. 36 37If no XML file is specified through the command-line option, the default is to use `suites/full.xml`. 38However, for the host bionic benchmarks (`bionic-benchmarks-glibc`), the default is to use 39`suites/host.xml`. 40 41### XML suite format 42 43The format for a benchmark is: 44 45``` 46<fn> 47 <name>BM_sample_benchmark</name> 48 <cpu><optional_cpu_to_lock></cpu> 49 <iterations><optional_iterations_to_run></iterations> 50 <args><space separated list of function args|shorthand></args> 51</fn> 52``` 53 54XML-specified values for iterations and cpu take precedence over those specified via command line 55(via `--bionic_iterations` and `--bionic_cpu`, respectively.) 56 57To make small changes in runs, you can also schedule benchmarks by passing in their name and a 58space-separated list of arguments via the `--bionic_extra` command line flag, e.g. 59`--bionic_extra="BM_string_memcpy AT_COMMON_SIZES"` or `--bionic_extra="BM_string_memcmp 32 8 8"` 60 61Note that benchmarks will run normally if extra arguments are passed in, and it will fail 62with a segfault if too few are passed in. 63 64### Shorthand 65 66For the sake of brevity, multiple runs can be scheduled in one XML element by putting one of the 67following in the args field: 68 69 NUM_PROPS 70 MATH_COMMON 71 AT_ALIGNED_<ONE|TWO>BUF 72 AT_<any power of two between 2 and 16384>_ALIGNED_<ONE|TWO>BUF 73 AT_COMMON_SIZES 74 75Definitions for these can be found in bionic_benchmarks.cpp, and example usages can be found in 76the suites directory. 77 78### Unit Tests 79 80`bionic-benchmarks` also has its own set of unit tests, which can be run from the binary in 81`/data/nativetest[64]/bionic-benchmarks-tests` 82 83## Process startup time (bionic-spawn-benchmarks) 84 85The `spawn/` subdirectory has a few benchmarks measuring the time used to start simple programs 86(e.g. Toybox's `true` and `sh -c true`). Run it on a device like so: 87 88 m bionic-spawn-benchmarks 89 adb root 90 adb sync data 91 adb shell /data/benchmarktest/bionic-spawn-benchmarks/bionic-spawn-benchmarks 92 adb shell /data/benchmarktest64/bionic-spawn-benchmarks/bionic-spawn-benchmarks 93 94Google Benchmark reports both a real-time figure ("Time") and a CPU usage figure. For these 95benchmarks, the CPU measurement only counts time spent in the thread calling `posix_spawn`, not that 96spent in the spawned process. The real-time is probably more useful, and it is the figure used to 97determine the iteration count. 98 99Locking the CPU frequency seems to improve the results of these benchmarks significantly, and it 100reduces variability. 101 102## Google Benchmark notes 103 104### Repetitions 105 106Google Benchmark uses two settings to control how many times to run each benchmark, "iterations" and 107"repetitions". By default, the repetition count is one. Google Benchmark runs the benchmark a few 108times to determine a sufficiently-large iteration count. 109 110Google Benchmark can optionally run a benchmark run repeatedly and report statistics (median, mean, 111standard deviation) for the runs. To do so, pass the `--benchmark_repetitions` option, e.g.: 112 113 # ./bionic-benchmarks --benchmark_filter=BM_stdlib_strtoll --benchmark_repetitions=4 114 ... 115 ------------------------------------------------------------------- 116 Benchmark Time CPU Iterations 117 ------------------------------------------------------------------- 118 BM_stdlib_strtoll 27.7 ns 27.7 ns 25290525 119 BM_stdlib_strtoll 27.7 ns 27.7 ns 25290525 120 BM_stdlib_strtoll 27.7 ns 27.7 ns 25290525 121 BM_stdlib_strtoll 27.8 ns 27.7 ns 25290525 122 BM_stdlib_strtoll_mean 27.7 ns 27.7 ns 4 123 BM_stdlib_strtoll_median 27.7 ns 27.7 ns 4 124 BM_stdlib_strtoll_stddev 0.023 ns 0.023 ns 4 125 126There are 4 runs, each with 25290525 iterations. Measurements for the individual runs can be 127suppressed if they aren't needed: 128 129 # ./bionic-benchmarks --benchmark_filter=BM_stdlib_strtoll --benchmark_repetitions=4 --benchmark_report_aggregates_only 130 ... 131 ------------------------------------------------------------------- 132 Benchmark Time CPU Iterations 133 ------------------------------------------------------------------- 134 BM_stdlib_strtoll_mean 27.8 ns 27.7 ns 4 135 BM_stdlib_strtoll_median 27.7 ns 27.7 ns 4 136 BM_stdlib_strtoll_stddev 0.043 ns 0.043 ns 4 137 138### CPU frequencies 139 140To get consistent results between runs, it can sometimes be helpful to restrict a benchmark to 141specific cores, or to lock cores at specific frequencies. Some phones have a big.LITTLE core setup, 142or at least allow some cores to run at higher frequencies than others. 143 144A core can be selected for `bionic-benchmarks` using the `--bionic_cpu` option or using the 145`taskset` utility. e.g. A Pixel 3 device has 4 Kryo 385 Silver cores followed by 4 Gold cores: 146 147 blueline:/ # /data/benchmarktest64/bionic-benchmarks/bionic-benchmarks --benchmark_filter=BM_stdlib_strtoll --bionic_cpu=0 148 ... 149 ------------------------------------------------------------ 150 Benchmark Time CPU Iterations 151 ------------------------------------------------------------ 152 BM_stdlib_strtoll 64.2 ns 63.6 ns 11017493 153 154 blueline:/ # /data/benchmarktest64/bionic-benchmarks/bionic-benchmarks --benchmark_filter=BM_stdlib_strtoll --bionic_cpu=4 155 ... 156 ------------------------------------------------------------ 157 Benchmark Time CPU Iterations 158 ------------------------------------------------------------ 159 BM_stdlib_strtoll 21.8 ns 21.7 ns 33167103 160 161A similar result can be achieved using `taskset`. The first parameter is a bitmask of core numbers 162to pass to `sched_setaffinity`: 163 164 blueline:/ # taskset f /data/benchmarktest64/bionic-benchmarks/bionic-benchmarks --benchmark_filter=BM_stdlib_strtoll 165 ... 166 ------------------------------------------------------------ 167 Benchmark Time CPU Iterations 168 ------------------------------------------------------------ 169 BM_stdlib_strtoll 64.3 ns 63.6 ns 10998697 170 171 blueline:/ # taskset f0 /data/benchmarktest64/bionic-benchmarks/bionic-benchmarks --benchmark_filter=BM_stdlib_strtoll 172 ... 173 ------------------------------------------------------------ 174 Benchmark Time CPU Iterations 175 ------------------------------------------------------------ 176 BM_stdlib_strtoll 21.3 ns 21.2 ns 33094801 177 178To lock the CPU frequency, use the sysfs interface at `/sys/devices/system/cpu/cpu*/cpufreq/`. 179Changing the scaling governor to `performance` suppresses the warning that Google Benchmark 180otherwise prints: 181 182 ***WARNING*** CPU scaling is enabled, the benchmark real time measurements may be noisy and will incur extra overhead. 183 184Some devices have a `perf-setup.sh` script that locks CPU and GPU frequencies. Some TradeFed 185benchmarks appear to be using the script. For more information: 186 * run `get_build_var BOARD_PERFSETUP_SCRIPT` 187 * run `m perf-setup.sh` to install the script into `${OUT}/data/local/tmp/perf-setup.sh` 188 * see: https://android.googlesource.com/platform/platform_testing/+/refs/heads/master/scripts/perf-setup/ 189