Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 1 | # Benchmarking |
| 2 | |
| 3 | IREE uses benchmarks to inspect performance at varying levels of granularity. |
| 4 | Benchmarking is implemented using the |
Lei Zhang | c0b34e8 | 2020-10-08 14:35:48 -0400 | [diff] [blame] | 5 | [Google Benchmark library](https://github.com/google/benchmark). To understand |
| 6 | performance details and guide optimization, please refer to the |
| 7 | IREE [profiling](./profiling.md) documentation. |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 8 | |
| 9 | ## Module Benchmarks |
| 10 | |
| 11 | `iree-benchmark-module` is a program accepting (almost) the same inputs as |
| 12 | `iree-run-module` that will benchmark the invocation of a single entry function. |
| 13 | It measures timing for the whole process of invoking a function through the VM, |
| 14 | including allocating and freeing output buffers. This is a high-level benchmark |
| 15 | of an entire invocation flow. It provides a big picture view, but depends on |
| 16 | many different variables, like an integration test. For finer-grained |
Kojo Acquah | 12d3b28 | 2021-11-10 20:53:53 -0800 | [diff] [blame] | 17 | measurements more akin to unit tests, see [Executable Benchmarks](#executable-benchmarks). |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 18 | |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 19 | To use `iree-benchmark-module`, generate an IREE module for the target backend: |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 20 | |
| 21 | ```shell |
Stella Laurenzo | 7f2972c | 2022-03-19 14:09:43 -0700 | [diff] [blame] | 22 | $ bazel run //iree/tools:iree-compile -- \ |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 23 | -iree-mlir-to-vm-bytecode-module \ |
Ben Vanik | ab3b79f | 2021-05-14 11:19:17 -0700 | [diff] [blame] | 24 | -iree-hal-target-backends=vmvx \ |
CindyLiu | 63fd39f | 2022-04-22 22:21:15 -0700 | [diff] [blame^] | 25 | $PWD/samples/models/simple_abs.mlir \ |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 26 | -o /tmp/module.fb |
| 27 | ``` |
| 28 | |
| 29 | and then benchmark an exported function in that module: |
| 30 | |
| 31 | ```shell |
| 32 | $ bazel run //iree/tools:iree-benchmark-module -- \ |
Han-Chung Wang | 8255d34 | 2020-10-05 07:32:51 -0700 | [diff] [blame] | 33 | --module_file=/tmp/module.fb \ |
Ben Vanik | ab3b79f | 2021-05-14 11:19:17 -0700 | [diff] [blame] | 34 | --driver=vmvx \ |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 35 | --entry_function=abs \ |
CindyLiu | af7dfeb | 2021-06-08 22:31:37 +0000 | [diff] [blame] | 36 | --function_input=f32=-2 |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 37 | ``` |
| 38 | |
| 39 | You'll see output like |
| 40 | |
| 41 | ```shell |
| 42 | Run on (12 X 4500 MHz CPU s) |
| 43 | CPU Caches: |
| 44 | L1 Data 32K (x6) |
| 45 | L1 Instruction 32K (x6) |
| 46 | L2 Unified 1024K (x6) |
| 47 | L3 Unified 8448K (x1) |
| 48 | Load Average: 2.21, 1.93, 3.34 |
| 49 | ***WARNING*** CPU scaling is enabled, the benchmark real time measurements may |
| 50 | be noisy and will incur extra overhead. |
| 51 | ***WARNING*** Library was built as DEBUG. Timings may be affected. |
| 52 | ------------------------------------------------------------------------------ |
| 53 | Benchmark Time CPU Iterations |
| 54 | ------------------------------------------------------------------------------ |
Phoenix Meadowlark | fed6620 | 2020-06-16 11:17:19 -0700 | [diff] [blame] | 55 | BM_RunModule/process_time/real_time 0.22 ms 0.23 ms 3356 |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 56 | ``` |
| 57 | |
| 58 | Notice that there are a few warnings in there (you may not see all of these). |
| 59 | The benchmark library helpfully warns about some common issues that will affect |
| 60 | benchmark timing. When trying to obtain real benchmark numbers, you should |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 61 | generally build an optimized build (`-c opt` in Bazel) and |
| 62 | [disable CPU scaling](#cpu-configuration). |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 63 | |
| 64 | ```shell |
| 65 | $ bazel build -c opt //iree/tools:iree-benchmark-module |
| 66 | ``` |
| 67 | |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 68 | Another thing to consider is that depending on where you are running the |
| 69 | benchmark you might want to avoid additional programs running at the same time. |
| 70 | Bazel itself runs a server even when it's not being actively invoked that can be |
| 71 | quite a memory hog, so we'll instead invoke the binary directly. Use your |
| 72 | favorite process manager (e.g. [htop](https://hisham.hm/htop/) or |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 73 | [pkill](https://en.wikipedia.org/wiki/Pkill) on Linux) to kill heavy-weight |
| 74 | programs such as Chrome and Bazel. |
| 75 | |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 76 | Now we'll actually invoke the binary: |
| 77 | |
| 78 | ```shell |
| 79 | $ ./bazel-bin/iree/tools/iree-benchmark-module \ |
Han-Chung Wang | 8255d34 | 2020-10-05 07:32:51 -0700 | [diff] [blame] | 80 | --module_file=/tmp/module.fb \ |
Ben Vanik | ab3b79f | 2021-05-14 11:19:17 -0700 | [diff] [blame] | 81 | --driver=vmvx \ |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 82 | --entry_function=abs \ |
CindyLiu | af7dfeb | 2021-06-08 22:31:37 +0000 | [diff] [blame] | 83 | --function_input=f32=-2 |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 84 | ``` |
| 85 | |
| 86 | ```shell |
| 87 | Run on (12 X 4500 MHz CPU s) |
| 88 | CPU Caches: |
| 89 | L1 Data 32K (x6) |
| 90 | L1 Instruction 32K (x6) |
| 91 | L2 Unified 1024K (x6) |
| 92 | L3 Unified 8448K (x1) |
| 93 | Load Average: 1.49, 3.42, 3.49 |
| 94 | ------------------------------------------------------------------------------ |
| 95 | Benchmark Time CPU Iterations |
| 96 | ------------------------------------------------------------------------------ |
Phoenix Meadowlark | fed6620 | 2020-06-16 11:17:19 -0700 | [diff] [blame] | 97 | BM_RunModule/process_time/real_time 0.011 ms 0.014 ms 61654 |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 98 | ``` |
| 99 | |
Hanhan Wang | 904f0c1 | 2020-06-03 13:46:36 -0700 | [diff] [blame] | 100 | Remember to [restore CPU scaling](#cpu-configuration) when you're done. |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 101 | |
Han-Chung Wang | d13da6a | 2020-10-30 09:00:53 -0700 | [diff] [blame] | 102 | ## Executable Benchmarks |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 103 | |
Han-Chung Wang | d13da6a | 2020-10-30 09:00:53 -0700 | [diff] [blame] | 104 | We also benchmark the performance of individual parts of the IREE system in |
| 105 | isolation. IREE breaks a model down to dispatch functions. To benchmark all the |
Ben Vanik | bfe0bc4 | 2021-03-12 10:51:55 -0800 | [diff] [blame] | 106 | dispatch functions, generate an IREE module with the |
| 107 | `-iree-flow-export-benchmark-funcs` flag set: |
Han-Chung Wang | d13da6a | 2020-10-30 09:00:53 -0700 | [diff] [blame] | 108 | |
| 109 | ```shell |
Stella Laurenzo | 7f2972c | 2022-03-19 14:09:43 -0700 | [diff] [blame] | 110 | $ build/iree/tools/iree-compile \ |
CindyLiu | af7dfeb | 2021-06-08 22:31:37 +0000 | [diff] [blame] | 111 | -iree-input-type=mhlo \ |
Ben Vanik | bfe0bc4 | 2021-03-12 10:51:55 -0800 | [diff] [blame] | 112 | -iree-mlir-to-vm-bytecode-module \ |
| 113 | -iree-flow-export-benchmark-funcs \ |
Ben Vanik | ab3b79f | 2021-05-14 11:19:17 -0700 | [diff] [blame] | 114 | -iree-hal-target-backends=vmvx \ |
Han-Chung Wang | d13da6a | 2020-10-30 09:00:53 -0700 | [diff] [blame] | 115 | iree/test/e2e/models/fullyconnected.mlir \ |
| 116 | -o /tmp/fullyconnected.vmfb |
| 117 | ``` |
| 118 | |
| 119 | and then benchmark all exported dispatch functions (and all exported functions) |
| 120 | in that module: |
| 121 | |
| 122 | ```shell |
| 123 | $ build/iree/tools/iree-benchmark-module |
| 124 | --module_file=/tmp/fullyconnected.vmfb |
Ben Vanik | ab3b79f | 2021-05-14 11:19:17 -0700 | [diff] [blame] | 125 | --driver=vmvx |
Han-Chung Wang | d13da6a | 2020-10-30 09:00:53 -0700 | [diff] [blame] | 126 | ``` |
| 127 | |
| 128 | If no `entry_function` is specified, `iree-benchmark-module` will register a |
| 129 | benchmark for each exported function that takes no inputs. |
| 130 | |
| 131 | You will see output like: |
| 132 | |
| 133 | ```shell |
| 134 | Run on (72 X 3700 MHz CPU s) |
| 135 | CPU Caches: |
| 136 | L1 Data 32 KiB (x36) |
| 137 | L1 Instruction 32 KiB (x36) |
| 138 | L2 Unified 1024 KiB (x36) |
| 139 | L3 Unified 25344 KiB (x2) |
| 140 | Load Average: 4.39, 5.72, 6.76 |
| 141 | --------------------------------------------------------------------------------------------- |
| 142 | Benchmark Time CPU Iterations |
| 143 | --------------------------------------------------------------------------------------------- |
Ben Vanik | bfe0bc4 | 2021-03-12 10:51:55 -0800 | [diff] [blame] | 144 | BM_main_ex_dispatch_0_benchmark/process_time/real_time 0.030 ms 0.037 ms 34065 |
| 145 | BM_main_ex_dispatch_1_benchmark/process_time/real_time 0.034 ms 0.042 ms 20567 |
| 146 | BM_main_ex_dispatch_2_benchmark/process_time/real_time 0.043 ms 0.051 ms 18576 |
| 147 | BM_main_ex_dispatch_3_benchmark/process_time/real_time 0.029 ms 0.036 ms 21345 |
| 148 | BM_main_ex_dispatch_4_benchmark/process_time/real_time 0.042 ms 0.051 ms 15880 |
| 149 | BM_main_ex_dispatch_5_benchmark/process_time/real_time 0.030 ms 0.037 ms 17854 |
| 150 | BM_main_ex_dispatch_6_benchmark/process_time/real_time 0.043 ms 0.052 ms 14919 |
| 151 | BM_main_benchmark/process_time/real_time 0.099 ms 0.107 ms 5892 |
Han-Chung Wang | d13da6a | 2020-10-30 09:00:53 -0700 | [diff] [blame] | 152 | ``` |
Geoffrey Martin-Noble | 36dd317 | 2020-02-14 16:03:49 -0800 | [diff] [blame] | 153 | |
| 154 | ### Bytecode Module Benchmarks |
| 155 | |
Lei Zhang | c0b34e8 | 2020-10-08 14:35:48 -0400 | [diff] [blame] | 156 | Normally, the IREE VM is expected to be integrated into applications and driving |
| 157 | model execution. So its performance is of crucial importance. We strive to |
| 158 | introduce as little overhead as possible and have several benchmark binaries |
| 159 | dedicated for evaluating the VM's performance. These benchmark binaries are |
Geoffrey Martin-Noble | f822f92 | 2020-12-28 09:11:19 -0800 | [diff] [blame] | 160 | named as `*_benchmark` in the |
| 161 | [`iree/vm/`](https://github.com/google/iree/tree/main/iree/vm) directory. They |
| 162 | also use the Google Benchmark library as the above. |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 163 | |
| 164 | ## CPU Configuration |
| 165 | |
| 166 | When benchmarking, it's important to consider the configuration of your CPUs. |
| 167 | Most notably, CPU scaling can give variable results, so you'll usually want to |
| 168 | disable it. This can get pretty complex, but the most basic thing to do is to |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 169 | run all CPUs at maximum frequency. The other thing to consider is what CPU(s) |
| 170 | your program is running on. Both of these get more complicated on mobile and in |
| 171 | multithreaded workloads. |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 172 | |
| 173 | ### Linux |
| 174 | |
| 175 | Google benchmark provides some |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 176 | [instructions](https://github.com/google/benchmark#disabling-cpu-frequency-scaling). |
| 177 | Note that the library will print "CPU scaling is enabled" warnings for any |
| 178 | configuration that |
| 179 | [doesn't have the quota governor set to performance](https://github.com/google/benchmark/blob/3d1c2677686718d906f28c1d4da001c42666e6d2/src/sysinfo.cc#L228). |
| 180 | Similarly the CPU frequency it reports is the |
| 181 | [maximum frequency of cpu0](https://github.com/google/benchmark/blob/3d1c2677686718d906f28c1d4da001c42666e6d2/src/sysinfo.cc#L533), |
| 182 | not the frequency of the processor it's actually running on. This means that |
| 183 | more advanced configurations should ignore these messages. |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 184 | |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 185 | Turn off CPU scaling before benchmarking. |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 186 | |
| 187 | ```shell |
| 188 | $ sudo cpupower frequency-set --governor performance |
| 189 | ``` |
| 190 | |
Hanhan Wang | 904f0c1 | 2020-06-03 13:46:36 -0700 | [diff] [blame] | 191 | Restore CPU scaling after benchmarking: |
| 192 | |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 193 | ```shell |
| 194 | $ sudo cpupower frequency-set --governor powersave |
| 195 | ``` |
| 196 | |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 197 | To learn more about different quota |
| 198 | governor settings, see |
| 199 | https://www.kernel.org/doc/Documentation/cpu-freq/governors.txt. To restrict |
| 200 | which CPUs you run on, use the `taskset` command which takes a hexadecimal mask. |
| 201 | |
| 202 | To only run on the lowest-numbered CPU you can run |
| 203 | |
| 204 | ```shell |
| 205 | $ taskset 1 sleep 20 & |
| 206 | ``` |
| 207 | |
| 208 | You can confirm that the process is running on the given CPU: |
| 209 | |
| 210 | ```shell |
| 211 | $ ps -o psr $! |
| 212 | ``` |
| 213 | |
| 214 | Note that `$!` indicates the process ID of the last executed background command, |
| 215 | so you can only use this shorthand if you didn't run any commands after the |
| 216 | sleep. For more info on taskset, see https://linux.die.net/man/1/taskset. |
| 217 | |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 218 | ### Android |
| 219 | |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 220 | Read and understand the [Linux](#linux) instructions first. |
| 221 | |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 222 | Android doesn't give us quite as nice tooling, but the principle is basically |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 223 | the same. One important difference is that thermal throttling is a much bigger |
| 224 | concern on mobile. Without a cooling plate, it is likely that high clock speeds |
| 225 | will overheat the device and engage thermal throttling, which will ignore |
| 226 | whatever clock speeds you may have set to prevent things from catching on fire. |
| 227 | Therefore the naive approach above is likely not a good idea. |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 228 | |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 229 | You will likely need to be root (use `su` or `adb root`). The commands will |
| 230 | depend on your exact phone and number of cores. First play around and make sure |
| 231 | you understand what everything means. Note that each CPU has its own files which |
| 232 | are used to control its behavior, but changes to a single CPU will sometimes |
| 233 | affect others (see `/sys/devices/system/cpu/cpu0/cpufreq/affected_cpus`). |
| 234 | |
| 235 | Some useful files: |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 236 | |
| 237 | ```shell |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 238 | /proc/cpuinfo |
| 239 | /sys/devices/system/cpu/possible |
| 240 | /sys/devices/system/cpu/present |
| 241 | /sys/devices/system/cpu/cpu0/online |
| 242 | /sys/devices/system/cpu/cpu0/cpufreq/scaling_available_governors |
| 243 | /sys/devices/system/cpu/cpu0/cpufreq/scaling_governor |
| 244 | /sys/devices/system/cpu/cpu0/cpufreq/scaling_available_frequencies |
| 245 | /sys/devices/system/cpu/cpu0/cpufreq/cpuinfo_max_freq |
| 246 | /sys/devices/system/cpu/cpu0/cpufreq/cpuinfo_min_freq |
| 247 | /sys/devices/system/cpu/cpu0/cpufreq/cpuinfo_cur_freq |
| 248 | /sys/devices/system/cpu/cpu0/cpufreq/affected_cpus |
| 249 | /sys/devices/system/cpu/cpu0/cpufreq/scaling_setspeed |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 250 | ``` |
| 251 | |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 252 | See the clockspeed of each CPU |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 253 | |
| 254 | ```shell |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 255 | $ for i in `cat /sys/devices/system/cpu/present | tr '-' ' ' | xargs seq`; do \ |
| 256 | paste \ |
| 257 | "/sys/devices/system/cpu/cpu${i?}/cpufreq/cpuinfo_cur_freq" \ |
| 258 | "/sys/devices/system/cpu/cpu${i?}/cpufreq/cpuinfo_min_freq" \ |
| 259 | "/sys/devices/system/cpu/cpu${i?}/cpufreq/cpuinfo_max_freq"; \ |
| 260 | done |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 261 | ``` |
| 262 | |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 263 | Before changing things, make sure to check the current scaling governor settings |
| 264 | first so you can put them back when you're done. |
| 265 | |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 266 | ```shell |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 267 | $ for i in `cat /sys/devices/system/cpu/present | tr '-' ' ' | xargs seq`; do \ |
| 268 | cat "/sys/devices/system/cpu/cpu${i?}/cpufreq/scaling_governor"; \ |
| 269 | done |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 270 | ``` |
| 271 | |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 272 | #### Single-Core Example |
| 273 | |
| 274 | Here's an example to run IREE in a single-threaded context on CPU 7 at its |
| 275 | lowest clock speed. |
| 276 | |
| 277 | First we'll take control of the clockspeed by setting the governor to |
| 278 | "userspace". |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 279 | |
| 280 | ```shell |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 281 | $ for i in `cat /sys/devices/system/cpu/present | tr '-' ' ' | xargs seq`; do \ |
| 282 | echo userspace > \ |
| 283 | "/sys/devices/system/cpu/cpu${i?}/cpufreq/scaling_governor"; \ |
| 284 | done |
| 285 | ``` |
| 286 | |
| 287 | We can now set individual clock speeds. We'll pin cpu7 to its minimum frequency. |
| 288 | We choose the minimum instead of the maximum here to mitigate thermal throttling |
| 289 | concerns |
| 290 | |
| 291 | ```shell |
| 292 | $ cat /sys/devices/system/cpu/cpu7/cpufreq/cpuinfo_min_freq > \ |
| 293 | /sys/devices/system/cpu/cpu7/cpufreq/scaling_setspeed |
| 294 | ``` |
| 295 | |
| 296 | We can confirm the frequencies of all the CPUs by running the same command |
| 297 | above. Now to run a command specifically on cpu7, use `taskset 80` |
| 298 | (hex for 10000000): |
| 299 | |
| 300 | ```shell |
Phoenix Meadowlark | e162bf6 | 2021-02-11 14:58:03 -0800 | [diff] [blame] | 301 | $ taskset 80 sleep 20 & |
Geoffrey Martin-Noble | 1e58032 | 2020-10-19 09:56:13 -0700 | [diff] [blame] | 302 | $ ps -o psr $! |
| 303 | ``` |
| 304 | |
| 305 | Remember to cleanup when you're done! Here we'll set the scaling governor back |
| 306 | to schedutil because that's what they were before on the particular device this, |
| 307 | was tested on, but that may not exist on all devices. |
| 308 | |
| 309 | ```shell |
| 310 | $ for i in `cat /sys/devices/system/cpu/present | tr '-' ' ' | xargs seq`; do \ |
| 311 | echo schedutil > \ |
| 312 | "/sys/devices/system/cpu/cpu${i?}/cpufreq/scaling_governor"; \ |
| 313 | done |
Geoffrey Martin-Noble | 48d2d6d | 2020-03-02 16:04:58 -0800 | [diff] [blame] | 314 | ``` |
| 315 | |
| 316 | TODO(scotttodd): Windows instructions |