|  | This document explains potential effects of speculation, and how undesirable | 
|  | effects can be mitigated portably using common APIs. | 
|  |  | 
|  | =========== | 
|  | Speculation | 
|  | =========== | 
|  |  | 
|  | To improve performance and minimize average latencies, many contemporary CPUs | 
|  | employ speculative execution techniques such as branch prediction, performing | 
|  | work which may be discarded at a later stage. | 
|  |  | 
|  | Typically speculative execution cannot be observed from architectural state, | 
|  | such as the contents of registers. However, in some cases it is possible to | 
|  | observe its impact on microarchitectural state, such as the presence or | 
|  | absence of data in caches. Such state may form side-channels which can be | 
|  | observed to extract secret information. | 
|  |  | 
|  | For example, in the presence of branch prediction, it is possible for bounds | 
|  | checks to be ignored by code which is speculatively executed. Consider the | 
|  | following code: | 
|  |  | 
|  | int load_array(int *array, unsigned int index) | 
|  | { | 
|  | if (index >= MAX_ARRAY_ELEMS) | 
|  | return 0; | 
|  | else | 
|  | return array[index]; | 
|  | } | 
|  |  | 
|  | Which, on arm64, may be compiled to an assembly sequence such as: | 
|  |  | 
|  | CMP	<index>, #MAX_ARRAY_ELEMS | 
|  | B.LT	less | 
|  | MOV	<returnval>, #0 | 
|  | RET | 
|  | less: | 
|  | LDR	<returnval>, [<array>, <index>] | 
|  | RET | 
|  |  | 
|  | It is possible that a CPU mis-predicts the conditional branch, and | 
|  | speculatively loads array[index], even if index >= MAX_ARRAY_ELEMS. This | 
|  | value will subsequently be discarded, but the speculated load may affect | 
|  | microarchitectural state which can be subsequently measured. | 
|  |  | 
|  | More complex sequences involving multiple dependent memory accesses may | 
|  | result in sensitive information being leaked. Consider the following | 
|  | code, building on the prior example: | 
|  |  | 
|  | int load_dependent_arrays(int *arr1, int *arr2, int index) | 
|  | { | 
|  | int val1, val2, | 
|  |  | 
|  | val1 = load_array(arr1, index); | 
|  | val2 = load_array(arr2, val1); | 
|  |  | 
|  | return val2; | 
|  | } | 
|  |  | 
|  | Under speculation, the first call to load_array() may return the value | 
|  | of an out-of-bounds address, while the second call will influence | 
|  | microarchitectural state dependent on this value. This may provide an | 
|  | arbitrary read primitive. | 
|  |  | 
|  | ==================================== | 
|  | Mitigating speculation side-channels | 
|  | ==================================== | 
|  |  | 
|  | The kernel provides a generic API to ensure that bounds checks are | 
|  | respected even under speculation. Architectures which are affected by | 
|  | speculation-based side-channels are expected to implement these | 
|  | primitives. | 
|  |  | 
|  | The array_index_nospec() helper in <linux/nospec.h> can be used to | 
|  | prevent information from being leaked via side-channels. | 
|  |  | 
|  | A call to array_index_nospec(index, size) returns a sanitized index | 
|  | value that is bounded to [0, size) even under cpu speculation | 
|  | conditions. | 
|  |  | 
|  | This can be used to protect the earlier load_array() example: | 
|  |  | 
|  | int load_array(int *array, unsigned int index) | 
|  | { | 
|  | if (index >= MAX_ARRAY_ELEMS) | 
|  | return 0; | 
|  | else { | 
|  | index = array_index_nospec(index, MAX_ARRAY_ELEMS); | 
|  | return array[index]; | 
|  | } | 
|  | } |