marvell/linux/Documentation/speculation.txt - T108 - Gitiles

 This document explains potential effects of speculation, and how undesirable
 effects can be mitigated portably using common APIs.

 ===========
 Speculation
 ===========

 To improve performance and minimize average latencies, many contemporary CPUs
 employ speculative execution techniques such as branch prediction, performing
 work which may be discarded at a later stage.

 Typically speculative execution cannot be observed from architectural state,
 such as the contents of registers. However, in some cases it is possible to
 observe its impact on microarchitectural state, such as the presence or
 absence of data in caches. Such state may form side-channels which can be
 observed to extract secret information.

 For example, in the presence of branch prediction, it is possible for bounds
 checks to be ignored by code which is speculatively executed. Consider the
 following code::

 	int load_array(int *array, unsigned int index)
 	{
 		if (index >= MAX_ARRAY_ELEMS)
 			return 0;
 		else
 			return array[index];
 	}

 Which, on arm64, may be compiled to an assembly sequence such as::

 	CMP	<index>, #MAX_ARRAY_ELEMS
 	B.LT	less
 	MOV	<returnval>, #0
 	RET
   less:
 	LDR	<returnval>, [<array>, <index>]
 	RET

 It is possible that a CPU mis-predicts the conditional branch, and
 speculatively loads array[index], even if index >= MAX_ARRAY_ELEMS. This
 value will subsequently be discarded, but the speculated load may affect
 microarchitectural state which can be subsequently measured.

 More complex sequences involving multiple dependent memory accesses may
 result in sensitive information being leaked. Consider the following
 code, building on the prior example::

 	int load_dependent_arrays(int *arr1, int *arr2, int index)
 	{
 		int val1, val2,

 		val1 = load_array(arr1, index);
 		val2 = load_array(arr2, val1);

 		return val2;
 	}

 Under speculation, the first call to load_array() may return the value
 of an out-of-bounds address, while the second call will influence
 microarchitectural state dependent on this value. This may provide an
 arbitrary read primitive.

 ====================================
 Mitigating speculation side-channels
 ====================================

 The kernel provides a generic API to ensure that bounds checks are
 respected even under speculation. Architectures which are affected by
 speculation-based side-channels are expected to implement these
 primitives.

 The array_index_nospec() helper in <linux/nospec.h> can be used to
 prevent information from being leaked via side-channels.

 A call to array_index_nospec(index, size) returns a sanitized index
 value that is bounded to [0, size) even under cpu speculation
 conditions.

 This can be used to protect the earlier load_array() example::

 	int load_array(int *array, unsigned int index)
 	{
 		if (index >= MAX_ARRAY_ELEMS)
 			return 0;
 		else {
 			index = array_index_nospec(index, MAX_ARRAY_ELEMS);
 			return array[index];
 		}
 	}
	This document explains potential effects of speculation, and how undesirable
	effects can be mitigated portably using common APIs.

	===========
	Speculation
	===========

	To improve performance and minimize average latencies, many contemporary CPUs
	employ speculative execution techniques such as branch prediction, performing
	work which may be discarded at a later stage.

	Typically speculative execution cannot be observed from architectural state,
	such as the contents of registers. However, in some cases it is possible to
	observe its impact on microarchitectural state, such as the presence or
	absence of data in caches. Such state may form side-channels which can be
	observed to extract secret information.

	For example, in the presence of branch prediction, it is possible for bounds
	checks to be ignored by code which is speculatively executed. Consider the
	following code::

	int load_array(int *array, unsigned int index)
	{
	if (index >= MAX_ARRAY_ELEMS)
	return 0;
	else
	return array[index];
	}

	Which, on arm64, may be compiled to an assembly sequence such as::

	CMP <index>, #MAX_ARRAY_ELEMS
	B.LT less
	MOV <returnval>, #0
	RET
	less:
	LDR <returnval>, [<array>, <index>]
	RET

	It is possible that a CPU mis-predicts the conditional branch, and
	speculatively loads array[index], even if index >= MAX_ARRAY_ELEMS. This
	value will subsequently be discarded, but the speculated load may affect
	microarchitectural state which can be subsequently measured.

	More complex sequences involving multiple dependent memory accesses may
	result in sensitive information being leaked. Consider the following
	code, building on the prior example::

	int load_dependent_arrays(int arr1, int arr2, int index)
	{
	int val1, val2,

	val1 = load_array(arr1, index);
	val2 = load_array(arr2, val1);

	return val2;
	}

	Under speculation, the first call to load_array() may return the value
	of an out-of-bounds address, while the second call will influence
	microarchitectural state dependent on this value. This may provide an
	arbitrary read primitive.

	====================================
	Mitigating speculation side-channels
	====================================

	The kernel provides a generic API to ensure that bounds checks are
	respected even under speculation. Architectures which are affected by
	speculation-based side-channels are expected to implement these
	primitives.

	The array_index_nospec() helper in <linux/nospec.h> can be used to
	prevent information from being leaked via side-channels.

	A call to array_index_nospec(index, size) returns a sanitized index
	value that is bounded to [0, size) even under cpu speculation
	conditions.

	This can be used to protect the earlier load_array() example::

	int load_array(int *array, unsigned int index)
	{
	if (index >= MAX_ARRAY_ELEMS)
	return 0;
	else {
	index = array_index_nospec(index, MAX_ARRAY_ELEMS);
	return array[index];
	}
	}