Aim is to avoid having to load an extra `blake2b_s` into the stack (241 bytes). For context, a Nano S stack is only 1024 bytes.