blake2s_compress for the final block of the message.
The input msg must always be exactly 16 u32 elements, padded with zeros if necessary,
regardless of the value of byte_count. Using any padding scheme other than zero-padding
will produce a different hash output.