On Hoeffding’s Inequality
Hoeffding inequality was first proved in 1963 for independent bounded random variables. It captures the effect of cancellation among independent random variables, which results in the concentration of their linear combination. It is further extended to the sub-Gaussian random variables, which shows that the linear combination of sub-Gaussian random variables is still sub-Gaussian, and a careful analysis can show the property of the norms of sub-Gaussian random variables. We leave the investigation on the generalization to later notes.
This note will focuses on the proof of Hoeffding’s inequality. First, Let’s state the theorem. For simplicity, we just assume all the random variables are bounded by the same quantity.
Theorem. Assume are independent random variables in , with . Then
To prove the theorem, we first take off the absolute value by considering one-sided inequality. Then we use Laplace transform to deal with the independent sum, and apply the Markov inequality to get
Here comes the key trick in the proof. For , the function is always less than the secant line connecting the end points, that is,
Hence, we get
and we can apply the fact . So far, the problem has been converted to a calculus question, to give an upper bound of the function
implies that . Let’s make a substitution of variables,
Then the function becomes . We further take the log over the function and get . Compare this function with . , .
So we conclude that for all . And thus,
By optimizing the function
we get the desired result.