In the case that \widehat{\beta} has more than one component, and the components are independent, we use the same estimator on each component separately. If the probability density function of X_i , p(x) , had been known, then it would have been straightforward to do this by:
By repeated use of the above two points, many estimates of \widehat{\beta} can be obtained. The idea is to use the relative frequency of \widehat{\beta}^* (think of a histogram) as an estimate of p(\boldsymbol{t}) .