There are mainly two steps
How do we construct the regions R_1,\dots,R_J ? In theory, the regions could have any shape. However, we choose to divide the predictor space into high-dimensional rectangles, or boxes, for simplicity and for ease of interpretation of the resulting predictive model. The goal is to find boxes R_1,\dots,R_J that minimize the MSE, given by
\sum_{j=1}^J\sum_{i\in R_j}(y_i-\overline{y}_{R_j})^2,where \overline{y}_{R_j} is the mean response for the training observations within box j .