In our discussion on Logistic Regression we studied the case of two classes, with yi either 0 or 1. Furthermore we assumed also that we have only two parameters β in our fitting, that is we defined probabilities
p(yi=1|xi,β)=exp(β0+β1xi)1+exp(β0+β1xi),p(yi=0|xi,β)=1−p(yi=1|xi,β),where β are the weights we wish to extract from data, in our case β0 and β1.