The problem of exploding or vanishing gradients

RNNs have difficulty dealing with long-range dependencies.