Attention networks for document classification