Using scikit-learn metrics with multiclass models