fix confusion matrix code #76
matteopilotto wants to merge 1 commit into nlp-with-transformers:main from
Conversation
I think the order was ok before, no? I know that it's a bit inconsistent that plot_confusion_matrix and confusion_matrix use reversed argument orders, but I think it's functionally correct, no?
Thanks for double-checking, Leandro. I triple-checked, and I still believe the code in your notebook and the resulting confusion matrix are not correct.
In your notebook you define plot_confusion_matrix as

```python
def plot_confusion_matrix(y_preds, y_true, labels):
```
where the order of the input is first predictions, then ground truth labels.
However, when you call the function, you pass the inputs in reverse order: first the ground truth labels, then the predicted labels:

```python
plot_confusion_matrix(df_tokens["labels"], df_tokens["predicted_label"], tags.names)
```
Hopefully the code makes what I'm trying to highlight clearer.
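To illustrate the effect with a toy sketch (hypothetical data, not the notebook's): swapping the two arguments to sklearn's confusion_matrix transposes the result, so rows and columns silently trade meaning between "true" and "predicted":

```python
import numpy as np
from sklearn.metrics import confusion_matrix

# Toy labels, purely to demonstrate the effect of swapping arguments
y_true = ["O", "O", "O", "I-LOC"]
y_pred = ["O", "I-LOC", "I-LOC", "I-LOC"]

cm = confusion_matrix(y_true, y_pred)          # rows = true, columns = predicted
cm_swapped = confusion_matrix(y_pred, y_true)  # arguments reversed

# The swapped call returns the transpose of the correct matrix
assert np.array_equal(cm_swapped, cm.T)
# This toy matrix is not symmetric, so the swap visibly changes the plot
assert not np.array_equal(cm_swapped, cm)
```

Since a confusion matrix is generally not symmetric, passing the arguments in the wrong order produces a plot whose rows and columns are mislabeled.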
In any case, the argument order is only one of the issues that, in my opinion, make the confusion matrix inaccurate. The second is the mismatch between the values in the confusion matrix and the labels.
If you run this simple code

```python
df_ILOC = df_tokens[df_tokens['labels'] == 'I-LOC']
(df_ILOC['labels'] == df_ILOC['predicted_label']).sum() / len(df_ILOC)
```
to check the accuracy of the I-LOC label, you will immediately notice that the model does not predict this label correctly 99% of the time, as the confusion matrix produced by your notebook (and shown in the book) suggests. In fact, the model's accuracy for this specific label (i.e. I-LOC) is around 85%, and the label actually predicted correctly with 99% accuracy is O.
The problem arises because sklearn's confusion_matrix sorts string inputs (i.e. our target and predicted labels) alphabetically. However, when you pass the labels to the display here,

```python
disp = ConfusionMatrixDisplay(confusion_matrix=cm, display_labels=labels)
```
you're passing them in the order defined in the dataset, which is not alphabetical:

```python
['O', 'B-PER', 'I-PER', 'B-ORG', 'I-ORG', 'B-LOC', 'I-LOC']
```
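One way to fix this (a sketch with toy data standing in for the notebook's) is to pass the same list to confusion_matrix via its labels= parameter, so the matrix rows and columns follow the dataset's order rather than sklearn's default alphabetical sorting:

```python
from sklearn.metrics import confusion_matrix, ConfusionMatrixDisplay

labels = ['O', 'B-PER', 'I-PER', 'B-ORG', 'I-ORG', 'B-LOC', 'I-LOC']

# Toy data, only to show the ordering behaviour
y_true = ['O', 'O', 'I-LOC', 'B-PER']
y_pred = ['O', 'O', 'O', 'B-PER']

# Default: rows/columns cover only the labels seen, sorted alphabetically,
# which does not match the dataset's label order above
cm_sorted = confusion_matrix(y_true, y_pred)

# Fix: force the matrix into the same order as the display labels
cm = confusion_matrix(y_true, y_pred, labels=labels)
disp = ConfusionMatrixDisplay(confusion_matrix=cm, display_labels=labels)
```

With labels= supplied, row i and column i of cm always correspond to labels[i], so the axis tick labels in the plot line up with the counts.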
Again thanks for your time and feel free to reach out to me at any time if you have any further questions or comments.
Contributing to open source is becoming more and more interesting...
fix issues mentioned in #75.
@lewtun @lvwerra when you're free, please take a look at it.