Home Backend Development Python Tutorial Confusion Matrix Tricks in Python

Confusion Matrix Tricks in Python

Jun 11, 2023 am 10:43 AM
Skill python programming confusion matrix

With the popularity of machine learning and data mining, more and more data scientists and researchers are beginning to use Python, a high-level programming language, to process and analyze data, and Python’s intuitiveness and ease of use make it in-depth It is widely used in the fields of learning and artificial intelligence. However, many beginners encounter some difficulties when using Python, one of which is the difficulty of confusion matrix. In this article, we will introduce the use of confusion matrices in Python and some useful techniques when dealing with confusion matrices.

1. What is a confusion matrix

In deep learning and data mining, a confusion matrix is ​​a rectangular table used to compare the differences between predicted results and actual results. This matrix shows the performance of the classification algorithm, including important indicators such as accuracy, error rate, precision and recall of the classification algorithm. The confusion matrix usually visualizes the performance of the classifier and provides the main reference for the prediction results for the improvement and optimization of the classifier.

Normally, the confusion matrix consists of four parameters:

  • True Positive (TP): The classification algorithm correctly predicts the positive class as a positive class.
  • False Negative (FN): The classification algorithm incorrectly predicts a positive class as a negative class.
  • False Positive (FP): The classification algorithm incorrectly predicts a negative class as a positive class.
  • True Negative (TN): The classification algorithm correctly predicts a negative class as a negative class.

2. How to calculate the confusion matrix

The scikit-learn library in Python provides a convenient function to calculate the confusion matrix. This function is called confusion_matrix() and can be used as input between the classifier and the real results of the test set, and returns the parameter values ​​of the confusion matrix. The syntax of this function is as follows:

from sklearn.metrics import confusion_matrix
confusion_matrix(y_true, y_pred, labels=None, sample_weight=None)
Copy after login

Among them, y_true represents the correct result of the classifier, y_pred represents the prediction result of the classifier, and labels represents the name of the class label (if not provided, the default is from y_true and y_pred extracted value), sample_weight represents the weight of each sample (if not needed, do not set this parameter).

For example, suppose we need to calculate the confusion matrix of the following data:

y_true = [1, 0, 1, 2, 0, 1]
y_pred = [1, 0, 2, 1, 0, 2]
Copy after login

To calculate the confusion matrix, you can use the following code:

from sklearn.metrics import confusion_matrix
cm = confusion_matrix(y_true, y_pred)
print(cm)
Copy after login

The output result is:

array([[2, 0, 0],
       [0, 1, 2],
       [0, 1, 0]])
Copy after login

That is, the confusion matrix shows that "1" is correctly classified as "1" twice, "0" is correctly classified as "0" once, and "2" is correctly classified as "2" "There are 0 cases of "1" being misclassified as "2", there are 2 cases of "2" being misclassified as "1", and "0" being misclassified as "2" The situation occurred once.

3. Display the confusion matrix

There are many situations where we need a better visualization of the confusion matrix. The matplotlib library in Python can visualize confusion matrices. The following is Python code that uses the matplotlib library and sklearn.metrics to visualize the confusion matrix.

import itertools
import numpy as np
import matplotlib.pyplot as plt
from sklearn.metrics import confusion_matrix

def plot_confusion_matrix(cm, classes,
                          normalize=False,
                          title='Confusion matrix',
                          cmap=plt.cm.Blues):
    """
    This function prints and plots the confusion matrix.
    Normalization can be applied by setting `normalize=True`.
    """
    if normalize:
        cm = cm.astype('float') / cm.sum(axis=1)[:, np.newaxis]
        print("Normalized confusion matrix")
    else:
        print('Confusion matrix, without normalization')

    print(cm)

    plt.imshow(cm, interpolation='nearest', cmap=cmap)
    plt.title(title)
    plt.colorbar()
    tick_marks = np.arange(len(classes))
    plt.xticks(tick_marks, classes, rotation=45)
    plt.yticks(tick_marks, classes)

    fmt = '.2f' if normalize else 'd'
    thresh = cm.max() / 2.
    for i, j in itertools.product(range(cm.shape[0]), range(cm.shape[1])):
        plt.text(j, i, format(cm[i, j], fmt),
                 horizontalalignment="center",
                 color="white" if cm[i, j] > thresh else "black")

    plt.tight_layout()
    plt.ylabel('True label')
    plt.xlabel('Predicted label')

# Compute confusion matrix
cnf_matrix = confusion_matrix(y_test, y_pred)
np.set_printoptions(precision=2)

# Plot non-normalized confusion matrix
plt.figure()
plot_confusion_matrix(cnf_matrix, classes=class_names,
                      title='Confusion matrix, without normalization')

# Plot normalized confusion matrix
plt.figure()
plot_confusion_matrix(cnf_matrix, classes=class_names, normalize=True,
                      title='Normalized confusion matrix')

plt.show()
Copy after login

In the above code, we define a custom function named plot_confusion_matrix(), which accepts the parameters of the confusion matrix as parameters, the text string of the category name, and outputs the confusion matrix as a color image. The color of each cell of the confusion matrix represents the size of its value. Next, we need to calculate the confusion matrix using the respective true and predicted categories and represent the confusion matrix using the plot_confusion_matrix() function defined above.

4. Summary

The Python language provides a large number of visualization and data analysis libraries, which can enable data scientists and researchers to conduct deep learning and artificial intelligence data analysis more quickly. In this article, we introduce the confusion matrix and its applications, as well as how to calculate the confusion matrix in Python and how to use the matplotlib library to generate the graphics of the confusion matrix. Confusion matrix technology has important applications in the fields of deep learning and artificial intelligence. Therefore, it is very necessary to learn confusion matrix technology.

The above is the detailed content of Confusion Matrix Tricks in Python. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

AI Hentai Generator

AI Hentai Generator

Generate AI Hentai for free.

Hot Article

R.E.P.O. Energy Crystals Explained and What They Do (Yellow Crystal)
2 weeks ago By 尊渡假赌尊渡假赌尊渡假赌
Hello Kitty Island Adventure: How To Get Giant Seeds
1 months ago By 尊渡假赌尊渡假赌尊渡假赌
Two Point Museum: All Exhibits And Where To Find Them
1 months ago By 尊渡假赌尊渡假赌尊渡假赌

Hot Tools

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

Win11 Tips Sharing: Skip Microsoft Account Login with One Trick Win11 Tips Sharing: Skip Microsoft Account Login with One Trick Mar 27, 2024 pm 02:57 PM

Win11 Tips Sharing: One trick to skip Microsoft account login Windows 11 is the latest operating system launched by Microsoft, with a new design style and many practical functions. However, for some users, having to log in to their Microsoft account every time they boot up the system can be a bit annoying. If you are one of them, you might as well try the following tips, which will allow you to skip logging in with a Microsoft account and enter the desktop interface directly. First, we need to create a local account in the system to log in instead of a Microsoft account. The advantage of doing this is

What are the tips for novices to create forms? What are the tips for novices to create forms? Mar 21, 2024 am 09:11 AM

We often create and edit tables in excel, but as a novice who has just come into contact with the software, how to use excel to create tables is not as easy as it is for us. Below, we will conduct some drills on some steps of table creation that novices, that is, beginners, need to master. We hope it will be helpful to those in need. A sample form for beginners is shown below: Let’s see how to complete it! 1. There are two methods to create a new excel document. You can right-click the mouse on a blank location on the [Desktop] - [New] - [xls] file. You can also [Start]-[All Programs]-[Microsoft Office]-[Microsoft Excel 20**] 2. Double-click our new ex

A must-have for veterans: Tips and precautions for * and & in C language A must-have for veterans: Tips and precautions for * and & in C language Apr 04, 2024 am 08:21 AM

In C language, it represents a pointer, which stores the address of other variables; & represents the address operator, which returns the memory address of a variable. Tips for using pointers include defining pointers, dereferencing pointers, and ensuring that pointers point to valid addresses; tips for using address operators & include obtaining variable addresses, and returning the address of the first element of the array when obtaining the address of an array element. A practical example demonstrating the use of pointer and address operators to reverse a string.

VSCode Getting Started Guide: A must-read for beginners to quickly master usage skills! VSCode Getting Started Guide: A must-read for beginners to quickly master usage skills! Mar 26, 2024 am 08:21 AM

VSCode (Visual Studio Code) is an open source code editor developed by Microsoft. It has powerful functions and rich plug-in support, making it one of the preferred tools for developers. This article will provide an introductory guide for beginners to help them quickly master the skills of using VSCode. In this article, we will introduce how to install VSCode, basic editing operations, shortcut keys, plug-in installation, etc., and provide readers with specific code examples. 1. Install VSCode first, we need

Oracle database query skills: get only one piece of duplicate data Oracle database query skills: get only one piece of duplicate data Mar 08, 2024 pm 01:33 PM

Oracle database query skills: To obtain only one piece of duplicate data, specific code examples are required. In actual database queries, we often encounter situations where we need to obtain the only piece of data from duplicate data. This article will introduce how to use Oracle database techniques to obtain only one record in duplicate data, and provide specific code examples. Scenario Description Suppose we have a table named employee, which contains employee information. There may be duplicate employee information. We need to find all duplicates

PHP programming skills: How to jump to the web page within 3 seconds PHP programming skills: How to jump to the web page within 3 seconds Mar 24, 2024 am 09:18 AM

Title: PHP Programming Tips: How to Jump to a Web Page within 3 Seconds In web development, we often encounter situations where we need to automatically jump to another page within a certain period of time. This article will introduce how to use PHP to implement programming techniques to jump to a page within 3 seconds, and provide specific code examples. First of all, the basic principle of page jump is realized through the Location field in the HTTP response header. By setting this field, the browser can automatically jump to the specified page. Below is a simple example demonstrating how to use P

Win11 Tricks Revealed: How to Bypass Microsoft Account Login Win11 Tricks Revealed: How to Bypass Microsoft Account Login Mar 27, 2024 pm 07:57 PM

Win11 tricks revealed: How to bypass Microsoft account login Recently, Microsoft launched a new operating system Windows11, which has attracted widespread attention. Compared with previous versions, Windows 11 has made many new adjustments in terms of interface design and functional improvements, but it has also caused some controversy. The most eye-catching point is that it forces users to log in to the system with a Microsoft account. For some users, they may be more accustomed to logging in with a local account and are unwilling to bind their personal information to a Microsoft account.

Tips for using Laravel form classes: ways to improve efficiency Tips for using Laravel form classes: ways to improve efficiency Mar 11, 2024 pm 12:51 PM

Forms are an integral part of writing a website or application. Laravel, as a popular PHP framework, provides rich and powerful form classes, making form processing easier and more efficient. This article will introduce some tips on using Laravel form classes to help you improve development efficiency. The following explains in detail through specific code examples. Creating a form To create a form in Laravel, you first need to write the corresponding HTML form in the view. When working with forms, you can use Laravel

See all articles