About the basic operations of pandas.DataFrame in python

Y2J
Release: 2017-05-09 14:50:31
Original
2487 people have browsed it

This article mainly introduces you to the method of excluding specific rows in pandas.DataFrame in python. The article gives detailed example codes. I believe it has certain reference value for everyone's understanding and learning. Friends who need it can follow Let’s take a look together.

Preface

When you use Python for data analysis, a data structure that you often use is the pandas DataFrame. About the basic operations of pandas.DataFrame in python , you can check this article.

pandas.DataFrame excludes specific rows

If we want to filter only one or certain rows like Excel, we can use isin( ) method, pass in the values ​​of the required rows in a list, you can also pass in a dictionary and specify columns for filtering.

But if we only want all content that does not contain a specific line, there is no isnotin() method. I encountered such a requirement at work today. After searching frequently, I found that I could only use isin() in another way to achieve this requirement.

The example is as follows:

In [3]: df = pd.DataFrame([['GD', 'GX', 'FJ'], ['SD', 'SX', 'BJ'], ['HN', 'HB'
 ...: , 'AH'], ['HEN', 'HEN', 'HLJ'], ['SH', 'TJ', 'CQ']], columns=['p1', 'p2
 ...: ', 'p3'])

In [4]: df
Out[4]:
 p1 p2 p3
0 GD GX FJ
1 SD SX BJ
2 HN HB AH
3 HEN HEN HLJ
4 SH TJ CQ
Copy after login

If you only want the two rows where p1 is GD and HN, you can do this:

In [8]: df[df.p1.isin(['GD', 'HN'])]
Out[8]:
 p1 p2 p3
0 GD GX FJ
2 HN HB AH
Copy after login

But if we want To get data other than these two rows, you need to take a detour.

The principle is to first take out p1 and convert it into a list, then remove unnecessary rows (values) from the list, and then use isin() in the DataFrame

In [9]: ex_list = list(df.p1)

In [10]: ex_list.remove('GD')

In [11]: ex_list.remove('HN')

In [12]: ex_list
Out[12]: ['SD', 'HEN', 'SH']

In [13]: df[df.p1.isin(ex_list)]
Out[13]:
 p1 p2 p3
1 SD SX BJ
3 HEN HEN HLJ
4 SH TJ CQ
Copy after login

Summary

[Related recommendations]

1. Python Free Video Tutorial

2. Python basic introductory tutorial

3. Python meets data collection video tutorial

The above is the detailed content of About the basic operations of pandas.DataFrame in python. For more information, please follow other related articles on the PHP Chinese website!

Related labels:
source:php.cn
Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Popular Tutorials
More>
Latest Downloads
More>
Web Effects
Website Source Code
Website Materials
Front End Template