Home Backend Development Python Tutorial How to Group Consecutive Values in a Pandas DataFrame?

How to Group Consecutive Values in a Pandas DataFrame?

Nov 30, 2024 am 06:47 AM

How to Group Consecutive Values in a Pandas DataFrame?

Grouping Consecutive Values in a Pandas DataFrame

In data analysis, we often encounter situations where data is ordered and there's a need to group consecutive values together. This task can be achieved in pandas using customized grouping techniques.

Suppose we have a DataFrame with a column named 'a' containing the following values:

[1, 1, -1, 1, -1, -1]
Copy after login

Our goal is to group these values into consecutive blocks, like so:

[1,1] [-1] [1] [-1, -1]
Copy after login

To accomplish this, we can employ the following steps:

  1. Create a custom Series: We create a new Series using the ne and shift functions. This Series returns a Boolean value indicating whether the current value is different from the previous value.
  2. Use the Series for grouping: We pass the custom Series to the groupby function. This groups the data by the consecutive blocks.
  3. Iterate over the grouped data: We iterate over the grouped data and print the index, the grouped DataFrame, and a list of the values in the 'a' column for each group.

Here's the code implementing these steps:

import pandas as pd

df = pd.DataFrame({'a': [1, 1, -1, 1, -1, -1]})
print(df)

custom_series = df['a'].ne(df['a'].shift()).cumsum()
print(custom_series)

for i, g in df.groupby(custom_series):
    print(i)
    print(g)
    print(g.a.tolist())
Copy after login

This outputs the desired grouping:

1
   a
0  1
1  1
[1, 1]
2
   a
2 -1
[-1]
3
   a
3  1
[1]
4
   a
4 -1
5 -1
[-1, -1]
Copy after login

The above is the detailed content of How to Group Consecutive Values in a Pandas DataFrame?. For more information, please follow other related articles on the PHP Chinese website!

Statement of this Website
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn

Hot Article Tags

Notepad++7.3.1

Notepad++7.3.1

Easy-to-use and free code editor

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

Zend Studio 13.0.1

Zend Studio 13.0.1

Powerful PHP integrated development environment

Dreamweaver CS6

Dreamweaver CS6

Visual web development tools

SublimeText3 Mac version

SublimeText3 Mac version

God-level code editing software (SublimeText3)

How to Use Python to Find the Zipf Distribution of a Text File How to Use Python to Find the Zipf Distribution of a Text File Mar 05, 2025 am 09:58 AM

How to Use Python to Find the Zipf Distribution of a Text File

How Do I Use Beautiful Soup to Parse HTML? How Do I Use Beautiful Soup to Parse HTML? Mar 10, 2025 pm 06:54 PM

How Do I Use Beautiful Soup to Parse HTML?

Image Filtering in Python Image Filtering in Python Mar 03, 2025 am 09:44 AM

Image Filtering in Python

How to Perform Deep Learning with TensorFlow or PyTorch? How to Perform Deep Learning with TensorFlow or PyTorch? Mar 10, 2025 pm 06:52 PM

How to Perform Deep Learning with TensorFlow or PyTorch?

Introduction to Parallel and Concurrent Programming in Python Introduction to Parallel and Concurrent Programming in Python Mar 03, 2025 am 10:32 AM

Introduction to Parallel and Concurrent Programming in Python

Serialization and Deserialization of Python Objects: Part 1 Serialization and Deserialization of Python Objects: Part 1 Mar 08, 2025 am 09:39 AM

Serialization and Deserialization of Python Objects: Part 1

How to Implement Your Own Data Structure in Python How to Implement Your Own Data Structure in Python Mar 03, 2025 am 09:28 AM

How to Implement Your Own Data Structure in Python

Mathematical Modules in Python: Statistics Mathematical Modules in Python: Statistics Mar 09, 2025 am 11:40 AM

Mathematical Modules in Python: Statistics

See all articles