如何找到 Pandas DataFrame 特定列中具有最大值的行？-Python教程-PHP中文网

如何找到 Pandas DataFrame 特定列中具有最大值的行？

Patricia Arquette

发布： 2024-10-31 06:40:02

原创

947 人浏览过

How do you find the row with the maximum value in a specific column of a Pandas DataFrame?

在 Pandas DataFrame 中查找最大值

在 pandas 中，识别包含特定列最大值的行需要一种简单的方法。

使用 pandas.DataFrame.idxmax

pandas 库提供了 idxmax 函数，可以直接满足此需求。它检索给定列中具有最大值的行的索引标签。考虑以下示例：

<code class="python">import pandas as pd
import numpy as np

df = pd.DataFrame(np.random.randn(5, 3), columns=['A', 'B', 'C'])

print(df)
          A         B         C
0  1.232853 -1.979459 -0.573626
1  0.140767  0.394940  1.068890
2  0.742023  1.343977 -0.579745
3  2.125299 -0.649328 -0.211692
4 -0.187253  1.908618 -1.862934

print(df['A'].idxmax())  # row index with maximum value in column 'A'
print(df['B'].idxmax())  # row index with maximum value in column 'B'
print(df['C'].idxmax())  # row index with maximum value in column 'C'

# Output
3  # row index 3
4  # row index 4
1  # row index 1</code>

登录后复制

使用 numpy.argmax 的替代方法

或者，您可以使用 numpy.argmax 来获得相同的结果。它返回位置索引而不是标签索引。请记住，argmax 曾经被称为 idxmax，但后来被后者取代。

历史背景：行标签与整数索引

中pandas 的早期版本，行标签由整数索引而不是标签表示。这种做法虽然现在已经过时，但在许多常用的应用程序中仍然存在。

为了适应向标记行索引的转变，argmax 函数被修改为返回包含最大元素的行索引内的位置索引。此更改旨在减轻使用整数索引引起的混乱，特别是在重复行标签等情况下。

处理重复行标签

值得注意的是 idxmax 返回行标签，而不是整数。在存在重复行标签的情况下，使用 idxmax 就不够了。要在这种情况下获取位置索引，您可能需要手动从索引标签中提取它。

以上是如何找到 Pandas DataFrame 特定列中具有最大值的行？的详细内容。更多信息请关注PHP中文网其他相关文章！