Sentry Answers>Python>

Iterate over rows in a Pandas DataFrame in Python

Iterate over rows in a Pandas DataFrame in Python

David Y.

The ProblemJump To Solution

How can I iterate over the rows of a Python Pandas DataFrame, accessing its elements by their column names?

The Solution

Iterating over rows is considered bad practice in Pandas, as it becomes quite slow for large DataFrames. It is worth considering whether the problem we’re aiming to solve with iteration over rows could instead be solved using one of the following approaches:

If neither of these solutions is appropriate and iterating over the rows of our DataFrame cannot be avoided, we can use DataFrame.itertuples(). This function returns rows as named tuples. For example:

Click to Copy
import pandas prices_df = pandas.DataFrame({'cost_price': [1, 2], 'sale_price': [2, 4]}, index=['apple', 'orange']) for row in prices_df.itertuples(): print(f"An {row.Index} costs ${row.cost_price} and sells for ${row.sale_price}.")

This will print the following:

Click to Copy
An apple costs $1 and sells for $2. An orange costs $2 and sells for $4.

DataFrame.iterrows() can also be used to iterate over DataFrame rows, but is slower than itertuples() and does not preserve data types.

  • Sentry BlogPython Performance Testing: A Comprehensive Guide
  • Sentry BlogLogging in Python: A Developer’s Guide
  • Syntax.fm logo
    Listen to the Syntax Podcast

    Tasty treats for web developers brought to you by Sentry. Get tips and tricks from Wes Bos and Scott Tolinski.

    SEE EPISODES

Considered “not bad” by 4 million developers and more than 100,000 organizations worldwide, Sentry provides code-level observability to many of the world’s best-known companies like Disney, Peloton, Cloudflare, Eventbrite, Slack, Supercell, and Rockstar Games. Each month we process billions of exceptions from the most popular products on the internet.

© 2024 • Sentry is a registered Trademark
of Functional Software, Inc.