pandas.parser.CParserError: Error Tokenizing Data
Problem:
When attempting to manipulate a .csv file with Pandas, you encounter the following error:
pandas.parser.CParserError: Error tokenizing data. C error: Expected 2 fields in line 3, saw 12
Possible Resolution:
One potential solution to this issue is to use the on_bad_lines parameter when calling pd.read_csv():
data = pd.read_csv('GOOG Key Ratios.csv', on_bad_lines='skip')
By setting on_bad_lines to 'skip', Pandas will ignore any lines that it cannot parse and continue processing the remaining lines. This approach is useful if you can tolerate losing some bad lines.
If you prefer to handle invalid lines differently, such as displaying a warning or raising an exception, you can provide a custom callable function to the on_bad_lines parameter. For more information on handling malformed lines, refer to the Pandas documentation.
Note:
For Pandas versions prior to 1.3.0, you can use the error_bad_lines parameter to achieve the same result:
data = pd.read_csv("GOOG Key Ratios.csv", error_bad_lines=False)
The above is the detailed content of How to Resolve Pandas' `CParserError: Error tokenizing data` When Reading CSV Files?. For more information, please follow other related articles on the PHP Chinese website!