BUG: fix read_parquet failing with TimedeltaIndex column (#59692) by tianlangqin · Pull Request #65117 · pandas-dev/pandas

tianlangqin · 2026-04-08T04:47:30Z

closes BUG: pyarrow cannot read timedelta64[ns] #59692
Tests added and passed if fixing a bug or adding a new feature
All code checks passed.
Added type annotations to new arguments/methods/functions.
Added an entry in the latest doc/source/whatsnew/vX.X.X.rst file if fixing a bug or adding a new feature.
I have reviewed and followed all the contribution guidelines
If I used AI to develop this pull request, I prompted it to follow AGENTS.md.

When a DataFrame whose column labels are a TimedeltaIndex is written to parquet with pyarrow and then read back, pyarrow reconstructs the column index by calling level.astype(dtype) with a unitless timedelta64 dtype. This raises a ValueError because pandas cannot tell which unit is meant.
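For readers unfamiliar with the term, the snippet below illustrates what "unitless" means at the NumPy level. It is only an illustration of the dtype distinction, not code from this PR; the variable names are made up for the example.

```python
import numpy as np

unitless = np.dtype("timedelta64")       # no resolution attached
with_unit = np.dtype("timedelta64[ns]")  # nanosecond resolution

# np.datetime_data reports the (unit, count) pair carried by the dtype.
print(np.datetime_data(unitless))   # ('generic', 1)
print(np.datetime_data(with_unit))  # ('ns', 1)

# Both dtypes share kind "m", so a kind check alone cannot tell them apart.
print(unitless.kind, with_unit.kind)  # m m
```

The "generic" unit is what a cast target looks like when the caller passes plain timedelta64 without a resolution.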

I fixed this by replacing a unitless timedelta64 with timedelta64[ns] by default in ExtensionArray.astype before calling TimedeltaArray._from_sequence. I also added two short-circuit checks, one in _astype_nansafe and one in TimedeltaArray.astype: if the source array already has a valid timedelta or datetime resolution and the requested target is unitless, the array is now returned as is instead of attempting an invalid conversion.
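The decision logic described above can be sketched roughly as follows. This is a simplified, NumPy-only approximation rather than the actual diff: the helper names is_unitless_timedelta and resolve_timedelta_target are hypothetical, the sketch covers only timedelta sources (not datetime ones), and it folds the three touched code paths into a single function.

```python
import numpy as np


def is_unitless_timedelta(dtype) -> bool:
    """Return True for a timedelta64 dtype with no resolution (hypothetical helper)."""
    dtype = np.dtype(dtype)
    # Check the kind first: np.datetime_data only accepts datetime-like dtypes.
    return dtype.kind == "m" and np.datetime_data(dtype)[0] == "generic"


def resolve_timedelta_target(source_dtype, target_dtype) -> np.dtype:
    """Pick the dtype a cast should actually use (hypothetical helper)."""
    source = np.dtype(source_dtype)
    target = np.dtype(target_dtype)
    if not is_unitless_timedelta(target):
        # An explicit target is left untouched.
        return target
    if source.kind == "m" and not is_unitless_timedelta(source):
        # Short circuit: the source already has a resolution, so keep it.
        return source
    # No usable resolution on the source: default to nanoseconds.
    return np.dtype("timedelta64[ns]")


print(resolve_timedelta_target("O", "timedelta64"))
print(resolve_timedelta_target("timedelta64[s]", "timedelta64"))
```

Under these assumptions, an object-dtype source cast to plain timedelta64 resolves to nanoseconds, while a source that already carries a resolution keeps it.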

The change in TimedeltaArray.astype relaxes the previous behavior, in which idx.astype("timedelta64") always raised a ValueError. That behavior was introduced in #13149 to prevent silent data corruption. The concern doesn't apply here, since casting from timedelta64[ns] to timedelta64 returns an identical array. The validation for the opposite direction remains unchanged.
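A quick way to check which behavior a given pandas build has is sketched below. The outcome is deliberately not asserted: depending on the installed version, and on whether this change is applied, the cast may raise or may return an index, so the snippet only reports what happened.

```python
import pandas as pd

# Pin the resolution explicitly so the example does not depend on the default unit.
idx = pd.to_timedelta(["1 day", "2 days"]).astype("timedelta64[ns]")

try:
    result = idx.astype("timedelta64")  # unitless target
except ValueError as err:
    outcome = f"raised ValueError: {err}"
else:
    outcome = f"returned dtype {result.dtype}"

print(outcome)
```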


jbrockmendel · 2026-04-09T00:22:20Z


+            from pandas._libs.tslibs import is_unitless
+
+            if is_unitless(dtype):


We excluded support for this intentionally.

tianlangqin · 2026-04-15

Thanks for the review. Since the error originates from pyarrow calling level.astype(dtype) with a unitless dtype during column reconstruction, where would you suggest handling this instead?

jbrockmendel · 2026-04-16

Where does this happen?

jbrockmendel · 2026-04-09T00:22:31Z

            return DatetimeArray._from_sequence(self, dtype=dtype, copy=copy)

        elif lib.is_np_dtype(dtype, "m"):
+            from pandas._libs.tslibs import is_unitless


Import at the top

tianlangqin added 2 commits April 8, 2026 00:26

BUG: fix read_parquet failing with TimedeltaIndex column (pandas-dev#…

69efe86

…59692)

handle unitless timedelta64 in TimedeltaArray.astype and update tests

7915ef1

jbrockmendel reviewed Apr 9, 2026

tianlangqin requested a review from jbrockmendel April 16, 2026 17:41


tianlangqin wants to merge 2 commits into pandas-dev:main from hbd9577:fix-59692-read-parquet-timedelta-columns


