FIX: silent data loss in MultiIndex __setitem__ with object-dtype level by Bahtya · Pull Request #65119 · pandas-dev/pandas

Bahtya · 2026-04-08T10:37:58Z

Problem

df[key] = df[key] / x silently drops the assignment (no error, no warning) when:

DataFrame has a column MultiIndex
Level 1 has object dtype due to mixed types (e.g. string + int)
The top-level label has exactly one sub-column

cols = pd.MultiIndex.from_tuples(
    [("info", "M"), ("info", 0), ("earnings", 1), ("earnings", 2), ("prices", 0)]
)
df = pd.DataFrame(np.arange(20, dtype=float).reshape(4, 5), columns=cols)
df["prices"] = df["prices"] / 100  # silent no-op! values unchanged

This is a regression from 2.3.x → 3.0.x and causes silent data loss.

Root Cause

In _set_item_frame_value, the guard added for GH#62518/GH#61841 checks:

is_string_dtype(cols_droplevel.dtype) and not cols_droplevel.any()

When maybe_droplevels produces Index([0], dtype="object"):

is_string_dtype(object) → True
Index([0]).any() → False (because 0 is falsy)

This causes the early return to trigger incorrectly, silently discarding the assignment.

Fix

Replace not cols_droplevel.any() with (cols_droplevel == "").all() to explicitly check for empty strings:

and len(cols_droplevel) > 0
and (cols_droplevel == "").all()

This correctly identifies actual empty-string columns without being fooled by falsy integer values in object-dtype Indexes.

Testing

Verified locally that the fix resolves the issue from the bug report:

# Before fix: prices values unchanged [4. 9. 14. 19.]
# After fix: prices values correctly divided [0.04 0.09 0.14 0.19]

Also confirmed the original GH#62518/GH#61841 cases are still protected since their columns genuinely contain only empty strings.

Fixes #65118

…type level When setting a top-level column on a DataFrame with a MultiIndex where level 1 has object dtype (mixed types), assignments to single-subcolumn groups are silently dropped. Root cause: In _set_item_frame_value, the guard for GH#62518/GH#61841 (avoiding reindex into empty-string columns) used is_string_dtype(cols_droplevel.dtype) and not cols_droplevel.any(). For object-dtype Index containing integer 0, is_string_dtype returns True and any() returns False (0 is falsy), causing the early return to trigger incorrectly. Fix: Replace not cols_droplevel.any() with (cols_droplevel == "").all() to explicitly check for empty strings instead of relying on truthiness. Fixes pandas-dev#65118 Signed-off-by: bahtya <bahtyar153@qq.com>

jbrockmendel · 2026-04-10T03:29:32Z

Pls add test

Bahtya · 2026-04-11T04:07:17Z

Thanks for the review! I'll add a test for the silent data loss case in MultiIndex __setitem__ with object-dtype levels.

Bahtya · 2026-04-17T16:20:14Z

Hi @jbrockmendel, thanks for the review! I'll add tests for this fix. Working on it now.

…#65118)

Bahtya · 2026-04-18T13:09:32Z

Hi @jbrockmendel, I've added three test cases for this fix:

test_multiindex_setitem_object_dtype_level_no_silent_drop — exact reproduction from the bug report (mixed string/int level values, df["prices"] = df["prices"] / 100)
test_multiindex_setitem_object_dtype_level_single_subcolumn — minimal case with integer 0 as level value
test_multiindex_setitem_object_dtype_level_falsy_values — ensures columns with empty string "" alongside falsy integer 0 are still writable

All tests pass locally. Ready for re-review!

TST: add tests for MultiIndex __setitem__ with object-dtype level (GH…

dac24e9

…#65118)

jbrockmendel mentioned this pull request Apr 20, 2026

BUG: Fix silent __setitem__ discard for MultiIndex with falsy sub-column label (GH#65118) #65303

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

FIX: silent data loss in MultiIndex setitem with object-dtype level#65119

FIX: silent data loss in MultiIndex setitem with object-dtype level#65119
Bahtya wants to merge 2 commits intopandas-dev:mainfrom
Bahtya:fix/multiindex-setitem-silent-drop

Bahtya commented Apr 8, 2026

Uh oh!

jbrockmendel commented Apr 10, 2026

Uh oh!

Bahtya commented Apr 11, 2026

Uh oh!

Bahtya commented Apr 17, 2026

Uh oh!

Bahtya commented Apr 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

Bahtya commented Apr 8, 2026

Problem

Root Cause

Fix

Testing

Uh oh!

jbrockmendel commented Apr 10, 2026

Uh oh!

Bahtya commented Apr 11, 2026

Uh oh!

Bahtya commented Apr 17, 2026

Uh oh!

Bahtya commented Apr 18, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants