Fix `numpy.ma.fix_invalid` issue in NumPy 2.1.0 by replacing with `numpy.ma.masked_invalid` #2042

kounelisagis · 2024-08-20T15:20:22Z

For an unknown reason, numpy.ma.fix_invalid behaves differently between NumPy 2.1.0 and NumPy 2.0.0. Specifically, when passing a pandas Series containing a numpy.nan value, numpy.ma.fix_invalid now makes changes in-place, even if the copy argument is set to its default value of True. This issue occurs only with pandas Series, not with NumPy arrays, for example.

Nevertheless, we don't actually need the numpy.ma.fix_invalid function since we handle NaNs later using numpy.nan_to_num. It would be wiser (and likely more performant) to use numpy.ma.masked_invalid, which simply creates a MaskedArray instance and allows us to obtain the mask from there.

cc: @jdblischak
Closes #2040

>>> import pandas as pd, numpy as np
>>> pd.__version__; np.__version__
'2.2.2'
'2.1.0'
>>> my_series = pd.Series([1.0, 2.0, np.nan, 0.0, 1.0])
>>> my_series
0    1.0
1    2.0
2    NaN
3    0.0
4    1.0
dtype: float64
>>> np.ma.fix_invalid(my_series)
masked_array(data=[1.0, 2.0, --, 0.0, 1.0],
             mask=[False, False,  True, False, False],
       fill_value=1e+20)
>>> my_series
0    1.000000e+00
1    2.000000e+00
2    1.000000e+20
3    0.000000e+00
4    1.000000e+00
dtype: float64

>>> import pandas as pd, numpy as np
>>> pd.__version__; np.__version__
'2.2.2'
'2.0.0'
>>> my_series = pd.Series([1.0, 2.0, np.nan, 0.0, 1.0])
>>> my_series
0    1.0
1    2.0
2    NaN
3    0.0
4    1.0
dtype: float64
>>> np.ma.fix_invalid(my_series)
masked_array(data=[1.0, 2.0, --, 0.0, 1.0],
             mask=[False, False,  True, False, False],
       fill_value=1e+20)
>>> my_series
0    1.0
1    2.0
2    NaN
3    0.0
4    1.0
dtype: float64

teo-tsirpanis · 2024-08-20T15:22:47Z

numpy.ma.fix_invalid behaves differently between NumPy 2.1.0 and NumPy 2.0.0

Is there an issue on NumPy about that? Can you open one if not?

teo-tsirpanis · 2024-08-20T15:24:55Z

Change seems fine. Launched nightlies from this branch and will approve if passed.

kounelisagis · 2024-08-20T15:26:06Z

numpy.ma.fix_invalid behaves differently between NumPy 2.1.0 and NumPy 2.0.0

Is there an issue on NumPy about that? Can you open one if not?

I couldn't find an existing issue, but I can open a new one.

teo-tsirpanis

One previously failing job now succeeds. Thanks!

Fix numpy 2.21.0

681dd69

kounelisagis requested review from teo-tsirpanis and KiterLuc August 20, 2024 15:20

teo-tsirpanis changed the title ~~Fix numpy.ma.fix_invalid issue in NumPy 2.21.0~~ Fix numpy.ma.fix_invalid issue in NumPy 2.1.0 Aug 20, 2024

teo-tsirpanis approved these changes Aug 20, 2024

View reviewed changes

kounelisagis changed the title ~~Fix numpy.ma.fix_invalid issue in NumPy 2.1.0~~ Fix numpy.ma.fix_invalid issue in NumPy 2.1.0 by replacing with numpy.ma.masked_invalid Aug 20, 2024

kounelisagis merged commit cbdc6ed into dev Aug 20, 2024
61 checks passed

kounelisagis deleted the agis/fix-numpy-2.21.0 branch August 20, 2024 15:35

kounelisagis mentioned this pull request Aug 20, 2024

BUG: numpy.ma.fix_invalid makes changes in-place in numpy 2.1.0 even with copy=True numpy/numpy#27253

Open

jdblischak mentioned this pull request Aug 20, 2024

The centralized nightlies job failed on Friday (2024-08-16) jdblischak/centralized-tiledb-nightlies#18

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix `numpy.ma.fix_invalid` issue in NumPy 2.1.0 by replacing with `numpy.ma.masked_invalid` #2042

Fix `numpy.ma.fix_invalid` issue in NumPy 2.1.0 by replacing with `numpy.ma.masked_invalid` #2042

kounelisagis commented Aug 20, 2024

teo-tsirpanis commented Aug 20, 2024

teo-tsirpanis commented Aug 20, 2024

kounelisagis commented Aug 20, 2024

teo-tsirpanis left a comment

Fix numpy.ma.fix_invalid issue in NumPy 2.1.0 by replacing with numpy.ma.masked_invalid #2042

Fix numpy.ma.fix_invalid issue in NumPy 2.1.0 by replacing with numpy.ma.masked_invalid #2042

Conversation

kounelisagis commented Aug 20, 2024

teo-tsirpanis commented Aug 20, 2024

teo-tsirpanis commented Aug 20, 2024

kounelisagis commented Aug 20, 2024

teo-tsirpanis left a comment

Choose a reason for hiding this comment

Fix `numpy.ma.fix_invalid` issue in NumPy 2.1.0 by replacing with `numpy.ma.masked_invalid` #2042

Fix `numpy.ma.fix_invalid` issue in NumPy 2.1.0 by replacing with `numpy.ma.masked_invalid` #2042