`astype(int)` silently returns wrong answer under Windows #17640

Rik-de-Kort · 2020-10-26T12:52:53Z

Casting an array of float64 to int using astype with as argument int will yield the value -2147483648 under windows.

Reproducing code example:

x = 2384351503.0
np.testing.assert_array_equal(np.array([x]).astype(int), np.array([int(x)]))

I expect this test to go through, but it fails with the following message:

Traceback (most recent call last):
  File "C:\Users\koRR\AppData\Local\conda\conda\envs\LRE\lib\site-packages\IPython\core\interactiveshell.py", line 3417, in run_code
    exec(code_obj, self.user_global_ns, self.user_ns)
  File "<ipython-input-10-725c5d2f7abb>", line 1, in <module>
    np.testing.assert_array_equal(np.array([x]).astype(int), np.array([int(x)]))
  File "C:\Users\koRR\AppData\Local\conda\conda\envs\LRE\lib\site-packages\numpy\testing\_private\utils.py", line 931, in assert_array_equal
    verbose=verbose, header='Arrays are not equal')
  File "C:\Users\koRR\AppData\Local\conda\conda\envs\LRE\lib\site-packages\numpy\testing\_private\utils.py", line 840, in assert_array_compare
    raise AssertionError(msg)
AssertionError: 
Arrays are not equal
Mismatched elements: 1 / 1 (100%)
Max absolute difference: 4531835151
Max relative difference: 1.90065733

Using dtypes np.int64 works fine, already np.int32 returns the wrong answer (although that's 0 and the above result occurs only with the builtin int or np.int).

NumPy/Python version information:

Numpy version: 1.19.1
System version: '3.7.6 (default, Jan 8 2020, 20:23:39) [MSC v.1916 64 bit (AMD64)]'
Bug is taking place under Windows 10. Cannot reproduce on my Arch system with numpy 1.19.2 and python 3.8.5 built using GCC.

The text was updated successfully, but these errors were encountered:

eric-wieser · 2020-10-26T12:55:46Z

Your example is not the same as your error message - the test uses np.array([2384351503], dtype=int), while the error message shows you are using np.array([238435103]). At least on 1.17, the former raises an OverflowError, so I can't even run your example code.

eric-wieser · 2020-10-26T13:04:35Z

Bug is taking place under Windows 10. Cannot reproduce on my Arch system with numpy 1.19.2 and python 3.8.5 built using GCC.

int gives you a C long - and long is 32-bit on windows but 64-bit everywhere else.

Rik-de-Kort · 2020-10-27T07:36:01Z

Sorry about that, I was typing the message from outside the VM where I got it. Very sloppy on my part. I updated the OP with a proper example.

and long is 32-bit on windows but 64-bit everywhere else.

Thanks for the info. int has unbounded precision in Python so this really took me by surprise. Also it's weird that it didn't raise an overflow error.

charris · 2020-10-27T12:23:44Z

so this really took me by surprise.

Python 2 had two types of integer: int (C long) and long (unbounded precision). Python 3 only kept the second.

finoptimal-dev · 2021-06-01T20:49:44Z

Is there a workaround for this?

BvB93 · 2021-06-01T21:03:24Z

Is there a workaround for this?

Since the issue is caused by an overflow of np.int32 (i.e. the default int-type on windows) you can use np.int64 instead.

finoptimal-dev · 2021-06-01T21:05:23Z

Thanks, BvB93. Is there a way to map it globally? I have a codebase that works on linux with astype(int) all over it; now I have a Windows developer. Before I consider changing it everywhere it exists, I'm wondering if there's a solution that spares needing to take that route.

BvB93 · 2021-06-02T10:35:34Z

Thanks, BvB93. Is there a way to map it globally?

I think you might be out of luck here. As far as I'm aware there is no way of manually setting the default integer type to something different from what is specified by the platform in question.

charris · 2021-06-03T16:18:28Z

There has been some discussion about changing the default type. I believe the use of c_long is inherited from early python.

seberg · 2021-06-03T16:55:50Z

I had a PR once to make it np.intp at least, so that 64bit windows would at least use 64bits (making the rule "64bit on 64bit systems").

That was mainly for discussion, but I am more and more seeing a NumPy 2.0 in any case, and this might be important enough to fold in if it happens. (Even if I would prefer to not do too many of such changes at once and rather make this type of "breaking a bit" releases every few years so that the number of such changes are small each time.)

mattip · 2021-06-03T19:51:18Z

Is this done correctly in the NEP 47 array API?

seberg · 2021-06-03T20:01:16Z

@asmeurer can you check this? From what I remember of the pass I did just today, it probably isn't.

asmeurer · 2021-06-03T21:03:03Z

I think this issue is relevant data-apis/array-api#151

seberg · 2021-06-03T21:12:14Z

@asmeurer the point is that the standard probably wants a well defined "default integer"? But NumPy's default integer is currently not well defined:

It differs for windows 64bit compared to linux 64 bit (because long is defined differently) and is also 32bit on 32bit platforms.
We automatically "spill" into long long or unsigned long long ~~as noted here~~. EDIT np.array(2**63)

If we add a new, clean namespace we should try to make good use of it and fix both of these. (unless we are sure we fix both of them in NumPy proper, which I would like to try but is more difficult.)

asmeurer · 2021-06-03T21:28:01Z

I actually mean this change specifically https://github.com/data-apis/array-api/pull/167/files#diff-7e75cfe3133de16126433bc962f9fe14f216bd682e3870efa965bab40d08322fR9. Based on what is there, I guess we should make the "default" integer dtype int64 on 64-bit Windows. Note that in the array API itself the default integer dtype is only relevant in a couple of places.

seberg · 2021-06-03T21:39:04Z

@asmeurer good that it is spelled out nicely!

But your PR does not conform to this for asarra([1, 2, 3, 4]). We could probably make it conform from within NumPy at least with an abstract DType right now. Or you would have to write a light-weight asarray yourself, I guess.

asmeurer · 2021-06-03T21:44:50Z

You mean specifically on Windows? Or does asarray also use some value based casting in some cases?

What is an abstract DType? If there is some trick that would avoid having to reimplement asarray, that would be ideal.

seberg · 2021-06-03T22:23:22Z

We automatically "spill" into long long or unsigned long long as noted here. EDIT np.array(2**63)

This part is not windows specific.

What is an abstract DType

A DType class as per NEP 42. That could customize the dtype discovery during array coercion as per the NEP. But, it we may have to fix casting to/for abstract DTypes first. (Shouldn't be super hard, it should have an additional "common DType" based path, I think – that is probably already mentioned in NEP 42.)

Rik-de-Kort · 2021-06-04T12:49:57Z

Thanks, BvB93. Is there a way to map it globally? I have a codebase that works on linux with astype(int) all over it; now I have a Windows developer. Before I consider changing it everywhere it exists, I'm wondering if there's a solution that spares needing to take that route.

In my codebase this is exactly what I have done. Explicit is better than implicit after all. No problems changing over, no problems after. Would recommend!

numpy/numpy#17640)

seberg · 2023-11-03T07:17:40Z

Closing, we are trying to switch to 64bit on 64bi tplatforms for NumPy 2.0. See the dev release notes for example. Which will change the default on windows specifically.
(And if we may have to undo that, there is probably not much we can do about this.)

seberg · 2023-11-03T07:19:11Z

Note that for now, NumPy will still happily return uints and object for out of bound ints, this also doesn't affect 32bit platforms, just because it would be a bigger, more difficult to deal with change.

jklaise mentioned this issue Jan 11, 2022

RuntimeError: expected scalar type Long but found Int in [cd_spot_the_diff_mnist_wine.ipynb] SeldonIO/alibi-detect#411

Closed

patrick-kidger mentioned this issue Feb 15, 2022

jax.config.update("jax_enable_x64", True); jnp.array(..., dtype=int) produces 32-bit instead of 64-bit result on Windows jax-ml/jax#9574

Closed

peterdsharpe added a commit to peterdsharpe/AeroSandbox that referenced this issue Mar 4, 2022

.astype(int) -> .astype(int64), to fend off cross-platform issues (see:

ccfdad8

numpy/numpy#17640)

seberg mentioned this issue Oct 9, 2023

DISCUSS: What should the default integer type/dtype be #24890

Closed

seberg closed this as completed Nov 3, 2023

ebsmothers mentioned this issue May 15, 2024

(Windows 11) cross_entropy_loss(): RuntimeError: expected scalar type Long but found Int pytorch/torchtune#981

Closed

gaustin15 mentioned this issue Jun 21, 2024

Runtime error when running the demo walkthrough code korem-lab/DEBIAS-M#2

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

`astype(int)` silently returns wrong answer under Windows #17640

`astype(int)` silently returns wrong answer under Windows #17640

Rik-de-Kort commented Oct 26, 2020 •

edited

Loading

eric-wieser commented Oct 26, 2020 •

edited

Loading

eric-wieser commented Oct 26, 2020

Rik-de-Kort commented Oct 27, 2020

charris commented Oct 27, 2020

finoptimal-dev commented Jun 1, 2021

BvB93 commented Jun 1, 2021

finoptimal-dev commented Jun 1, 2021

BvB93 commented Jun 2, 2021

charris commented Jun 3, 2021

seberg commented Jun 3, 2021

mattip commented Jun 3, 2021

seberg commented Jun 3, 2021

asmeurer commented Jun 3, 2021

seberg commented Jun 3, 2021 •

edited

Loading

asmeurer commented Jun 3, 2021

seberg commented Jun 3, 2021

asmeurer commented Jun 3, 2021

seberg commented Jun 3, 2021 •

edited

Loading

Rik-de-Kort commented Jun 4, 2021

seberg commented Nov 3, 2023

seberg commented Nov 3, 2023

astype(int) silently returns wrong answer under Windows #17640

astype(int) silently returns wrong answer under Windows #17640

Comments

Rik-de-Kort commented Oct 26, 2020 • edited Loading

Reproducing code example:

NumPy/Python version information:

eric-wieser commented Oct 26, 2020 • edited Loading

eric-wieser commented Oct 26, 2020

Rik-de-Kort commented Oct 27, 2020

charris commented Oct 27, 2020

finoptimal-dev commented Jun 1, 2021

BvB93 commented Jun 1, 2021

finoptimal-dev commented Jun 1, 2021

BvB93 commented Jun 2, 2021

charris commented Jun 3, 2021

seberg commented Jun 3, 2021

mattip commented Jun 3, 2021

seberg commented Jun 3, 2021

asmeurer commented Jun 3, 2021

seberg commented Jun 3, 2021 • edited Loading

asmeurer commented Jun 3, 2021

seberg commented Jun 3, 2021

asmeurer commented Jun 3, 2021

seberg commented Jun 3, 2021 • edited Loading

Rik-de-Kort commented Jun 4, 2021

seberg commented Nov 3, 2023

seberg commented Nov 3, 2023

`astype(int)` silently returns wrong answer under Windows #17640

`astype(int)` silently returns wrong answer under Windows #17640

Rik-de-Kort commented Oct 26, 2020 •

edited

Loading

eric-wieser commented Oct 26, 2020 •

edited

Loading

seberg commented Jun 3, 2021 •

edited

Loading

seberg commented Jun 3, 2021 •

edited

Loading