fix: don't let CuPy iterate over Index with Python for loops #3142
Arguably, this is a performance thing, but it's so catastrophic that I'd call it a bug-fix.
@lgray noticed that slicing an Awkward-CUDA array was significantly slower than slicing an Awkward Array on the CPU. It's because CuPy thought that an `ak.index.Index` is a generic Python iterable, not a CUDA array.

We assumed that adding an `Index.__cuda_array_interface__` property would make CuPy notice that `Index` is a CUDA-capable array. The CuPy documentation describes instances in which it produces (for Numba) and consumes (from PyTorch) the `__cuda_array_interface__` property. However, none of those examples illustrate this pattern (indexing with `cp.ndarray.__getitem__`). Since they don't promise to promote CUDA-capable objects in slices, I can't be sure that this is a CuPy bug. At least to support existing versions of CuPy, we can't make this assumption.

I temporarily added this to `Index`:
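The snippet itself isn't reproduced in this description, but footnote 1 describes how it worked. The following is a hedged sketch of what such a temporary diagnostic could look like; the class body, attribute names, and error message are all assumptions for illustration, not the actual Awkward code:

```python
import numpy as np


class Index:
    """Illustrative stand-in for ak.index.Index with a temporary diagnostic.

    The "cycler" watches for __array_struct__, __array_interface__, and
    __array__ being probed in succession: that pattern means a consumer
    (such as CuPy) is treating Index as a generic array-like instead of
    reading __cuda_array_interface__.
    """

    _probe_order = ("__array_struct__", "__array_interface__", "__array__")

    def __init__(self, data):
        self._data = np.asarray(data)
        self._cycle = 0  # position in the expected probe sequence

    def _cycler(self, name):
        if self._probe_order[self._cycle] == name:
            self._cycle += 1
        else:
            # Out-of-order probe: restart (counting this probe if it begins a cycle).
            self._cycle = 1 if name == self._probe_order[0] else 0
        if self._cycle == len(self._probe_order):
            self._cycle = 0
            raise AssertionError("Index is being consumed as a generic array-like")

    @property
    def __array_struct__(self):
        self._cycler("__array_struct__")
        raise AttributeError("__array_struct__")  # fall through to the next protocol

    @property
    def __array_interface__(self):
        self._cycler("__array_interface__")
        raise AttributeError("__array_interface__")

    def __array__(self, dtype=None):
        self._cycler("__array__")
        return np.asarray(self._data, dtype=dtype)
```

Detecting the access *pattern* rather than the caller is the point: as the footnote explains, CuPy leaves no Python stack frames to inspect.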
and ran `pytest tests-cuda` to catch all instances that are covered by the tests.[^1] There were two (fixed in this PR).

@ManasviGoyal, can you check this?
@agoose77, do you think there might be other cases in which you assumed that CuPy would auto-promote an Index as a CUDA array?
Meanwhile, I'm going to report this to CuPy, just in case it is unintended/a bug.
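For reference, a plausible shape of the fix pattern, sketched with a NumPy stand-in (the real code uses CuPy, and the `data` attribute and `take` helper here are assumptions for illustration): index with the underlying buffer rather than the `Index` wrapper, so `__getitem__` never falls back to Python iteration.

```python
import numpy as np


class Index:
    """Minimal stand-in for ak.index.Index (a `data` buffer is assumed)."""

    def __init__(self, data):
        self.data = np.asarray(data)

    def __iter__(self):
        # The generic-iterable path that made GPU slicing catastrophically
        # slow: one Python-level element access per item.
        return iter(self.data)


def take(array, index):
    # Fix pattern: pass the raw buffer, not the wrapper, so the backend
    # (NumPy here, CuPy in Awkward-CUDA) does fancy indexing on a real array.
    return array[index.data]


arr = np.arange(10) * 10
print(take(arr, Index([3, 1, 4])))  # → [30 10 40]
```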
[^1]: We have to do this "cycler" thing to identify requests for `__array_struct__`, `__array_interface__`, and `__array__` in succession because CuPy does not appear in the stack trace: it's implemented in Cython, which doesn't emit Python stack frames.