[WIP] Report all unsupported operations for a query in cudf.polars #16960

Matt711 · 2024-10-01T02:05:29Z

Description

Closes #16690. The purpose of this PR is to list all of the unique operations that are unsupported by cudf.polars when running a query. The current approach is to create a new node (ErrorNode) in the IR when translating polars IR to cudf.polars IR if the translation step fails with a NotImplementedError. And then traverse the new tree and report where ErrorNodes occured to the user.

Question: How to traverse the tree to report the error nodes? Should this be done upstream in Polars?
Instead of traversing the query afterwards, we should probably catch each unsupported feature as we translate the IR.

Checklist

I am familiar with the Contributing Guidelines.
New or existing tests cover these changes.
The documentation is up to date with these changes.

wence- · 2024-10-01T09:40:24Z

python/cudf_polars/cudf_polars/dsl/ir.py

+    def evaluate(self, *, cache: MutableMapping[int, DataFrame]) -> DataFrame:
+        return pl.DataFrame()


Suggested change

def evaluate(self, *, cache: MutableMapping[int, DataFrame]) -> DataFrame:

return pl.DataFrame()

def evaluate(self, *, cache: MutableMapping[int, DataFrame]) -> DataFrame:

return DataFrame([])

The object evaluate returns should be an internal DataFrame, rather than a polars DataFrame.

But, plausibly we don't want to define this method at all, and just let the default implementation raise.

It would be nice if we could keep track of the exception so that we can report it later.

I agree that we should keep track of the errors. There's two ways off the top of my head, either maintain a global list that we append to in debug mode, or, have the ErrorNode constructor accept a string error message which attaches it to the instance itself. Either way I think in debug mode we'll need to read once more over the errors and print them in some reasonable way.

My preference would be to pass the error to ErrorNode. To print "the errors in some reasonable way", would we have to traverse the NodeTraverser object upstream in Polars and print the ErrorNodes?

wence- · 2024-10-01T09:41:46Z

python/cudf_polars/cudf_polars/utils/other.py

+from __future__ import annotations
+
+import os
+
+
+def _env_get_int(name, default):
+    """Get the integer value of the environment variable."""
+    try:
+        return int(os.getenv(name, default))
+    except (ValueError, TypeError):
+        return default
+
+
+def _env_get_bool(name, default):
+    """Get the the boolean value of the environment variable."""
+    env = os.getenv(name)
+    if env is None:
+        return default
+    as_a_int = _env_get_int(name, None)
+    env = env.lower().strip()
+    if env == "true" or env == "on" or as_a_int:
+        return True
+    if env == "false" or env == "off" or as_a_int == 0:
+        return False
+    return default


suggestion: in the long run we probably want to use the polars Config object for this, rather than having our own parallel configuration.

Removed this in favor of passing a config through translate_ir

wence- · 2024-10-01T16:30:50Z

python/cudf_polars/cudf_polars/dsl/translate.py

+            if other._env_get_bool("CUDF_POLARS_DEBUG_MODE", default=False):
+                return ir.ErrorNode(args[0].get_schema())


This kind of action-at-distance stateful modification of the environment makes me think (and @brandon-b-miller wants it too for other reasons) that we need to carry some kind of config object around in the translate_ir

The other advantage of sending the config through to translate_ir would be the ability to configure per-query rather than per-session

I agree with configuring the "debug_mode" on a per-query basis. Eg.

In [1]: import polars as pl ...: ...: df = pl.LazyFrame( ...: { ...: "key": [1, 1, 1, 2, 3, 3, 2, 2], ...: "value": [1, 2, 3, 4, 5, 6, 7, 8], ...: } ...: ) ...: ...: q = df.select(pl.col("value").sum().over("key")) ...: ...: ...: result = q.collect(engine=pl.GPUEngine(debug_mode=True)) --------------------------------------------------------------------------- ComputeError Traceback (most recent call last) Cell In[1], line 13 3 df = pl.LazyFrame( 4 { 5 "key": [1, 1, 1, 2, 3, 3, 2, 2], 6 "value": [1, 2, 3, 4, 5, 6, 7, 8], 7 } 8 ) 10 q = df.select(pl.col("value").sum().over("key")) ---> 13 result = q.collect(engine=pl.GPUEngine(debug_mode=True)) File ~/.conda/envs/rapids/lib/python3.12/site-packages/polars/lazyframe/frame.py:2053, in LazyFrame.collect(self, type_coercion, predicate_pushdown, projection_pushdown, simplify_expression, slice_pushdown, comm_subplan_elim, comm_subexpr_elim, cluster_with_columns, collapse_joins, no_optimization, streaming, engine, background, _eager, **_kwargs) 2051 # Only for testing purposes 2052 callback = _kwargs.get("post_opt_callback", callback) -> 2053 return wrap_df(ldf.collect(callback)) ComputeError: NotImplementedError: Evaluation of plan ErrorNode

And for global configuration, maybe add an option to this config options list in Polars?

…ed-ops

Matt711 · 2024-10-02T20:18:29Z

python/cudf_polars/cudf_polars/callback.py

-                    translate_ir(nt),
+                    translate_ir(
+                        nt,
+                        debug_mode=1 if debug_mode else 0,


Pass a more general config object?

[FEA] Report all unsupported operations for a query in cudf.polars

bb51789

Matt711 added feature request New feature or request 5 - DO NOT MERGE Hold off on merging; see PR for details non-breaking Non-breaking change labels Oct 1, 2024

Matt711 self-assigned this Oct 1, 2024

github-actions bot added Python Affects Python cuDF API. cudf.polars Issues specific to cudf.polars labels Oct 1, 2024

wence- reviewed Oct 1, 2024

View reviewed changes

Matt711 and others added 3 commits October 2, 2024 10:25

Merge branch 'branch-24.12' into fea/cudf-polars/report-all-unsupport…

763cb32

…ed-ops

address reviews

ab91793

remove other utils

c1e2d37

Matt711 commented Oct 2, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[WIP] Report all unsupported operations for a query in cudf.polars #16960

[WIP] Report all unsupported operations for a query in cudf.polars #16960

Matt711 commented Oct 1, 2024

wence- Oct 1, 2024

wence- Oct 1, 2024

brandon-b-miller Oct 1, 2024

Matt711 Oct 2, 2024

wence- Oct 1, 2024

Matt711 Oct 2, 2024

wence- Oct 1, 2024

brandon-b-miller Oct 1, 2024

Matt711 Oct 2, 2024 •

edited

Loading

Matt711 Oct 2, 2024

Matt711 Oct 2, 2024

		def evaluate(self, *, cache: MutableMapping[int, DataFrame]) -> DataFrame:
		return pl.DataFrame()

		if other._env_get_bool("CUDF_POLARS_DEBUG_MODE", default=False):
		return ir.ErrorNode(args[0].get_schema())

[WIP] Report all unsupported operations for a query in cudf.polars #16960

Are you sure you want to change the base?

[WIP] Report all unsupported operations for a query in cudf.polars #16960

Conversation

Matt711 commented Oct 1, 2024

Description

Checklist

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Matt711 Oct 2, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Matt711 Oct 2, 2024 •

edited

Loading