Why would GE run faster on a dataset with a lot of rows compared to a dataset with fewer rows? #5577
cosgroveblue
started this conversation in
General
Replies: 1 comment
-
Hey @cosgroveblue ! Thanks for reaching out. There are a few possibilities. Whenever possible, GE pushes compute back onto your backend (Spark instance, database, etc.) Instances where that isn't possible, for example when working with local data using the Pandas Execution Engine, may see slower compute when compared to a similar operation taking advantage of greater compute available to your backend. Additionally, a given expectation may potentially:
Any of which may cause an expectation to take longer than another which doesn't have those requirements. |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Do certain expectations tend to take longer than others?
Beta Was this translation helpful? Give feedback.
All reactions