Fix wrong status code for SearchPhaseExecutionException #19851

qwerty4030 · 2016-08-07T04:13:16Z

Fix wrong status code for SearchPhaseExecutionException.
Updated SearchPhaseExecutionException to determine the status from the cause if one exists and there were no shard failures.

Throw exception if (date) histogram order path is invalid and there is only one bucket.
Forced check of sub-aggregation names and fields used in (date) histogram order path if there is only one bucket. The previous code relied on the sorting code (bypassed if less than 2 buckets) to do this check. Ideally these checks should be performed during parsing instead of the reduce phase.

Tests pass: gradle test and gradle core:integTest

Closes #14771

Updated SearchPhaseExecutionException to determine the status from the cause if one exists and there were no shard failures. Throw exception if (date) histogram order path is invalid and there is only one bucket. Forced check of sub-aggregation names and fields used in (date) histogram order path if there is only one bucket. The previous code relied on the sorting code (bypassed if less than 2 buckets) to do this check. Ideally these checks should be performed during parsing instead of the reduce phase. Closes elastic#14771

qwerty4030 · 2016-08-08T21:44:02Z

.../main/java/org/elasticsearch/search/aggregations/bucket/histogram/InternalDateHistogram.java

@@ -396,6 +396,12 @@ public InternalAggregation doReduce(List<InternalAggregation> aggregations, Redu
        } else {
            // sorted by sub-aggregation, need to fall back to a costly n*log(n) sort
            CollectionUtil.introSort(reducedBuckets, order.comparator());
+            if (reducedBuckets.size() == 1) {
+                // hack: force check of sub-aggregation names and fields if there is only 1 bucket (sort code bypassed)


Currently if the (date) histogram is ordered by sub-aggregation(s), the order path is validated during the reduce phase. This check occurs implicitly in the comparator when the histogram buckets are sorted. However if there are 0 or 1 buckets, there is nothing to sort so this code was bypassed. I added a hack here to catch the case with 1 bucket. To catch the case with 0 buckets, the validation code needs to be refactored to run during the query parsing phase (if possible). This would require parsing all sub-aggregations first and then validating the order path.

clintongormley · 2016-08-11T17:09:26Z

@colings86 could you review this one please?

colings86 · 2016-08-16T12:53:17Z

@qwerty4030 Thanks for raising this PR. I would personally prefer to fix this at parse time rather than trying to change the logic of the SearchPhaseExecutionException#status(). This is for a couple of reasons:

I am a bit uncomfortable about having an exception where the status code it reports changes based on the cause since the cause can often be making different assumptions of who the user is (e.g. developer v.s. user). We could end up with a library throwing e.g. an IllegalArgumentException because of something other than data the user provides (e.g. bad data in a document, internal node state etc.) and we would incorrectly return a 400 status even though the problem is not actually with the request.
As you also point out, the PR fixes the case where there is an invalid order when only one bucket is returned by using a hack and comparing that bucket with itself but is not able to fix the case where no buckets are returned.

Because of this I would rather fix this at parsing time, meaning that we can throw a SearchParseException which is in line with the other validation code for the search API and also meaning we can cover all cases and catch it early.

qwerty4030 · 2016-08-20T04:22:46Z

@colings86 Thanks for taking a look. I agree; catching this during parsing is better than adding a hack just for this one case. I took a quick look at the code and had some ideas, will comment in #20003.

qwerty4030 reviewed Aug 8, 2016
View reviewed changes

clintongormley added >bug review :Analytics/Aggregations Aggregations labels Aug 11, 2016

colings86 mentioned this pull request Aug 16, 2016

Validate agg order parameter at parse time #20003

Open

qwerty4030 closed this Aug 20, 2016

qwerty4030 deleted the fix/14771_histogram_bad_order branch August 27, 2016 22:10

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix wrong status code for SearchPhaseExecutionException #19851

Fix wrong status code for SearchPhaseExecutionException #19851

qwerty4030 commented Aug 7, 2016

qwerty4030 Aug 8, 2016

clintongormley commented Aug 11, 2016

colings86 commented Aug 16, 2016

qwerty4030 commented Aug 20, 2016

Fix wrong status code for SearchPhaseExecutionException #19851

Fix wrong status code for SearchPhaseExecutionException #19851

Conversation

qwerty4030 commented Aug 7, 2016

qwerty4030 Aug 8, 2016

Choose a reason for hiding this comment

clintongormley commented Aug 11, 2016

colings86 commented Aug 16, 2016

qwerty4030 commented Aug 20, 2016