[ML] Linear scaling change detection #25

tveasey · 2018-03-27T10:38:47Z

This implements detection of linear scaling events. It also finishes up the unit testing of change detection and fixes some issues these turned up: specifically, 1) the behaviour when a change is detected but the trend model has no components, 2) the handling of time shifts in the trend model and 3) the handling of data types in the trend component change model. Finally, we are now more careful with the weights we apply to samples added to both the standard and change models. This has meant I've been able to revert scaling the changes, since the trend is less influenced by values during the change detection period if we're likely to detect a change.

…ge-modelling-part-3

…ents to change detection

…ge-modelling-part-3

…delling-part-4

…functionality in change detection

…delling-part-4

…ect change

…ge-modelling-part-4

hendrikmuhs · 2018-04-04T07:22:36Z

lib/maths/CTimeSeriesChangeDetector.cc

+
+    change = candidates[0].second - m_ChangeModels.begin();
+
+    return p / 0.03125;


does this magic value has some origin/explanation?

This is 1/2^5 and ensures that the value of the function equals one, and we'd just accept the change, at the point at which all the decision variables are at the centre of their respective sigmoid functions and the time is at the lower end of the decision interval. If you think about this as converting the hard decision criteria to a soft one then this means the decision is (just) the same as it would be in the case that all the individual hard criteria are met. I'll add a comment.

hendrikmuhs · 2018-04-04T07:28:47Z

lib/maths/CTimeSeriesChangeDetector.cc

+
+double CUnivariateLinearScaleModel::bic() const
+{
+    return -2.0 * this->logLikelihood() + std::log(m_SampleCount);


could we use fastlog, same below?

Quite possibly. This won't be a bottleneck, but I agree we probably don't need the accuracy here and it is good to use it whenever possible.

hendrikmuhs · 2018-04-04T07:31:40Z

include/maths/CTimeSeriesChangeDetector.h

@@ -374,9 +431,9 @@ class MATHS_EXPORT CUnivariateTimeShiftModel final : public CUnivariateChangeMod

        //! Update with \p samples.
        virtual void addSamples(std::size_t count,


nit as you change it: could be const

edsavage

LGTM

edsavage · 2018-04-04T08:10:45Z

include/maths/CTimeSeriesModel.h

+                               std::size_t dimension,
+                               double derate,
+                               double scale,
+                               const core::CSmallVector<double, 10> &value);


Use a typedef/alias here - TDouble10Vec?

I didn't want to introduce a new typedef into the maths namespace just in this header: the reason I try to avoid this is that I think it tends to promote fragile transitive dependencies between headers when something compiles only because it (in)directly includes a header. That said, I think it would probably be worthwhile defining some widely used typedefs such as TDoubleVec, TDouble1Vec, etc in a global forward decls header and removing the corresponding duplicate typedefs which we currently have. I'll look at making this change in a follow up PR.

hendrikmuhs

LGTM, some questions

This implements detection of linear scaling events. It also finishes up the unit testing of change detection and fixes some issues these turned up: specifically, 1) the behaviour when a change is detected but the trend model has no components, 2) the handling of time shifts in the trend model and 3) the handling of data types in the trend component change model. Finally, we are now more careful with the weights we apply to samples added to both the standard and change models. This has meant I've been able to revert scaling the changes, since the trend is less influenced by values during the change detection period if we're likely to detect a change.

droberts195 · 2018-12-18T11:54:04Z

Removing version label as this is a feature branch PR and it causes confusion when generating release notes. (For interest this was eventually merged to 6.4 and above in #92.)

tveasey added 21 commits March 19, 2018 17:03

Implement an absolute test for suitability of change hypotheses

9b91ed5

Remove debug

0dbf15b

Fix restore

628eb53

Merge branch 'feature/forecast-enhancements-part-2' into feature/chan…

a9531fa

…ge-modelling-part-3

Smooth decision to accept change over various factors

c388c9b

Merge branch 'feature/forecast-enhancements-part-2' into feature/chan…

a87b74e

…ge-modelling-part-3

Implement linear scaling change detection, plus some general improvem…

f2eb387

…ents to change detection

Update test threshold

554a101

Merge branch 'feature/forecast-enhancements-part-2' into feature/chan…

030d613

…ge-modelling-part-3

Merge branch 'feature/change-modelling-part-3' into feature/change-mo…

49db219

…delling-part-4

Bad merge

639fd43

Update test thresholds

b407bb5

Bad merge

fbb511f

Finish testing, correct application of time shift, factor out common …

2ffa31e

…functionality in change detection

Explicitly initialise likelihoods

d1e466e

Merge branch 'feature/change-modelling-part-3' into feature/change-mo…

7aa6e9d

…delling-part-4

Derate winsorisation during change detection and increase time to det…

572c781

…ect change

Fix remaining tests

393af2d

Merge branch 'feature/forecast-enhancements-part-2' into feature/chan…

69acacd

…ge-modelling-part-4

Unneeded new lines

41e2b15

Fix bad merge

7694e5a

tveasey added >enhancement v7.0.0 :ml labels Mar 27, 2018

sophiec20 added :ml and removed :ml labels Mar 28, 2018

hendrikmuhs reviewed Apr 4, 2018

View reviewed changes

edsavage approved these changes Apr 4, 2018

View reviewed changes

hendrikmuhs approved these changes Apr 4, 2018

View reviewed changes

Review comments

b7edff7

tveasey merged this pull request into elastic:feature/forecast-enhancements-part-2 Apr 4, 2018

droberts195 removed the v7.0.0 label Dec 18, 2018

droberts195 mentioned this pull request Dec 18, 2018

[DOCS] Updates changelog for 7.0.0-alpha2 #347

Merged

davidkyle mentioned this pull request Jun 20, 2023

[NLP] Catch exceptions thrown during inference and report as errors #2542

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[ML] Linear scaling change detection #25

[ML] Linear scaling change detection #25

tveasey commented Mar 27, 2018

hendrikmuhs Apr 4, 2018

tveasey Apr 4, 2018 •

edited

Loading

hendrikmuhs Apr 4, 2018

tveasey Apr 4, 2018

hendrikmuhs Apr 4, 2018

edsavage left a comment

edsavage Apr 4, 2018

tveasey Apr 4, 2018

hendrikmuhs left a comment

droberts195 commented Dec 18, 2018


		change = candidates[0].second - m_ChangeModels.begin();

		return p / 0.03125;

		@@ -374,9 +431,9 @@ class MATHS_EXPORT CUnivariateTimeShiftModel final : public CUnivariateChangeMod

		//! Update with \p samples.
		virtual void addSamples(std::size_t count,

[ML] Linear scaling change detection #25

[ML] Linear scaling change detection #25

Conversation

tveasey commented Mar 27, 2018

hendrikmuhs Apr 4, 2018

Choose a reason for hiding this comment

tveasey Apr 4, 2018 • edited Loading

Choose a reason for hiding this comment

hendrikmuhs Apr 4, 2018

Choose a reason for hiding this comment

tveasey Apr 4, 2018

Choose a reason for hiding this comment

hendrikmuhs Apr 4, 2018

Choose a reason for hiding this comment

edsavage left a comment

Choose a reason for hiding this comment

edsavage Apr 4, 2018

Choose a reason for hiding this comment

tveasey Apr 4, 2018

Choose a reason for hiding this comment

hendrikmuhs left a comment

Choose a reason for hiding this comment

droberts195 commented Dec 18, 2018

tveasey Apr 4, 2018 •

edited

Loading