optimize: add at least a small cost for most instructions #31455

vtjnash · 2019-03-22T23:36:44Z

This should better reflect the aggregate expected cost of their presence
and restrict endless inlining: SROA may eliminate many of them, but
probably not all of them.

This was part of #31338, but I realized that I need to separate the nanosoldier results.

vtjnash · 2019-03-22T23:37:04Z

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2019-03-23T06:09:13Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

timholy · 2019-03-23T07:06:09Z

Looks like some regressions in array handling. Maybe pair this with an increase in the cost thresholds?

vtjnash · 2019-03-23T16:26:43Z

That would somewhat defeat the purpose of this. In the benchmark, it looks like this improved performance by 10% on many benchmarks, and up to 40% on others. I would rather understand why that one benchmark is not being optimized well.

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2019-03-23T22:57:15Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

timholy · 2019-03-24T09:33:56Z

That would somewhat defeat the purpose of this.

Depends on the purpose; it wouldn't defeat the "endless inlining" (in the sense of infinite). But agreed, best to focus on the few failures.

vtjnash · 2019-03-25T23:55:33Z

@nanosoldier runbenchmarks("broadcast", vs=":master")

nanosoldier · 2019-03-26T01:10:04Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

vtjnash · 2019-03-29T16:13:24Z

Results seem pretty good to me now, so I'll plan to merge over the weekend, unless there's any more comments.

KristofferC · 2019-03-29T16:16:19Z

Why did you only run the broadcast benchmarks? There where categories with regression outside of that.

ID	time ratio	memory ratio
["array", "index", "("sumvector_view", "1:100000")"]	1.03 (50%)	1.40 (1%) ❌
["array", "index", "("sumvector_view", "BitArray{2}")"]	1.60 (50%) ❌	2.49 (1%) ❌
["array", "index", "("sumvector_view", "SubArray{Float32,2,Base.Res...},true}")"]	2.26 (50%) ❌	2.49 (1%) ❌
["array", "index", "("sumvector_view", "SubArray{Int32,2,Base.ReshapedArray{Int...true}")"]	2.23 (50%) ❌	2.49 (1%) ❌

for example

vtjnash · 2019-03-29T17:21:19Z

Those are the only ones that would have changed. I looked into several others. Most were expected/intended. The usual causes were an insufficient benchmark complexity, that's now just measuring overhead noise (e.g. scalar iteration), and inaccurate cost modeling issues (harder to deal with, but maybe we should store the result for boundscheck=none at least? this problem existed before, we just don't have good benchmark coverage of the case.)

mbauman · 2019-03-29T17:39:56Z

View construction and indexing seems like quite a relevant microbenchmark... and is something we've tried hard to make as fast as possible. Why can't we just manually inline whatever is throwing a wrench into things here?

Edit: oh, without looking I'd bet it's the generic first that's not getting inlined. Alright, if that's the case I'm slightly more sympathetic.

vtjnash · 2020-07-27T16:59:13Z

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2020-07-28T01:02:18Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @ararslan

vtjnash · 2020-07-28T03:53:09Z

Wow, okay, that seems quite bad

This removes the dependence on inlining for performance.

This should better reflect the aggregate expected cost of their presence and restrict endless inlining: SROA may eliminate many of them, but probably not all of them.

vtjnash · 2021-06-04T04:29:08Z

@nanosoldier runbenchmarks(ALL, vs=":master")

nanosoldier · 2021-06-04T10:47:26Z

Your benchmark job has completed - possible performance regressions were detected. A full report can be found here. cc @christopher-dG

vtjnash · 2021-11-22T21:31:36Z

Not sure that it is worthwhile trying to figure out those regressions for this minor gain

vtjnash force-pushed the jn/getfield-cost branch from 05cd026 to ba97088 Compare March 25, 2019 19:56

vtjnash added the status:triage This should be discussed on a triage call label Apr 15, 2019

StefanKarpinski removed the status:triage This should be discussed on a triage call label May 9, 2019

JeffBezanson self-requested a review May 9, 2019 18:30

vtjnash force-pushed the jn/getfield-cost branch from ba97088 to 0e0e39c Compare August 16, 2019 20:57

JeffBezanson mentioned this pull request Mar 23, 2020

some inlining cost model updates #35235

Merged

vtjnash force-pushed the jn/getfield-cost branch from 0e0e39c to 526964b Compare July 23, 2020 15:08

vtjnash force-pushed the jn/getfield-cost branch 2 times, most recently from 185b62a to dd5a0a8 Compare June 4, 2021 03:38

This comment has been minimized.

Sign in to view

vtjnash added 2 commits June 4, 2021 00:27

broadcast: disable nospecialize logic for outer method signature

fcc34e0

This removes the dependence on inlining for performance.

optimize: add at least a small cost for most instructions

32e6197

This should better reflect the aggregate expected cost of their presence and restrict endless inlining: SROA may eliminate many of them, but probably not all of them.

vtjnash force-pushed the jn/getfield-cost branch from dd5a0a8 to 32e6197 Compare June 4, 2021 04:28

mbauman added the compiler:optimizer Optimization passes (mostly in base/compiler/ssair/) label Jun 4, 2021

vtjnash closed this Nov 22, 2021

vtjnash deleted the jn/getfield-cost branch November 22, 2021 21:31

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

optimize: add at least a small cost for most instructions #31455

optimize: add at least a small cost for most instructions #31455

vtjnash commented Mar 22, 2019

vtjnash commented Mar 22, 2019

nanosoldier commented Mar 23, 2019

timholy commented Mar 23, 2019

vtjnash commented Mar 23, 2019

nanosoldier commented Mar 23, 2019

timholy commented Mar 24, 2019

vtjnash commented Mar 25, 2019

nanosoldier commented Mar 26, 2019

vtjnash commented Mar 29, 2019

KristofferC commented Mar 29, 2019

vtjnash commented Mar 29, 2019

mbauman commented Mar 29, 2019 •

edited

Loading

vtjnash commented Jul 27, 2020

nanosoldier commented Jul 28, 2020

vtjnash commented Jul 28, 2020

This comment has been minimized.

This comment has been minimized.

vtjnash commented Jun 4, 2021

nanosoldier commented Jun 4, 2021

vtjnash commented Nov 22, 2021

optimize: add at least a small cost for most instructions #31455

optimize: add at least a small cost for most instructions #31455

Conversation

vtjnash commented Mar 22, 2019

vtjnash commented Mar 22, 2019

nanosoldier commented Mar 23, 2019

timholy commented Mar 23, 2019

vtjnash commented Mar 23, 2019

nanosoldier commented Mar 23, 2019

timholy commented Mar 24, 2019

vtjnash commented Mar 25, 2019

nanosoldier commented Mar 26, 2019

vtjnash commented Mar 29, 2019

KristofferC commented Mar 29, 2019

vtjnash commented Mar 29, 2019

mbauman commented Mar 29, 2019 • edited Loading

vtjnash commented Jul 27, 2020

nanosoldier commented Jul 28, 2020

vtjnash commented Jul 28, 2020

This comment has been minimized.

This comment has been minimized.

vtjnash commented Jun 4, 2021

nanosoldier commented Jun 4, 2021

vtjnash commented Nov 22, 2021

mbauman commented Mar 29, 2019 •

edited

Loading