Incorrect AVX2 Adler implementation #1250

I'm assuming s2 = v_s2.ToScalar(); is incorrect also?

Yep, as @tannergooding said, you can think of the AVX2 version as two instances of the SSSE3 version running in parallel, one per 128-bit lane. When you accumulate at the end, the shuffle/add pairs total everything into the low element of each lane, so you end up with two partial sums to combine. ToScalar() returns only the first of them.
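To make the lane behavior concrete, here is a minimal sketch in plain Python that models a 256-bit vector as two 128-bit lanes of four 32-bit elements each. It is not the library's code and uses no real intrinsics. The function names (`shuffle_add_reduce`, `to_scalar`, `horizontal_sum_256`) and the sample values are hypothetical, chosen only to show why reading element 0 alone drops the upper lane's total.

```python
MASK32 = 0xFFFFFFFF  # model 32-bit wraparound of each vector element


def shuffle_add_reduce(lanes):
    """Model the per-lane shuffle/add pairs.

    `lanes` is a list of two 4-element lists (one per 128-bit lane).
    The shuffles never cross a lane boundary, so each lane is reduced
    independently and ends up holding its own total in every element.
    """
    reduced = []
    for lane in lanes:
        a = list(lane)
        # First pair: swap the 64-bit halves of the lane, then add.
        a = [(a[i] + a[i ^ 2]) & MASK32 for i in range(4)]
        # Second pair: swap adjacent 32-bit elements, then add.
        a = [(a[i] + a[i ^ 1]) & MASK32 for i in range(4)]
        reduced.append(a)
    return reduced


def to_scalar(lanes):
    """Model reading element 0 of the whole vector (the low lane only)."""
    return lanes[0][0]


def horizontal_sum_256(lanes):
    """Reduce each lane, then combine the two per-lane totals."""
    reduced = shuffle_add_reduce(lanes)
    return (reduced[0][0] + reduced[1][0]) & MASK32


# Hypothetical accumulator contents: low lane sums to 10, high lane to 100.
v_s2 = [[1, 2, 3, 4], [10, 20, 30, 40]]

partial = to_scalar(shuffle_add_reduce(v_s2))  # low lane only
full = horizontal_sum_256(v_s2)                # both lanes combined
print(partial, full)
```

In the real AVX2 code the equivalent fix would be to add the upper 128-bit half to the lower half before extracting the scalar. The exact intrinsics to use for that are left to the implementation.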

Replies: 4 comments 8 replies

Answer selected by JimBobSquarePants
Category
Q&A
3 participants