
Rel-1.12.0 cherry picks #12035

Closed
wants to merge 14 commits into from
Conversation

RandySheriffH
Contributor

Cherry-pick commits from the linked list into the release candidate.

garymm and others added 14 commits June 29, 2022 12:07
Follow-ups that need to happen after this and before the next ORT release:
* Support SequenceMap with #11731
* Support signal ops with #11778

Follow-ups that need to happen after this but don't necessarily need to happen before the release:
* Implement LayerNormalization kernel for opset version 17: #11916

Fixes #11640
* Set default version values for the OVEP DLLs as well

* Update backend_manager.cc

Co-authored-by: mayavijx <mayax.vijayan@intel.com>
Co-authored-by: mohsin <mohsinx.mohammad@intel.com>
* Optimize T5 encoder

* update

* update

* update

* refactor expand impl

* cuda tests passed

* update

* alignment

* more alignments

* review comments
…ng thread spinning (#11841)

Introduce a Start/Stop thread pool spinning switch
Add a session config option to force spinning to stop at the end of Run()
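A minimal sketch of how such an option is set from the Python API. The config key string below is an assumption; the authoritative name is defined in onnxruntime_session_options_config_keys.h, and "model.onnx" is a placeholder:

```python
import onnxruntime as ort

so = ort.SessionOptions()
# Assumed key string; check onnxruntime_session_options_config_keys.h
# for the name actually introduced by this change.
so.add_session_config_entry("session.force_spinning_stop", "1")

# Worker threads stop spinning as soon as each Run() returns instead of
# busy-waiting for the next request, trading a little wake-up latency
# for lower idle CPU usage.
sess = ort.InferenceSession("model.onnx", sess_options=so)
```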
* Add nested function call tests

* Add overload for Specialize

* Pass symboltable to onnx shape inference

* Avoid renaming empty names

* Enable sequence_map tests which failed before this change
* Register signal ops for op set 17

Note code is mostly being moved, not added. These ops were previously
only registered as Microsoft contrib ops and only built if
`BUILD_MS_EXPERIMENTAL_OPS=1`. They've been added to the ai.onnx
standard op set in version 17.

Main components of this change:

* Move the kernels from the contrib_ops directory to the
  core directory.
* Add function bodies for ms experimental ops. This will allow
  old models that use the contrib ops to continue to function.
  All the function bodies consist of a single op (the
  new standard op), so performance overhead should be minimal.

Minor clean-up also in this change:

* De-duplicate get_scalar_value_from_tensor: put it in a new utils.h.
* Fix some bugs that caused compilation errors with the experimental
  ops. Tested with `build.sh --ms_experimental`
* Fix some spelling errors and lint violations.
* Replace a couple of switch statements with `MLTypeCallDispatcher`.
* Use `InlineVector` instead of `std::vector`.

Unblocks #11640
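As a rough illustration of what this registration unblocks, the sketch below builds a one-node ai.onnx opset-17 model around the now-standard DFT op and runs it with ONNX Runtime. Shapes and tensor names are illustrative and not taken from this PR:

```python
import numpy as np
import onnx
from onnx import helper, TensorProto
import onnxruntime as ort

# One-node opset-17 model using the standard DFT op. The real input has a
# trailing dimension of 1; the output carries real/imaginary parts, so its
# trailing dimension is 2.
dft = helper.make_node("DFT", inputs=["x"], outputs=["y"], inverse=0)
graph = helper.make_graph(
    [dft],
    "dft_example",
    [helper.make_tensor_value_info("x", TensorProto.FLOAT, [1, 16, 1])],
    [helper.make_tensor_value_info("y", TensorProto.FLOAT, [1, 16, 2])],
)
model = helper.make_model(graph, opset_imports=[helper.make_opsetid("", 17)])
onnx.checker.check_model(model)

sess = ort.InferenceSession(model.SerializeToString())
x = np.random.rand(1, 16, 1).astype(np.float32)
(y,) = sess.run(None, {"x": x})
print(y.shape)  # (1, 16, 2)
```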
…l signal op definitions (#12006)

* fix winml tests

* remove legacy test

* switch idft -> dft+inverse attr

* upgrade opset 13->17 for signal ops tests
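At the node level, the IDFT-to-DFT switch amounts to something like the sketch below (tensor names are illustrative):

```python
from onnx import helper

# Previously: an experimental IDFT contrib node.
# Now: the standard opset-17 DFT node with its inverse attribute set.
node = helper.make_node("DFT", inputs=["x"], outputs=["y"], inverse=1)
```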
…ls. (#12008)

Add support for double tensor output in TestPreTrainedModels.
…ation lacking training_mode attribute (#12010)

FusedBatchNormalization now includes the training_mode attribute
* create op from ep

* read input count from context

* create holder to host nodes

* fix typo

* cast type before comparison

* throw error on API fail

* silence warning from minimal build

* switch to unique_ptr with deleter to host nodes

* fix typo

* fix build err for minimal

* fix build err for minimal

* add UT for conv

* enable test on CUDA

* add comment

* fix typo

* use gsl::span and string view for Node constructor

* Added two APIs - CopyKernelInfo and ReleaseKernelInfo

* pass gsl::span by value

* switch to span<NodeArg* const> to allow for reference to const containers

* fix typo

* fix reduced build err

* fix reduced build err

* refactoring node construction logic

* rename exceptions

* add input and output count as arguments for op creation

* refactor static member

* use ORT_CATCH instead of catch

* cancel try catch

* add static value name map

* format input definition and set err code

* fix comments

* fix typo
* Pad fallback to CPU

* Added queryPad in operatorRegistration.cpp

* Acknowledged PR comments

* Used any_of

* used none_of instead of any_of

Co-authored-by: Sumit Agarwal <sumitagarwal@microsoft.com>
…to_pad (#11984)

* Add warning about future computation change for ConvTranspose with auto_pad

* improve msg

* update TODO to make lint happy

* update more contents for warning and add if

* VALID was not affected

* move it into kernel registration

* parse auto_pad myself

* try to use conv_transpose_attrs_.auto_pad directly
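For reference, a ConvTranspose node of the kind the new warning targets might look like the sketch below, assuming auto_pad=SAME_UPPER (where the output spatial size is the input size times the stride; VALID is unaffected). Shapes and names are illustrative:

```python
import numpy as np
from onnx import helper, TensorProto
import onnxruntime as ort

node = helper.make_node(
    "ConvTranspose", inputs=["x", "w"], outputs=["y"],
    auto_pad="SAME_UPPER", strides=[2, 2], kernel_shape=[3, 3],
)
# Single input/output channel, 3x3 kernel of ones.
w = helper.make_tensor("w", TensorProto.FLOAT, [1, 1, 3, 3], [1.0] * 9)
graph = helper.make_graph(
    [node], "convtranspose_autopad",
    [helper.make_tensor_value_info("x", TensorProto.FLOAT, [1, 1, 4, 4])],
    [helper.make_tensor_value_info("y", TensorProto.FLOAT, [1, 1, 8, 8])],
    initializer=[w],
)
model = helper.make_model(graph, opset_imports=[helper.make_opsetid("", 13)])

sess = ort.InferenceSession(model.SerializeToString())
y = sess.run(None, {"x": np.ones((1, 1, 4, 4), dtype=np.float32)})[0]
print(y.shape)  # (1, 1, 8, 8): each spatial dim is 4 * stride 2
```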
@RandySheriffH RandySheriffH requested a review from a team as a code owner June 29, 2022 19:20
@RandySheriffH RandySheriffH requested review from faxu, pranavsharma and a team June 29, 2022 19:20
@RandySheriffH RandySheriffH self-assigned this Jun 29, 2022
@RandySheriffH RandySheriffH changed the title rel-1.12.0 cherry picks Rel-1.12.0 cherry picks Jun 29, 2022
@lgtm-com

lgtm-com bot commented Jun 29, 2022

This pull request introduces 1 alert when merging 5bba110 into 64f95d4 - view on LGTM.com

new alerts:

  • 1 for Commented-out code

Contributor

@fdwr fdwr left a comment


I see all 3 of our changes present: @sumitsays's padding change plus my Conv+BN fusion and dml_defs.cc fused batchnormalization change 👍.

@fdwr
Contributor

fdwr commented Jul 6, 2022

Pointer to new one: #12097
