
Python: #6761 Onnx Connector #8106

Open · wants to merge 49 commits into base: main
Conversation

nmoeller
Contributor

@nmoeller nmoeller commented Aug 14, 2024

Motivation and Context

  1. Why is this change required?
    To enable Onnx models with Semantic Kernel. The issue Python: Add support for local models via ONNX #6761 was in the backlog to add an Onnx connector.
  2. What problem does it solve?
    It solves the problem that Semantic Kernel is not yet integrated with the Onnx Gen AI runtime.
  3. What scenario does it contribute to?
    The scenario is using a different connector than HF, OpenAI, or AzureOpenAI. When users want to use Onnx, they can now integrate it easily.
  4. If it fixes an open issue, please link to the issue here.
    Python: Add support for local models via ONNX #6761

Description

The changes are designed on my own based on other connectors; I tried to stay as close as possible to their structure.
For the integration I installed the onnxruntime-genai Python package in the repository.

I added the following classes:

  • OnnxCompletionBase --> responsible for controlling the inference
  • OnnxTextCompletion --> inherits from OnnxCompletionBase
    • Support for text completion with and without images
    • Ready for multimodal inference
    • Ready for text-only inference
    • Supports all models on onnxruntime-genai
  • OnnxChatCompletion --> inherits from OnnxCompletionBase
    • Support for chat completion with and without images
    • The user needs to provide the corresponding chat template to use this class
    • Ready for multimodal inference
    • Ready for text-only inference
    • Supports all models on onnxruntime-genai

What is integrated so far:

  • OnnxCompletionBase Class
  • OnnxChatCompletionBase Class with Dynamic Template Support
  • OnnxTextCompletionBase Class
  • Sample Multimodal Inference with Phi3-Vision
  • Sample of OnnxChatCompletions with Phi3
  • Sample of OnnxTextCompletions with Phi3
  • Integration Tests
  • Unit Tests
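
Since OnnxChatCompletion requires the user to provide the chat template, a minimal sketch of what such a template function could look like for a Phi-3-style model may help. The template tokens (`<|system|>`, `<|end|>`, etc.) follow the publicly documented Phi-3 prompt format; the function name and history shape are illustrative, not code from this PR:

```python
# Hypothetical sketch: rendering a chat history into a Phi-3-style prompt.
# The special tokens below follow the published Phi-3 prompt format; the
# helper itself is an illustration, not part of this PR.

def render_phi3_template(chat_history: list[dict[str, str]]) -> str:
    """Flatten a list of {'role', 'content'} messages into a single prompt."""
    parts = []
    for message in chat_history:
        parts.append(f"<|{message['role']}|>\n{message['content']}<|end|>")
    # Trailing assistant tag signals the model to generate the next turn.
    parts.append("<|assistant|>")
    return "\n".join(parts)

history = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"},
]
prompt = render_phi3_template(history)
```

A template like this is what the OnnxChatCompletion class would apply to the chat history before tokenization, since the ONNX runtime consumes a single prompt string rather than structured messages.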

Some Notes

Contribution Checklist

@markwallace-microsoft markwallace-microsoft added the python Pull requests for the Python Semantic Kernel label Aug 14, 2024
@nmoeller nmoeller changed the title Python : Issue-6761-Onnx-Connector Python: Issue-6761-Onnx-Connector Aug 14, 2024
@nmoeller nmoeller changed the title Python: Issue-6761-Onnx-Connector Python: #6761 Onnx Connector Aug 14, 2024
@nmoeller nmoeller marked this pull request as ready for review September 17, 2024 14:32
@nmoeller nmoeller requested a review from a team as a code owner September 17, 2024 14:32
…i-Connector

# Conflicts:
#	python/tests/integration/completions/chat_completion_test_base.py
#	python/uv.lock
@TaoChenOSU
Contributor

Regarding our offline conversation on the prompt template: is using a prompt template to parse the chat history into some format overkill? A prompt template can do much more than substituting arguments. Is it possible to override the _prepare_chat_history_for_request method to get what Onnx wants?
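
For context, the override being suggested could look roughly like this. The base class below is a simplified stand-in for Semantic Kernel's actual ChatCompletionClientBase, and the flat-prompt format is illustrative:

```python
# Simplified stand-in; the real connector would subclass Semantic Kernel's
# ChatCompletionClientBase, which provides _prepare_chat_history_for_request.

class ChatCompletionClientBase:
    def _prepare_chat_history_for_request(self, chat_history):
        # Default behavior: an OpenAI-style list of role/content dicts.
        return [{"role": m["role"], "content": m["content"]} for m in chat_history]

class OnnxChatCompletion(ChatCompletionClientBase):
    def _prepare_chat_history_for_request(self, chat_history):
        # Override to emit the single prompt string the ONNX runtime expects
        # instead of a list of structured messages.
        return "\n".join(f"{m['role']}: {m['content']}" for m in chat_history)

history = [{"role": "user", "content": "Hi"}]
prompt = OnnxChatCompletion()._prepare_chat_history_for_request(history)
```

The trade-off in the question above is whether this per-connector override is simpler than routing the history through the prompt-template machinery.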

nmoeller and others added 6 commits September 19, 2024 08:44
…_completion_base.py
…_chat_completion.py

Co-authored-by: Tao Chen <taochen@microsoft.com>
Contributor

@moonbox3 moonbox3 left a comment

Thanks for working on this! Some questions for you.

python/samples/concepts/README.md (outdated, resolved)
# With the use of Pybind there is currently no way to load images from bytes;
# we can only open images from a file path.
image = OnnxRuntimeGenAi.Images.open(str(image.uri))
input_tokens = self.tokenizer(prompt, images=image)
Contributor


What I meant here is that self.tokenizer is an object. We probably should not call an object directly. Please verify.

@moonbox3 moonbox3 dismissed eavanvalkenburg’s stale review October 1, 2024 20:51

Eduard is currently out of office, and this change request could be blocking.

@moonbox3
Contributor

moonbox3 commented Oct 1, 2024

A couple of typos we'll need to fix:

Warning: "interogate" should be "interrogate".
Warning: "choosen" should be "chosen".

@moonbox3
Contributor

moonbox3 commented Oct 1, 2024

Also, this is failing on the macOS unit tests:

Using CPython 3.10.11 interpreter at: /Library/Frameworks/Python.framework/Versions/3.10/bin/python
Creating virtual environment at: .venv
Resolved 288 packages in 4.62s
error: distribution onnxruntime-genai==0.4.0 @ registry+https://pypi.org/simple can't be 
installed because it doesn't have a source distribution or wheel for the current platform
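
The error above means the resolver found no source distribution or wheel of onnxruntime-genai 0.4.0 for macOS. One common mitigation is to gate the dependency and its tests on platform; the sketch below shows the idea with a plain guard (the helper name is illustrative, and the PEP 508 marker mentioned in the comment is an assumption, not a change made in this PR):

```python
import sys

# Illustrative platform guard: onnxruntime-genai 0.4.0 shipped no macOS
# wheel, so code importing it must be skipped there. In packaging metadata
# the same idea would be a PEP 508 marker, e.g. (illustrative):
#   onnxruntime-genai == 0.4.0; sys_platform != 'darwin'

def onnx_runtime_available(platform: str = sys.platform) -> bool:
    """Return True when the onnxruntime-genai wheel can be installed."""
    return platform != "darwin"

if onnx_runtime_available():
    pass  # safe to import onnxruntime_genai here
```

A guard like this would let the unit-test suite skip the ONNX connector tests on macOS instead of failing dependency resolution outright.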

Labels
documentation python Pull requests for the Python Semantic Kernel
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants