# .Net: Depending on the code path used, semantic functions will have different max tokens #2738
Related to #1362
**github-merge-queue bot** pushed a commit that referenced this issue on Sep 16, 2023:
### Motivation and Context

#### ⚠️ Breaking change

When `Kernel.InvokeSemanticFunctionAsync` is used, `max_tokens` is set to `256` by default. However, if you call `Kernel.CreateSemanticFunction`, `max_tokens` defaults to `null`, so the model's default value is used. The reason `max_tokens` was set to 256 is that OpenAI text completion models default to 16, while OpenAI chat completion models default to infinity. The current default therefore makes sense for text completions but not for chat completions. To fix this, the default value of `max_tokens` will always be `null`.

#### Troubleshooting

Enable informational logging and check for token usage logs, e.g.:

```
Action: GetCompletionsAsync. Azure OpenAI Deployment Name: text-davinci-003. Prompt tokens: 14. Completion tokens: 16. Total tokens: 30.
```

#### Here's the documentation for OpenAI

https://platform.openai.com/docs/api-reference/completions/create#max_tokens

> `max_tokens` — `integer or null` — `Optional` — `Defaults to 16`
> The maximum number of [tokens](https://platform.openai.com/tokenizer) to generate in the completion.

https://platform.openai.com/docs/api-reference/chat/create#max_tokens

> `max_tokens` — `integer or null` — `Optional` — `Defaults to inf`
> The total length of input tokens and generated tokens is limited by the model's context length.

Resolves: #2738

### Description

The property `max_tokens` is optional, so SK should not set a default value for it. This change makes SK consistent in that approach.

### Contribution Checklist

- [x] The code builds clean without any errors or warnings
- [x] The PR follows the [SK Contribution Guidelines](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md) and the [pre-submission formatting script](https://github.com/microsoft/semantic-kernel/blob/main/CONTRIBUTING.md#development-scripts) raises no violations
- [x] All unit tests pass, and I have added new tests where possible
- [x] I didn't break anyone 😄
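The change described above can be illustrated with a minimal sketch (Python with hypothetical names; SK itself is C# and its actual types differ): when `max_tokens` is left as `None`, it is simply omitted from the request payload, so the service applies its own model default instead of a client-side one.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class CompletionRequestSettings:
    # None means "do not send max_tokens": the service then applies its own
    # model default (16 for text completions, unbounded for chat).
    max_tokens: Optional[int] = None
    temperature: float = 0.0


def to_request_payload(prompt: str, settings: CompletionRequestSettings) -> dict:
    """Build the JSON body for a completion call (hypothetical helper)."""
    payload = {"prompt": prompt, "temperature": settings.temperature}
    # Only include max_tokens when the caller set it explicitly.
    if settings.max_tokens is not None:
        payload["max_tokens"] = settings.max_tokens
    return payload


# Omitted by default; present only when explicitly configured.
print(to_request_payload("Hello", CompletionRequestSettings()))
print(to_request_payload("Hello", CompletionRequestSettings(max_tokens=256)))
```

The design choice here mirrors the PR: rather than baking one provider's default (256, or 16) into the client, an optional field stays absent until the user opts in, keeping behavior consistent across code paths.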
**SOE-YoungS** pushed a commit to SOE-YoungS/semantic-kernel that referenced this issue on Nov 1, 2023.
Noticed this while creating a new integration test.

Calling `Kernel.InvokeSemanticFunctionAsync`, the max tokens will default to 256. If you call `Kernel.CreateSemanticFunction`, the max tokens will not be set. The proposed fix is to use `null` as the default value, since the property is optional.

Here's the documentation for OpenAI:
https://platform.openai.com/docs/api-reference/completions/create#max_tokens

> `max_tokens` — `integer or null` — `Optional` — `Defaults to 16`
> The maximum number of tokens to generate in the completion.

https://platform.openai.com/docs/api-reference/chat/create#max_tokens

> `max_tokens` — `integer or null` — `Optional` — `Defaults to inf`
> The total length of input tokens and generated tokens is limited by the model's context length.
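The two endpoints quoted above resolve an omitted `max_tokens` very differently, which is the root of the inconsistency. A small sketch of that resolution (a hypothetical helper for illustration, not SK or OpenAI client code):

```python
from typing import Optional


def effective_max_tokens(requested: Optional[int], endpoint: str) -> float:
    """Hypothetical helper: the completion budget the service applies
    when the client omits max_tokens (i.e. sends null)."""
    if requested is not None:
        return requested
    # Per the OpenAI docs: text completions default to 16 tokens, while
    # chat completions are bounded only by the model's context length.
    return 16 if endpoint == "completions" else float("inf")


print(effective_max_tokens(None, "completions"))  # 16
print(effective_max_tokens(None, "chat"))         # inf
print(effective_max_tokens(256, "chat"))          # 256
```

This is why a single client-side default of 256 cannot suit both endpoints, and why passing `null` through and letting each endpoint apply its own default is the cleaner behavior.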