
Bug: Fix prompt length computation #3448

Merged: 1 commit into main on Oct 24, 2022
Conversation

Timoeller (Contributor) commented:

Problem

The prompt currently built by the AnswerGenerator can exceed the model's allowed context length. The length computation also needs to account for the number of tokens to be generated, since, as the OpenAI API error states, "The token count of your prompt plus max_tokens cannot exceed the model's context length."

Proposed Changes:

Subtract max_tokens from leftover_token_len
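
For illustration, a minimal sketch of the corrected budget computation. The identifiers here (`MAX_TOKENS_LIMIT`, `n_instruction_tokens`, `compute_leftover_token_len`) are assumptions for this example and not necessarily the exact names used in the Haystack source:

```python
# Minimal sketch of the fixed token-budget computation (names are illustrative).
MAX_TOKENS_LIMIT = 2049  # assumed context length of the target OpenAI model


def compute_leftover_token_len(n_instruction_tokens: int, max_tokens: int) -> int:
    """Return the token budget left for documents in the prompt.

    Before the fix, only the instruction tokens were subtracted, so
    prompt + completion could exceed the model's context length. The fix
    also reserves max_tokens for the generated answer.
    """
    return MAX_TOKENS_LIMIT - n_instruction_tokens - max_tokens
```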

Checklist

Timoeller requested a review from a team as a code owner on October 21, 2022.
Timoeller requested a review from ZanSara and removed the request for a team on October 21, 2022.
ZanSara added the topic:retriever and type:bug labels on Oct 24, 2022.

ZanSara (Contributor) left a comment:


LGTM 👍

ZanSara merged commit 9b931bb into main on Oct 24, 2022.
ZanSara deleted the openailength branch on October 24, 2022.