Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

RecursiveCharacterTextSplitter #36

Merged
merged 10 commits into from
Oct 18, 2023
Merged

RecursiveCharacterTextSplitter #36

merged 10 commits into from
Oct 18, 2023

Commits on Oct 17, 2023

  1. fix for LlmChain OutputKey. The result of CallAsync was always contai…

    …ning only ["text'] key ignoring OutputKey. This was causing an error when used with SequentialChain
    TesAnti committed Oct 17, 2023
    Configuration menu
    Copy the full SHA
    9ea137c View commit details
    Browse the repository at this point in the history

Commits on Oct 18, 2023

  1. Configuration menu
    Copy the full SHA
    a9bc7cc View commit details
    Browse the repository at this point in the history
  2. ported CharacterTextSplitter

    - had to create and use TextSplitterBase class and Document class to match structure of original python implementation
    - i have not implemented methods which are relying on external source of information(from_huggingface_tokenizer,from_tiktoken_encoder)
    - added comments which are explaining logic behind implementation. it should help new people to understand what it does(original python implementation kind of hard to understand in some moments)
    TesAnti committed Oct 18, 2023
    Configuration menu
    Copy the full SHA
    b3152fe View commit details
    Browse the repository at this point in the history
  3. Configuration menu
    Copy the full SHA
    070b8c4 View commit details
    Browse the repository at this point in the history
  4. Configuration menu
    Copy the full SHA
    9c340d4 View commit details
    Browse the repository at this point in the history
  5. Revert "changed LengthFunctionDelegate to Func<string, int>"

    This reverts commit 9c340d4.
    TesAnti committed Oct 18, 2023
    Configuration menu
    Copy the full SHA
    908d224 View commit details
    Browse the repository at this point in the history
  6. Configuration menu
    Copy the full SHA
    3ca5a97 View commit details
    Browse the repository at this point in the history
  7. Configuration menu
    Copy the full SHA
    a140b0e View commit details
    Browse the repository at this point in the history
  8. Configuration menu
    Copy the full SHA
    1439177 View commit details
    Browse the repository at this point in the history
  9. ported RecursiveCharacterTextSplitter

    added tests
    TesAnti committed Oct 18, 2023
    Configuration menu
    Copy the full SHA
    cdbf54a View commit details
    Browse the repository at this point in the history