Skip to content

Commit

Permalink
Tokenization notebook (#173)
Browse files Browse the repository at this point in the history
* Tokenization notebook

* remove comment

* exclude special tokens before decoding

* improve descriptions

* add overlap; remove text file

* run nbtest through Makefile

* reformulate recommendation

* remove paragraph I wanted to omit

* move to `document-chunking` folder
  • Loading branch information
maxjakob authored Jan 26, 2024
1 parent 5264994 commit 5369ab6
Show file tree
Hide file tree
Showing 2 changed files with 330 additions and 0 deletions.
1 change: 1 addition & 0 deletions notebooks/document-chunking/Makefile
Original file line number Diff line number Diff line change
@@ -1,5 +1,6 @@
NBTEST = ../../bin/nbtest
NOTEBOOKS = \
tokenization.ipynb \
with-index-pipelines.ipynb \
with-langchain-splitters.ipynb

Expand Down
Loading

0 comments on commit 5369ab6

Please sign in to comment.