
How to implement late chunking when my context limit is more than 8192 tokens? #2

Open
venkatana-kore opened this issue Sep 10, 2024 · 1 comment

Comments

@venkatana-kore

Jina.ai supports a token limit of 8192 for generating embeddings. For late chunking, if my context is longer than 8192 tokens, what are the best strategies to implement it?

@guenthermi
Member

I think if you have very long documents, not all of the context might be necessary. So if you can split the text into chapters or longer sections, there might be enough context for the embedding model to interpret all of the tokens correctly. Otherwise, you can also pass a bit more text before and after the chunk you are interested in. Adding summaries before the text chunks might improve it further, but I haven't tried something like that.
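A minimal sketch of the "split into longer sections first" strategy. It assumes a HuggingFace long-context embedding model (the model name `jinaai/jina-embeddings-v2-base-en`, the naive paragraph-based section packing, and the helper names `split_into_sections` / `late_chunk_section` are all illustrative, not part of this repo):

```python
import torch
from transformers import AutoModel, AutoTokenizer

# Assumption: any long-context embedding model with a fast tokenizer works here.
MODEL_NAME = "jinaai/jina-embeddings-v2-base-en"
MAX_TOKENS = 8192

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME, trust_remote_code=True)
model = AutoModel.from_pretrained(MODEL_NAME, trust_remote_code=True)
model.eval()


def split_into_sections(document: str, max_tokens: int = MAX_TOKENS) -> list:
    """Greedily pack paragraphs into macro-sections that each fit the context window."""
    sections, current, current_len = [], [], 0
    for paragraph in document.split("\n\n"):
        n_tokens = len(tokenizer.encode(paragraph, add_special_tokens=False))
        if current and current_len + n_tokens > max_tokens:
            sections.append("\n\n".join(current))
            current, current_len = [], 0
        current.append(paragraph)
        current_len += n_tokens
    if current:
        sections.append("\n\n".join(current))
    return sections


def late_chunk_section(section: str, chunk_char_spans):
    """Embed a whole section once, then mean-pool the token embeddings per chunk span."""
    enc = tokenizer(section, return_tensors="pt", return_offsets_mapping=True,
                    truncation=True, max_length=MAX_TOKENS)
    offsets = enc.pop("offset_mapping")[0]              # (seq_len, 2) character offsets
    with torch.no_grad():
        token_embs = model(**enc).last_hidden_state[0]  # (seq_len, dim) contextualized tokens
    chunk_embeddings = []
    for start, end in chunk_char_spans:
        # keep tokens whose character span falls inside the chunk; special tokens have 0-length spans
        mask = (offsets[:, 0] >= start) & (offsets[:, 1] <= end) & (offsets[:, 1] > offsets[:, 0])
        if mask.any():
            chunk_embeddings.append(token_embs[mask].mean(dim=0))
    return chunk_embeddings
```

The same sketch also covers the "pass a bit more text before and after the chunk" idea: include the surrounding text in `section` so it contributes to the contextualized token embeddings, but only list the chunk's own character span in `chunk_char_spans` so the pooled vector represents just that chunk.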
