[GH-ISSUE #117] Take advantage of increased LLM context window #92

Closed
opened 2026-03-02 23:33:33 +03:00 by kerem · 4 comments
Owner

Originally created by @AJaySi on GitHub (Sep 2, 2024).
Original GitHub issue: https://github.com/AJaySi/ALwrity/issues/117

Originally assigned to: @AJaySi on GitHub.

RAG is dying as the context window keeps increasing exponentially.
With 1M-token context windows, who needs RAG? Caching course material and doing vector search seem like the way to go.

We will need to implement concepts from this guide: https://ai.google.dev/gemini-api/docs/long-context
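
A minimal sketch of the decision described above: send all the material in one prompt when it fits the context window, otherwise fall back to retrieval. The 1M-token window comes from this issue; the 4-characters-per-token heuristic, the reserved budget, and the names `estimate_tokens` and `choose_strategy` are assumptions made for illustration, not part of ALwrity or the Gemini API.

```python
# Assumption: roughly 4 characters per token for English text.
CHARS_PER_TOKEN = 4
# From the issue: a 1M-token context window.
CONTEXT_WINDOW_TOKENS = 1_000_000


def estimate_tokens(text: str) -> int:
    """Crude token estimate (ceiling division); a real tokenizer is more accurate."""
    return (len(text) + CHARS_PER_TOKEN - 1) // CHARS_PER_TOKEN


def choose_strategy(documents: list[str], reserved_tokens: int = 50_000) -> str:
    """Return "long_context" if every document fits in one prompt, else "retrieval".

    reserved_tokens is a hypothetical allowance for instructions and the reply.
    """
    budget = CONTEXT_WINDOW_TOKENS - reserved_tokens
    total = sum(estimate_tokens(doc) for doc in documents)
    return "long_context" if total <= budget else "retrieval"
```

For real use, the estimate would likely be replaced by the provider's own token counting, since the heuristic can be far off for code or non-English text.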

Author
Owner

@AJaySi commented on GitHub (Sep 24, 2024):

https://ai.google.dev/gemini-api/docs/long-context

Author
Owner

@AJaySi commented on GitHub (Sep 24, 2024):

https://github.com/google-gemini/cookbook/blob/main/examples/Apollo_11.ipynb

Author
Owner

@AJaySi commented on GitHub (Mar 13, 2025):

This needs attention for long form content generation.

Author
Owner

@AJaySi commented on GitHub (Sep 25, 2025):

We have shifted to Gemini 2.5 Flash, where this feature is inherently provided and needs no extra instrumentation on our part.
I still need to test this properly, but quality appears to suffer a lot with longer content. Needs more experimentation.
