Hi @myonlyeye,
Are you using the Pro Version? If yes, contact me directly (https://meowapps.com/support). Indeed, I could create a summary and keep it in a cache. But then, when and how to reset the cache, that is the question 🙂
Meanwhile, another idea would be to limit the content used by the context to a certain length, to avoid issues and to avoid using too many tokens. It's much easier and faster to do, and it would work fine in most cases (except if the user asks a question about a part of the article that is right at the end). What do you think?
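
To make the length-limit idea concrete, here is a minimal sketch in Python. It is only an illustration, not the plugin's actual code: the function name `limit_context`, the `max_chars` parameter and its default of 4000 are all made up for this example, and it counts characters rather than tokens, which is only a rough approximation.

```python
def limit_context(content: str, max_chars: int = 4000) -> str:
    """Return at most max_chars characters of content, cut at a word boundary.

    Hypothetical helper for illustration; max_chars is an arbitrary budget
    measured in characters, not tokens.
    """
    if len(content) <= max_chars:
        return content

    cut = content[:max_chars]
    # If the cut lands in the middle of a word, back up to the previous space.
    if not content[max_chars].isspace():
        last_space = cut.rfind(" ")
        if last_space > 0:
            cut = cut[:last_space]
    return cut.rstrip()


article = "The first part of the article. The very last part of the article."
print(limit_context(article, max_chars=30))
```

As noted above, anything past the limit is simply dropped, so a question about the end of the article would not be answered from the context.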