[ad_1]
Hi @myonlyeye,
Are you using the Pro Version? If yes, contact me directly ). Indeed, I could either create a summary and keep it in some cache. But then, when/how to reset the cache, that is the question 🙂
Meanwhile, another idea would to limit the content used by the context to a certain length, to avoid issues and using too many tokens. It’s much easier and faster to do, and that would work fine in most cases (except if the user ask a question about some parts of the article which is right at the end). What do you think?