I did a similar kind of process for my own chat logs. I have about 11M tokens wo...

ma9o · on Aug 14, 2024

I'm working on something tangentially related [1], but sourcing the data from my Google search history. It's surprising how LLaMA 3.1 8B is pulling most of the weight in my case too.

[1] https://github.com/enclaveid/enclaveid

mithametacs · on Aug 14, 2024

LLMs are shit at generating content, but summarization works really well.

I’d like to use your project.