• 6 Posts
  • 4 Comments
Joined 2 years ago
cake
Cake day: August 16th, 2023

help-circle

  • Nice! KoboldCpp is also my software of choice. It’s easy to install, all-in-one and has a good amount of features.

    What kind of model size do you use to arrive at 1token/s? I’m in the same ballpark. Though my old desktop PC is a bit faster than my laptop. Probably because it has dual-channel memory and doesn’t throttle.

    I think that’s the point where it gets usable. At least for consecutive chat. If I feed in longer text, or KoboldCpp decides to recalculate large portions of the context, it’ll be several minutes for me until I get a reply. And that’s less fun for use-cases like dialougue.