This happened to me a lot when I tried to run big models with small context windows. The model would effectively run out of memory: each new token wouldn't actually be added to the context, so it would get stuck in an infinite loop repeating the previous token. It's also possible there was a memory issue on Google's end.
There is something wrong if it's not discarding old context to make room for new tokens.
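The expected behavior is a sliding window: once the context is full, the oldest tokens are evicted so new ones can still be appended. A minimal sketch of that idea (the token strings and capacity here are just illustrative, not any particular model's tokenizer or limit):

```python
from collections import deque

def make_context(max_tokens):
    # A bounded context buffer: appending past capacity silently
    # evicts the oldest entries, so generation never "fills up".
    return deque(maxlen=max_tokens)

ctx = make_context(4)
for tok in ["a", "b", "c", "d", "e", "f"]:
    ctx.append(tok)

# Oldest tokens ("a", "b") were discarded; newest four remain.
print(list(ctx))  # -> ['c', 'd', 'e', 'f']
```

If the runtime instead stops appending once the buffer is full, the model keeps sampling from the same unchanged context, which matches the repeated-token loop described above.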