Did you try `/set parameter num_ctx #`
and `/set parameter num_predict #`?
And are you using a model that actually supports the context length you want?
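For reference, inside an interactive `ollama run` session the commands look like this (the numeric values are just placeholders for illustration; substitute limits your model actually supports):

```
/set parameter num_ctx 8192
/set parameter num_predict 2048
```

`num_ctx` sets the size of the context window, while `num_predict` caps how many tokens the model will generate in a single response.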
A model's layers are computed sequentially (the output of each layer is the input to the next layer), so for a single request, splitting a model across more GPUs does not speed up inference — at best it lets you fit a model that is too large for one GPU's memory.
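A minimal sketch (not Ollama's actual code) of why layer-split inference is inherently serial: each layer's output is the next layer's input, so the layers cannot run at the same time no matter how many devices hold them.

```python
def make_layer(weight):
    # A toy "layer": multiply every element by a weight.
    return lambda xs: [weight * x for x in xs]

# Imagine layers 0-1 live on GPU 0 and layers 2-3 on GPU 1;
# the split only changes where each step runs, not how many
# steps can run at once.
layers = [make_layer(w) for w in (2, 3, 5, 7)]

def forward(xs):
    for layer in layers:   # strictly one layer at a time
        xs = layer(xs)     # this output feeds the next layer
    return xs

print(forward([1, 1]))  # [210, 210], since 2 * 3 * 5 * 7 = 210
```

Each pass through the loop must wait for the previous layer to finish, which is why adding GPUs moves the work around rather than parallelizing it.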