I realize, I need to upgrade my little NUC to something bigger for higher inference of bigger llama models. I want something that you still can have on your living room’s tv bench, so no monster rack please, but that has also the necessary muscle when needed for llama. Budget doesn’t matter right now, want to understand what’s good and what’s out there. Thanks

EDIT: Wow, thanks for the inspiration, guess I need to look at bit for “how to stuff a huge graphics card into a mini box”. To clarify a bit more what I want with it: I want to build a responsive personal assistant. I am dreaming of models bigger than 8B, good tool calling for things like memory, websearch etc., no coding, no image generation, no video generation required. Image recognition would be good but not a must. Regarding footprint, the no monster ;) Something that you can have in your livingroom, and could be wife approved - so no big gaming rig with exhaust pipes and stuff, needs to be good looking ;)

  • Scott 🇨🇦🏴‍☠️@sh.itjust.works
    link
    fedilink
    English
    arrow-up
    6
    arrow-down
    4
    ·
    5 hours ago

    From what I have observed, the Fediverse is against “AI”. I doubt if you will find your answer here. “AI” is using too much water, electricity and is putting people out of work.

    • bazinga@discuss.tchncs.deOP
      link
      fedilink
      English
      arrow-up
      4
      ·
      edit-2
      3 hours ago

      I agree. I also think that there is nothing good in for-profit AI corporations. I can recommend the book “the empire of AI” However, I personally think self hosting and having full control of the use is a bit different.

    • illusionist@lemmy.zip
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      2
      ·
      edit-2
      5 hours ago

      Not the whole fediverse.

      I have a good efficiency boost thanks to LLMs. They are not perfect, they lie and everything. But they write simple and good bash scripts. They know cron and regex. Stuff that I could do but I don’t want to.

      Creating videos is costly. My LLM usage compared to creating videos is a joke. People playing bus simulator is much worse than me asking a llm how the function is called to calculate a mean.

      Ai is not the problem. People using it are.