I’ve tried coding and every one I’ve tried fails unless really, really basic small functions like what you learn as a newbie compared to say 4o mini that can spit out more sensible stuff that works.

I’ve tried explanations and they just regurgitate sentences that can be irrelevant, wrong, or get stuck in a loop.

So. what can I actually use a small LLM for? Which ones? I ask because I have an old laptop and the GPU can’t really handle anything above 4B in a timely manner. 8B is about 1 t/s!

  • MTK@lemmy.world
    link
    fedilink
    English
    arrow-up
    14
    ·
    4 months ago

    Have you tried RAG? I believe that they are actually pretty good for searching and compiling content from RAG.

    So in theory you could have it connect to all of you local documents and use it for quick questions. Or maybe connected to your signal/whatsapp/sms chat history to ask questions about past conversations

      • MTK@lemmy.world
        link
        fedilink
        English
        arrow-up
        14
        ·
        4 months ago

        RAG is basically like telling an LLM “look here for more info before you answer” so it can check out local documents to give an answer that is more relevant to you.

        You just search “open web ui rag” and find plenty kf explanations and tutorials

        • iii@mander.xyz
          link
          fedilink
          English
          arrow-up
          4
          ·
          4 months ago

          I think RAG will be surpassed by LLMs in a loop with tool calling (aka agents), with search being one of the tools.