I’ve been using Speech Note (github link) for months, but it often gets things wildly wrong.

I thought it was my mic, so I got one that’s crystal clear. I also tried a ton of different models, and other than being slow (or fast), their accuracy is usually pretty similar.

But I’m still needing to take a lot of time to edit the results, and I wonder if there’s something I should be doing to get better results.

On other speech-to-text platforms (like Futo keyboard on Android), the results are fast and very accurate. I have a hard time believing that Speech Note can’t be as good.

Can any other users share their experience?

  • Showroom7561@lemmy.caOP
    link
    fedilink
    arrow-up
    2
    ·
    15 hours ago

    It’s using my Nvidia GPU to do the LLM thing, so that may be the difference.

    This could be!

    Interestingly enough, I was playing around with LLama, as they have speech to text to interact with their chat bot, and it converts in near real-time with very good accuracy. So I do know that things can be fast and accurate, but I wish it was in Speech Note. LOL

    For now, I may just to STT through my phone on a shared document with my laptop.