Title.

Don’t really want to give my soul to a tech megacorp, but I do a lot of writing for work and hobbies, and sometimes carpal tunnel gets the best of me.

7 points

I don’t know what platform you’re using, but for desktop Linux I use Nerd Dictation and for Android I use FUTO Keyboard. Both are completely offline and FOSS.

4 points

Ah sorry. Linux Desktop, yes.

EDIT: I have now tested Nerd Dictation for a bit.

I find it funny how the English model doesn’t get a single word wrong, even through my 12-inch-thick Brazilian accent

… But the Brazilian Portuguese model could barely understand anything I was saying.

More an observation than anything. It amused me.

5 points

Ah yes, the anglocentrism of software development. I gave up on Google Translate years ago because it clearly only catered to English speakers. The other way around, it produced such ham-fisted mistranslations with English syntax that it was basically useless.

1 point

I believe FUTO Keyboard isn’t FOSS.

3 points

You are right-ish.

They have a source-available license, which allows all modification and redistribution except modification and redistribution for commercial purposes.

How you feel about that is up to you. Personally, I feel like allowing commercial usage of the code is actually a major blind spot in the normal FOSS licenses, as it leaves the door open for TiVoization.

3 points
5 points

Uhhh is that not pretty much the definition of a “tech megacorp”?

2 points

Teeeeeeeeeechnically what they linked is for running the AI model on your own machine, so not much is given to OpenAI when you do so, other than, I guess, attention.

… But yes, they are absolutely a tech megacorp. It then depends on how you define “giving your soul” when it pertains to said tech megacorp.

3 points

What about Speech Note? I have used it for transcription of Zoom calls and it does a pretty good job! GitHub link

2 points

Gave it a shot too (it even has an option to use the same language model as the Nerd Dictation cited by another poster).

It works. Once again, it works very well in English and just gives up in Portuguese.

I guess I’ll just have to accept that computers are anglo-only tools. [cackles]

2 points

I’ve been looking at Vosk, but haven’t found a good client for their API yet. Anyone else?
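For anyone in the same spot, here is a rough sketch of what a bare-bones offline client could look like using Vosk's Python bindings directly. It assumes the `vosk` package is installed and that a model has already been downloaded and unpacked locally; the function names, the `model_dir` and `wav_path` parameters, and the 4000-frame chunk size are all just illustrative choices, not anything from the Vosk project itself.

```python
import json
import wave


def extract_text(result_json: str) -> str:
    """Pull the recognized text out of one Vosk JSON result string."""
    return json.loads(result_json).get("text", "").strip()


def transcribe_wav(wav_path: str, model_dir: str) -> str:
    """Transcribe a 16-bit mono PCM WAV file with a locally stored model.

    wav_path and model_dir are hypothetical paths supplied by the caller.
    """
    # Imported here so extract_text() stays usable without vosk installed.
    from vosk import KaldiRecognizer, Model

    with wave.open(wav_path, "rb") as wav:
        if wav.getnchannels() != 1 or wav.getsampwidth() != 2:
            raise ValueError("expected 16-bit mono PCM audio")

        recognizer = KaldiRecognizer(Model(model_dir), wav.getframerate())
        pieces = []
        while True:
            chunk = wav.readframes(4000)  # arbitrary chunk size
            if not chunk:
                break
            # AcceptWaveform returns a truthy value once an utterance ends.
            if recognizer.AcceptWaveform(chunk):
                pieces.append(extract_text(recognizer.Result()))
        # Flush whatever audio is still buffered in the recognizer.
        pieces.append(extract_text(recognizer.FinalResult()))

    return " ".join(piece for piece in pieces if piece)
```

Everything runs locally, so no audio leaves the machine. Swapping in a different language is just a matter of pointing `model_dir` at another downloaded model, though accuracy will depend on the model.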


Free and Open Source Software

!foss@beehaw.org


If it’s free and open source and it’s also software, it can be discussed here. Subcommunity of Technology.


This community’s icon was made by Aaron Schneider, under the CC-BY-NC-SA 4.0 license.
