Most AI translation tools rely on cloud services.

Audio leaves your device, gets processed somewhere else, and comes back translated.

We wanted to explore a different approach.

PolyTalk is an open-source translation platform built around the idea that speech recognition, translation, and speech synthesis can be powered by open models and deployed on infrastructure you control.

The project combines open-source components for transcription, translation, and TTS into a privacy-first workflow.

Curious how others in the open-source AI community think about privacy and ownership when it comes to AI-powered communication tools.

GitHub: https://github.com/PolyTalkIO/polytalk

  • hendrik@palaver.p3x.de
    link
    fedilink
    English
    arrow-up
    1
    ·
    edit-2
    2 hours ago

    Yeah, a few years ago, most of that was Google Translate. To be fair it has some limited on-device features. All of this used to be proprietary technology, though.

    Not sure if I need tools for business meetings, at least on a regular basis. We kind of all agreed to use either the local language or English as the universal language in software development. And people are expected to be somewhat fluent. And if you clients are abroad, you better hire a real translator at some point. Or you’ll end up like Microsoft with all the messed up translations in Windows 11. It’ll be handy at times, though. Or if you work on a construction site. And some other jobs.