Most AI translation tools rely on cloud services.

Audio leaves your device, gets processed somewhere else, and comes back translated.

We wanted to explore a different approach.

PolyTalk is an open-source translation platform built around the idea that speech recognition, translation, and speech synthesis can be powered by open models and deployed on infrastructure you control.

The project combines open-source components for transcription, translation, and TTS into a privacy-first workflow.

Curious how others in the open-source AI community think about privacy and ownership when it comes to AI-powered communication tools.

GitHub: https://github.com/PolyTalkIO/polytalk

  • Sergio@piefed.social
    link
    fedilink
    English
    arrow-up
    2
    ·
    5 hours ago

    lel I worked on a couple speech interface projects back in the 00s before all these corporate spyware platforms emerged. Naturally, it was all on-device (or a local server we controlled). This was more R&D/prototype stuff so it wasn’t as robust as systems nowadays, but the software is still out there: