Most AI translation tools rely on cloud services.
Audio leaves your device, gets processed somewhere else, and comes back translated.
We wanted to explore a different approach.
PolyTalk is an open-source translation platform built around the idea that speech recognition, translation, and speech synthesis can be powered by open models and deployed on infrastructure you control.
The project combines open-source components for transcription, translation, and TTS into a privacy-first workflow.
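To give a rough idea of how the three stages chain together, here is a minimal sketch. It is not PolyTalk's actual code or API: the class name, the stage signatures, and the stand-in functions are all hypothetical, chosen so the example runs without any model installed. In a real deployment each stage would wrap a locally hosted open model.

```python
from dataclasses import dataclass
from typing import Callable

# Hypothetical stage signatures (assumptions, not PolyTalk's API).
Transcriber = Callable[[bytes], str]         # audio -> source-language text
Translator = Callable[[str, str, str], str]  # text, source lang, target lang -> text
Synthesizer = Callable[[str, str], bytes]    # text, lang -> audio


@dataclass
class SpeechTranslationPipeline:
    """Chains transcription, translation, and TTS; every stage is swappable."""
    transcribe: Transcriber
    translate: Translator
    synthesize: Synthesizer

    def run(self, audio: bytes, src: str, dst: str) -> bytes:
        text = self.transcribe(audio)                # speech recognition
        translated = self.translate(text, src, dst)  # text translation
        return self.synthesize(translated, dst)      # speech synthesis


# Stand-in stages: placeholders that mimic the data flow only.
def fake_transcribe(audio: bytes) -> str:
    return audio.decode("utf-8")


def fake_translate(text: str, src: str, dst: str) -> str:
    glossary = {("en", "es"): {"hello": "hola"}}
    # Fall back to the untranslated text for unknown pairs or words.
    return glossary.get((src, dst), {}).get(text, text)


def fake_synthesize(text: str, lang: str) -> bytes:
    return f"[{lang}] {text}".encode("utf-8")


pipeline = SpeechTranslationPipeline(fake_transcribe, fake_translate, fake_synthesize)
result = pipeline.run(b"hello", "en", "es")
```

Because each stage is just a callable, nothing in the pipeline needs a network call: audio goes in and audio comes out on whatever machine runs it.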
We're curious how others in the open-source AI community think about privacy and ownership when it comes to AI-powered communication tools.


lel I worked on a couple of speech interface projects back in the 00s, before all these corporate spyware platforms emerged. Naturally, it was all on-device (or on a local server we controlled). This was more R&D/prototype stuff, so it wasn’t as robust as systems nowadays, but the software is still out there: