[edit]
Calame: An Open Source Transcription Software
Proceedings of the The 39th Canadian Conference on Artificial Intelligence, PMLR 318:990-996, 2026.
Abstract
While research on automatic speech processing is very active, its outcomes remain mainly inaccessible to people without programming skills or expertise. Moreover, studies focus mostly on high-resource languages and conventional setups, preventing a wider adoption and social impact of these technologies. Automatic speech processing systems can be needed in a variety of use cases, such as automatic transcription of meetings, interviews, or even conferences. They can also be useful for subtitling and dictation, or to interact with voice assistants. Non-experts may rely on commercial solutions, but these typically lack modularity, o!er only partial functionalities, increase exposure to cyber threats, and impose significant financial barriers for potential users. As automatic transcription techniques improve, it becomes crucial to make these tools accessible to both the research community and the general public. To make language technology more inclusive, we released Calame, a free, open-source, and accessible software for automatic multilingual speech processing, available for both local and remote use. Its current language coverage includes English and French, with Quebec French and other low-resource languages being gradually incorporated with state-of-the-art fine-tuned models.