The Transcription plugin enables users to generate high-quality text transcriptions from a variety of audio and video file formats, utilizing advanced speech-to-text technology. It supports multiple transcription engines, including cloud-based Swiftink.io, which offers domain-aware, free, high-quality transcriptions, and Whisper ASR, an open-source local option. Users can transcribe multiple files simultaneously, access start and end timestamps for each transcribed line, and work with the transcriptions in the background. Additionally, Swiftink.io provides summaries, outlines, and notes for each transcription, enhancing the productivity and organization of transcribed content.
The Speech To Text Keyboard Helper plugin enhances the functionality of speech-to-text services on mobile devices, specifically addressing the limitation of the Google Speech to Text Android keyboard, which does not handle new line commands. This plugin allows users to append or prepend new lines via Obsidian's command palette, providing smoother note-taking experiences. Additionally, the new line commands can be added to the mobile buttons toolbar for quick access, improving usability for those who rely on speech input for text editing.
The Text2Audio plugin is a powerful tool that converts your text notes into audio files, allowing you to focus on reading and reviewing your content without the need for manual typing. With this plugin, you can quickly convert selected text or entire notes to audio, making it ideal for individuals who prefer listening over reading. The plugin also offers customizable settings, including language selection and keyboard shortcuts, ensuring a seamless experience. Whether you're looking to improve your note-taking workflow or enhance your learning process, the Text2Audio plugin is definitely worth exploring.
The Transcription Audio (Beta) plugin converts linked audio files into structured Markdown content directly within your notes. It automatically detects audio links or embeds in the active note, sends the file to Google Gemini for transcription and summarisation and inserts the generated text exactly where your cursor was placed. A dedicated rightside progress panel shows each step in real time, including file details, request timing and success or error states, so you always know what is happening. The workflow is simple and command driven, designed to keep you focused on writing rather than managing files.