Azure speech to text batch transcript

4/29/2023

Regarding the Visual Studio solution, when you want to package the binaries, you can use the Project -> (right-mouse click) -> Publish -> Publish to Folder feature, to create a portable version. You were required to write the business logic in DLLs, test the service in exe wrapper, or debug attaching to an running service.

In the past was a nightmare to debug windows services. The built package can then be run from command line or installed as a service. Net Core implementation, is that you can easily run the code within Visual Studio without issues. The first cool thing about the new BackgroundService. I started reading the documentation about how to develop a windows background service and the I started coding. So I approached the same way, but using a more modern approach with. I was used to write Windows Background Service in the past, in the banking industry, to process text records file. To show the last transcription done, write a fixed txt file with the last result (For most usage, this is useless).txt file in the same "Processed" folder of the windows machine Save the audio transcript from cloud to a recognizable.Move the original input file from Input Folder to a "Processed" folder, adding timestamp to the filename.If necessary, convert it to a Cloud friendly format (1 or 2 audio channels, 16 bit depth, 16 Khz frequency).when a new file is detected, verify the format (channels, frequency, bit depth).polling on a specific folder, waiting for new wave files.Run in background on a windows machine, even after a reboot.Ĭheck out the dedicated documentation regarding Batch Transcription API for further info. To process long file and if you prefer a more "Batch" solution take a look to this great solution by PanosPeriorellis. The transcription, in fact stop when a long pause is detected in the original file. Another point is when you want to transcript short duration file. The Speech API I'm using is the synchronous one, so this project fit great when you don't want to setup complex architecture on Azure and you prefer the processing on the VM. The second challenge was to have this service run autonomously on a windows server. Not really a friendly format digested by cloud STT (Speech-To-Text). The first challenge was that the file format could be a Wave PCM with 4 channels, recorded at 48 kHz. This project aimed to create a simple Windows Background service in C# code, to perform subsequent Speech-To-Text (STT) process using Azure cloud. Then the desiderata was to have the transcript of the recorded audio along with the original recording. I had recently faced a challenge in creating a DEMO where an hardware device, triggered by a command, posted a wave audio file recording to a folder on a server.

The Speech-To-Text and Text-To-Speech are now widely used service, due to the network speed that is no longer a bottleneck, in most of the cases. A really interesting area is related to the tools each CSP (Cloud Service Provider) is providing to empower developers in creating interesting use cases. In the last decade, the cloud power has spinned incredible advancement in those techniques. I've always been really passionate about digital audio processing. Net Core installation required, can be obtained in the GitHub Release page The source code of solution presented in this article,is available atĪ portable release, no.

0 Comments

Azure speech to text batch transcript

Leave a Reply.

Author

Archives

Categories