Visual Guide: Audio To Text
This post covers how to transcribe audio content.
We will use a podcast as our sample audio content, and assume you are on a free trial. The steps are:
- Sign in to the application.
- Create an audio-specific template.
- Create a new item, with our audio-specific template as its parent.
- Upload our podcast, and then use an Artificial Intelligence (AI) to transcribe our content.
After working through this visual guide, you will be able to upload your own audio content and transcribe it using an AI.
In summary: we first create a new template, audioTemplate, which we set up specifically for podcasts as a one-off. We then use it to create an audio item, myFirstPodcast, of type audioTemplate, for transcription.
Steps - Create An Audio Template
Open your browser, go to videotranslator.ai, and click the Login button. Assuming you have just registered for the free trial, the image below is what you should see when you sign in to the application.
Audio To Text: Register for the Free Trial, then Login
Normally, we could just use myTemplate. However, as this is a visual guide, we will make a new template and do this the long-winded way. Click the New Template button, use audioTemplate for the name as shown in the image below, and click Submit.
Audio To Text: New Template 'audioTemplate'
This is a new template, and we are going to add an Audio component. Click the Add Fields button, then select Audio, as shown below.
Audio To Text: Add an audio component (field)
Once added, expand the audio component using the highlighted + button. The other highlight shows what the audio component will look like once we create a new item.
Audio To Text: Audio component expanded
Use the action bar to Publish the template. This makes the template usable for creating new items. We have also changed the Information to be "Please upload the podcast?"; this change is purely cosmetic.
Audio To Text: Publish 'audioTemplate'
Exit the template editor. Your Root will now look like the image below. The green highlight indicates that this template was just created. You have successfully created your ‘audioTemplate’.
Audio To Text: 'audioTemplate' is now ready
Why am I doing this?
The template editor gives you the flexibility to tailor your template to your specific use case. Specifically, in addition to the ‘audio component’ we added to our template, you could add:
- Image Fields: If you are uploading your content to a third-party audio sharing service, you can add a poster or banner image, which should boost engagement.
- Text Fields: If you are uploading your content to a third-party audio sharing service, you can add a title, description and other metadata as additional text fields. Uploading these can improve SEO substantially, and the effect is magnified if we also translate the audio content.
Should I do this every time?
No, absolutely not. The entire idea of a template is to create a data structure that meets your specific use case once, and then reuse it every time.
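To make the relationship between a template and its items concrete, here is a minimal Python sketch. It is not the application's actual data model or API: the class names `Template` and `Item`, the field kinds, and the `new_item` method are all hypothetical, chosen only to illustrate defining a structure once and reusing it for every item.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional, Tuple


@dataclass(frozen=True)
class Template:
    """Hypothetical stand-in for a template: a named, reusable set of fields."""
    name: str
    fields: Tuple[str, ...]  # field kinds, e.g. "audio", "image", "text"

    def new_item(self, item_name: str) -> "Item":
        # Every new item starts with one empty slot per template field.
        empty_values = {kind: None for kind in self.fields}
        return Item(name=item_name, template=self, values=empty_values)


@dataclass
class Item:
    """Hypothetical stand-in for an item created from a template."""
    name: str
    template: Template
    values: Dict[str, Optional[str]] = field(default_factory=dict)


# Define the template once...
audio_template = Template(name="audioTemplate", fields=("audio",))

# ...then reuse it for every podcast episode.
first = audio_template.new_item("myFirstPodcast")
second = audio_template.new_item("mySecondPodcast")
```

Both items share the same template, but each holds its own values, so uploading audio into one item does not affect the other.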
Steps - Create An Audio Item, and Transcribe
Click audioTemplate, and then click Add New Item to create an item of type audioTemplate, as shown below.
Audio To Text: Create 'myFirstPodcast'
Today, we will use a podcast from the Creative Commons team, known as Plays Well With Others. We are big fans of the work Creative Commons does, and you should check them out if you are not familiar with the contribution of CC to modern open source software.
Upload the *.mp3 file, and you should see something like the image below.
Audio To Text: Upload your podcast
Now, trigger the transcription. We used American English in this case. Accept to start the process. The item will exit, and lock itself until the AI completes the transcription process.
Audio To Text: Transcribe your podcast
You are done! Click on the Captions to see the results of the AI. We recommend checking and editing the transcript to make sure the AI has transcribed your podcast properly.
Remember to check grammar and capitalisation. In the transcript below, 'plays well with others' is not capitalised. The AI did not pick this up because the phrase was spoken normally, without any special emphasis. Unfortunately, people do not speak the way the rules of English require us to write.
Audio To Text: Transcript of your podcast
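If you export the transcript as plain text, part of the capitalisation check can be scripted. The following is a minimal sketch, not a feature of the application: the helper `restore_titles` is hypothetical, the sample sentence is made up rather than taken from the actual transcript, and the approach assumes you already know which titles or names should be capitalised and that each one starts and ends with a letter or digit.

```python
import re
from typing import List


def restore_titles(transcript: str, titles: List[str]) -> str:
    """Replace case-insensitive matches of each title with its proper casing."""
    for title in titles:
        # \b stops us matching inside longer words; re.escape keeps any
        # punctuation in the title from being read as regex syntax.
        pattern = re.compile(r"\b" + re.escape(title) + r"\b", re.IGNORECASE)
        # A function replacement inserts the title literally, untouched by
        # backslash or group-reference processing.
        transcript = pattern.sub(lambda _match: title, transcript)
    return transcript


# Made-up sample text, for illustration only.
raw = "welcome to plays well with others, a podcast from creative commons."
fixed = restore_titles(raw, ["Plays Well With Others", "Creative Commons"])
print(fixed)
```

A script like this only fixes phrases you list in advance, so it complements a manual read-through rather than replacing it.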
Conclusion
In this visual guide, we used the application to transcribe an English podcast with AI assistance.
Please connect with us on LinkedIn, YouTube or Facebook for any comments, questions, or just to keep up to date with the work we do!
We are very grateful for your support!
If you are interested in trying out our technology, please try our platform or drop us an email at hello@videotranslator.ai.