Let's dive into a simple step-by-step guide on how to use the Voice Changer tool.
Before, we get started, to make the learning process more simple, we'll break this process into three main parts:
Transcribing your Audio/Video file
Media Upload and Language Selection
The first step involved is to upload your media file from the Voice Changer tab in the side bar and select the language of the speech present in the uploaded media.
Selecting the appropriate language during media upload is critical to ensure the accuracy of the transcription.
💡Additionally, you can import audio and video files from the Add Media menu. Once imported, simply right-click on the file from the bottom timeline to access the option to transcribe the file.
Script Editor Screen
Once you've uploaded your media and select the appropriate language for transcription, you would be then shown a Script Editor Screen.
If you're looking at only transcribing your audio and want to export the script, simple click on Accept and Save and head to Export > Script.
After the transcription of your script is visible in the Script Editor screen, the Studio default voice - Nate will automatically generate the audio.
There are two possible outcomes that can happen:
1. The Studio may encounter Block Errors.
2. The Studio may not encounter any Block Errors.
Understanding Block Errors
To understand Block Errors it is important to understand the concept of Uploaded Duration and Generated Audio Duration.
Understanding the pace of delivery for different voices is key. This pace determines how long it takes for a particular voice to narrate a sentence.
Uploaded Duration
Uploaded Duration refers to the length of time the voice took in the uploaded audio to narrate the script.
Generated Audio DurationIt is important to keep in mind that each AI voice also has its own distinct pace of delivery, much like human voices. However, we have more control over this as you can customise the pace to your preference using the Speed feature in the Studio.
If a block error occurs in the Studio, it's because the Uploaded Duration is shorter than the Audio Generated Duration. The erroneous block will be highlighted in the sidebar to alert you of the issue as shown below.
As a result, the generated audio file may be longer than the original, which can cause synchronisation issues if the audio is being transcribed for a video.
❗In cases where the Uploaded Duration is longer than the Generated Audio Duration, the Studio will not display an error message.
Fixing Block Errors
There are 4 ways of fixing a block error:
Changing the Voice
It's worth noting that each AI voice has its own unique pace of delivery, which is typically measured in words per minute (wpm).
For reference, an average English speaker speaks at the rate of 120-180 wpm.
To achieve a more accurate match between the uploaded voice and the AI-generated voice, consider selecting a different AI voice that matches the wpm of the original recording.
Increasing Speed
One of the benefits of using AI voices is the option to adjust the narration speed to match the pace of the original uploaded voice.
By increasing the speed, you can shorten the generated audio duration to achieve a closer match with you original recording.
It is advisable to avoid increasing or decreasing the speed by more than 25-30% as it may result in the generated audio sounding too fast or too slow.
Editing the Content
If the previous options of changing the voice or altering the speed do not prove effective in achieving a match between the generated audio and the uploaded audio, we recommend considering editing the content of the text.
This may involve rephrasing and summarising the blocks with a significant error margin. While this may result in changes to the original script, it is one of the most effective ways to ensure that the script matches the uploaded audio duration and maintains synchronisation.
Accepting the Errors
As a last resort in fixing block errors, you can choose to accept the error. This means acknowledging that the generated audio duration may be longer than the uploaded audio duration, resulting in all blocks moving forward by a few seconds depending on the margin of error present.
Watch this video on working with SRT files -