LipSync videos with Custom Voices
Last updated
@Dara.network / Gooey.AI / support@gooey.ai
Lipsync'd videos are amazing, and they are even better with a custom voice. Companies like ElevenLabs let you create custom voices by uploading voice samples, and we are very happy to announce that you can now use your custom ElevenLabs voice inside the Gooey.AI Lipsync Maker (and Copilot) workflows.
Sign up on Eleven Labs: https://elevenlabs.io/sign-up
Go to the Voice Lab section (https://elevenlabs.io/voice-lab) and click on "Add Generative or Cloned Voice".
You will be shown multiple options for generating the new voice. As of now, they offer generated voices and community voices to free users, and Instant Voice Cloning and Professional Voice Cloning to subscribers.
Generated voices are created by adjusting parameters such as gender, age, and accent. This option doesn't take audio samples, so the result might not be exactly what you had imagined.
Community voices are those made publicly available by other users. Eleven Labs does have a good collection of community voices but you might not find exactly what you are looking for.
If you subscribe to their lowest paid tier, which costs $5 / month, you can use Instant Voice Cloning to clone a voice from audio samples within seconds. We'll use this to create a clone of Nelson Mandela's voice, so we can recreate his speeches from text.
For our use case, we converted two of his clips, "Nelson Mandela's special message to INTERPOL 75th General Assembly" and "Nelson Mandela's message to BFS", to mp3 format and uploaded them in the Eleven Labs Instant Voice Cloning interface.
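The article does not say which tool was used for the mp3 conversion. As a minimal sketch, assuming ffmpeg is installed and on your PATH, the conversion could look like the following. The file names are hypothetical placeholders, and the quality setting is only a reasonable default, not a requirement of Eleven Labs.

```python
import shutil
import subprocess
from pathlib import Path


def build_mp3_command(source: str, quality: int = 2) -> list[str]:
    """Build an ffmpeg command that extracts a clip's audio track as MP3."""
    target = str(Path(source).with_suffix(".mp3"))
    return [
        "ffmpeg",
        "-i", source,              # input clip (video or audio)
        "-vn",                     # ignore any video stream
        "-codec:a", "libmp3lame",  # encode the audio as MP3
        "-q:a", str(quality),      # VBR quality level; lower means higher quality
        target,
    ]


if __name__ == "__main__":
    # Hypothetical file names; replace them with your own clips.
    for clip in ["interpol_message.mp4", "bfs_message.mp4"]:
        command = build_mp3_command(clip)
        if shutil.which("ffmpeg") and Path(clip).exists():
            subprocess.run(command, check=True)
        else:
            # ffmpeg or the clip is missing, so only show what would run.
            print("Would run:", " ".join(command))
```

The resulting mp3 files are what you would then upload as voice samples.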
We agree to the
Note: Eleven Labs offers several kinds of custom voices. Visit their VoiceLab documentation to learn more about them.
In the Profile Settings (opened from the profile icon in the top right corner of the Eleven Labs website), you will find your API key. Make it visible and copy it to use in Gooey.
Source for these images is the Eleven Labs documentation.
Visit the workflow page for LipSync with Text and, in "Input Face", add the video or image you want to extract the face from. You will need to remove the default uploaded file and upload your own video or image file.
Open the "Settings" dropdown and, in "Voice Settings", set the "Speech Provider" to "Eleven Labs".
Select the checkbox that says "Use custom API key + Voice ID" and enter the copied API key into the field that says "Your Eleven Labs API Key".
This should fetch all the available voices from your account into the "Voice ID" field.
Choose the voice you want to use in the "Voice ID" field. In our case, we select the voice we added in Eleven Labs under the name "Nelson Mandela".
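If the "Voice ID" field stays empty, it can help to check which voices your API key can see. The sketch below is not how Gooey fetches voices (the article does not describe that). It assumes the Eleven Labs public API exposes a voice list at `https://api.elevenlabs.io/v1/voices`, authenticated with an `xi-api-key` header, so please verify both against the Eleven Labs API reference. The environment variable name is a hypothetical choice.

```python
import json
import os
import urllib.request

# Assumed endpoint of the Eleven Labs public API; verify in their API reference.
VOICES_URL = "https://api.elevenlabs.io/v1/voices"


def fetch_voices(api_key: str) -> list[dict]:
    """Return the voices visible to this API key (makes a network call)."""
    request = urllib.request.Request(VOICES_URL, headers={"xi-api-key": api_key})
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response).get("voices", [])


def find_voice_id(voices: list[dict], name: str):
    """Return the voice_id whose name matches (case-insensitive), else None."""
    wanted = name.strip().lower()
    for voice in voices:
        if voice.get("name", "").strip().lower() == wanted:
            return voice.get("voice_id")
    return None


if __name__ == "__main__":
    # ELEVENLABS_API_KEY is a hypothetical variable name used for this sketch.
    api_key = os.environ.get("ELEVENLABS_API_KEY")
    if api_key:
        print(find_voice_id(fetch_voices(api_key), "Nelson Mandela"))
    else:
        print("Set ELEVENLABS_API_KEY to list your voices.")
```

Reading the key from an environment variable keeps it out of the script itself, which matters because the key grants access to your Eleven Labs account.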
Now we can copy some text from his 1964 speech, "I Am Prepared to Die", and add it to the "Input Text" field.
We hit "Submit" and voila! We get an accurate-sounding version of the speech with a newer photo.
This is the run URL if you want to tweak and explore more: https://gooey.ai/lipsync-maker/?run_id=sbr6yi8i&uid=MPhrEpmVYkept8yJjsBzJPL0Tuj1
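If you collect several run URLs like this one, you may want to pull out their identifiers. This is a small sketch using only the Python standard library. The `parse_run_url` helper is hypothetical, and it assumes nothing about Gooey beyond the `run_id` and `uid` query parameters visible in the URL above.

```python
from urllib.parse import parse_qs, urlparse


def parse_run_url(url: str) -> dict:
    """Split a Gooey run URL into its workflow slug, run_id, and uid."""
    parsed = urlparse(url)
    query = parse_qs(parsed.query)
    return {
        "workflow": parsed.path.strip("/"),
        "run_id": query.get("run_id", [None])[0],
        "uid": query.get("uid", [None])[0],
    }


if __name__ == "__main__":
    run_url = (
        "https://gooey.ai/lipsync-maker/"
        "?run_id=sbr6yi8i&uid=MPhrEpmVYkept8yJjsBzJPL0Tuj1"
    )
    print(parse_run_url(run_url))
```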
There are more Eleven Labs options that you can explore in the Gooey dashboard, such as the model to use and the voice settings. You can play around with them on Gooey and learn more about how to use them effectively from the Eleven Labs documentation.
On the Eleven Labs website, click on the profile icon in the top right corner, and select the Profile option.