Overview of Tavus's Replica offerings: Stock Replicas and Personal Replicas, all powered by the Phoenix AI model. Get tips on how to create the perfect Replica and how to get high-quality output.
A Replica is a realistic video model of a human created using the Phoenix model. The Phoenix model is a fully synthetic, 3D-based model that generates realistic Replica videos from just a script, complete with natural facial movements (lips, cheeks, nose, chin) and expressions synchronized with your script and generated voice. Developed by our team, the model bypasses traditional methods and constructs dynamic, three-dimensional facial scenes using neural radiance fields (NeRFs).
Replicas are created using just 2 minutes of training data, and are designed to learn how someone speaks and sounds, how they look, and how they move their face while speaking. Using a Replica, you can generate hyper-realistic videos that look and sound just like you, from text alone, in up to 30 languages.
It's important to provide a high-quality input video in order to get great outputs from a Replica. Your Replica will attempt to mimic your gestures and movements, as well as your accent, even if you generate a video in a different language.
Here's an example of an output from one of our Stock Replicas:
Personal Replicas allow you to train a new Replica of a human using the Phoenix model, from just 2 minutes of training data. Personal Replicas take between 4 and 6 hours to train. You can only train Replicas using training data that includes a verbal consent statement. Personal Replicas go through Voice and Face ID checks to ensure consent is present.
Learn how to create a high-quality personal replica with just a few minutes of training data.
Personal Replicas allow you to train a new Replica of a human using the Phoenix model, from just 2 minutes of training data. Personal Replicas take between 4 and 6 hours to train, and are available on all plans except Starter.
You can create a Replica via the Avatar dashboard. Navigate to the Replicas tab in our portal. Here, you'll be able to record in-app or upload footage to create a new Replica.
Your journey to creating a personal Replica begins with a simple requirement: a two-minute video of you engaging with the camera. There is no predefined script beyond the consent statement; you can discuss anything that showcases your natural speaking style and expertise.
Our platform simplifies the first step. Use your webcam through the developer portal to capture the essence of your persona. Achieving the best possible Replica involves attention to detail. Here's how:
Here's an example of high-quality training footage:
An integral part of the process involves reading a specific authorization phrase. This step confirms your consent and kicks off the Replica creation process.
“I, [FULL NAME], am currently speaking and give consent to Tavus to create an AI clone of me by using the audio and video samples I provide. I understand that this AI clone can be used to create videos that look and sound like me.”
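If you want to display the authorization phrase to a speaker before they record, the statement above can be filled in programmatically. The following is a minimal sketch, not part of the Tavus product: the names `CONSENT_TEMPLATE` and `consent_statement` are hypothetical, and the only thing taken from this guide is the wording of the phrase itself.

```python
# Hypothetical helper: fills the speaker's full name into the consent phrase
# quoted in this guide. The function and constant names are illustrative only.
CONSENT_TEMPLATE = (
    "I, {full_name}, am currently speaking and give consent to Tavus to "
    "create an AI clone of me by using the audio and video samples I provide. "
    "I understand that this AI clone can be used to create videos that look "
    "and sound like me."
)


def consent_statement(full_name: str) -> str:
    """Return the consent phrase with [FULL NAME] replaced by the given name."""
    # Collapse stray whitespace so the spoken phrase reads cleanly.
    name = " ".join(full_name.split())
    if not name:
        raise ValueError("full_name must not be empty")
    return CONSENT_TEMPLATE.format(full_name=name)


if __name__ == "__main__":
    print(consent_statement("Ada Lovelace"))
```

The speaker still has to read the phrase aloud in the training footage; generating the text does not replace the verbal statement.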
If you are uploading training footage, it's important that it is in the correct format:
We highly recommend recording all of your training footage in the language you are most likely to use for generated videos. This does not prevent you from creating future videos in a different language if desired!
Your replica will be processed in the background upon submission. This process will take around 4-6 hours. If you're not happy with your personal replica, be sure to contact us.
Tavus enables the creation of videos in a multitude of languages, expanding the reach of content globally. When you input a script in any of the supported languages, the resulting video features your replica articulating the message in that specific language.
For example, by providing a script in Spanish, as shown in the example below, your replica will deliver the content in Spanish, mirroring natural language nuances and expressions. You can even mix and match languages in the same script.
Please note that the voice cloning model attempts to maintain your accent even while speaking a different language. This can sometimes result in, for example, an American accent while speaking Spanish.
Ensure your face is evenly lit with no shadows.
Your space should be silent or almost silent.
Keep your background clear.
Use a high-quality camera with at least 2K resolution.
Start with your phone or computer's microphone.
Disable any software-based audio enhancements.
Maintain eye level with the camera and act naturally.
Be yourself and relax.
If possible, avoid beards, glasses, and accessories.
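A few of the checklist items above can be checked mechanically before you upload footage. The sketch below is an illustration under stated assumptions, not a Tavus tool: `FootageInfo` and `preflight_problems` are hypothetical names, the 120-second minimum comes from the two-minute requirement in this guide, and reading "2K" as a frame at least 2048 pixels wide is an assumption. Lighting, background, and delivery still need a human review.

```python
from dataclasses import dataclass

MIN_DURATION_SECONDS = 120  # from the two-minute requirement in this guide
MIN_WIDTH_PX = 2048         # assumption: one common reading of "2K"


@dataclass
class FootageInfo:
    """Hypothetical summary of a recording, filled in by you or your tooling."""
    duration_seconds: float
    width_px: int
    audio_enhancements_enabled: bool


def preflight_problems(info: FootageInfo) -> list[str]:
    """Return a list of human-readable problems; an empty list means none found."""
    problems = []
    if info.duration_seconds < MIN_DURATION_SECONDS:
        problems.append(
            f"Footage is {info.duration_seconds:.0f}s; "
            f"at least {MIN_DURATION_SECONDS}s is expected."
        )
    if info.width_px < MIN_WIDTH_PX:
        problems.append(
            f"Frame width is {info.width_px}px; "
            f"at least {MIN_WIDTH_PX}px is assumed for 2K."
        )
    if info.audio_enhancements_enabled:
        problems.append("Disable software-based audio enhancements.")
    return problems


if __name__ == "__main__":
    for problem in preflight_problems(FootageInfo(95.0, 1920, True)):
        print("-", problem)
```

Returning a list of messages rather than a single pass/fail value lets you fix every issue in one re-recording instead of discovering them one at a time.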
This comprehensive guide ensures you capture the highest quality footage for your replica, leading to a more authentic and engaging digital representation.
Review the following checklist to ensure your video recording is optimized for use as training footage for a Tavus digital replica.
