How many seconds does OpenAI need to clone your voice?

Answer: Fifteen.

  • Facebook
  • LinkedIn
  • Twitter
  • linkText
  • Email
Shutterstock
The latest offering from software company OpenAI is equal parts impressive and concerning. Called Voice Engine, it pretty much does exactly what its name suggests by accurately replicating the sound of someone’s voice.

The tool only needs to listen to a 15-second example of someone speaking in order to do so. Voice Engine has been in the works since 2022 and was built on OpenAI’s pre-existing text-to-speech API, which it has used for its popular AI chatbot ChatGPT. The company envisions Voice Engine as a helpful tool for language translation, reading assistance and assisting people who have speech conditions.

They are also aware of the technology’s potential for misuse, which is why it is not yet publicly available. OpenAI is currently taking feedback from “U.S. and international partners from across government, media, entertainment, education, civil society and beyond” in order to improve the tool’s safeguards against nefarious uses.
  • Facebook
  • LinkedIn
  • Twitter
  • linkText
  • Email