Resemble AI’s next-generation AI audio detection model, Detect-2B, is 94% accurate

Jun 27, 2024 | Technology

Voice cloning company Resemble AI has released the next generation of its deepfake detection model, which has an accuracy of around 94%. 

Detect-2B uses a series of pre-trained sub-models and fine-tuning to examine an audio clip and determine whether it was generated with AI. 

“Building upon the strong foundation of our original Detect model, DETECT-2B represents a major leap forward in terms of model architecture, training data, and overall performance. The result is an extremely robust and accurate deepfake detection model that achieves a remarkable level of performance when evaluated against a massive dataset of real and fake audio clips,” the company said in a blog post. 

According to Resemble, Detect-2B’s sub-models “consist of a frozen audio representation model with an adaptation module inserted into its key layers.” The adaptation module shifts the sub-models’ focus toward artifacts (the accidental sounds left in a recording) that often distinguish real audio from fake audio; most AI-generated clips can sound “too clean.” Detect-2B can predict how much of the audio is made by AI without retraining the model every time it listens to a new clip. The sub-models are also trained on large datasets.
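Resemble's description stops at "a frozen audio representation model with an adaptation module," so the following is only a rough illustration of that general pattern, not the company's architecture. It assumes a bottleneck adapter with a residual connection, which is one common way to add trainable parameters to a frozen layer; the class names, dimensions, and zero initialization are all hypothetical.

```python
import random


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def relu(values):
    return [max(0.0, v) for v in values]


class FrozenLayer:
    """Stands in for one layer of a pre-trained audio representation model.

    Its weights are fixed and are never updated during fine-tuning.
    """

    def __init__(self, weights):
        self.weights = weights

    def forward(self, x):
        return relu(matvec(self.weights, x))


class Adapter:
    """Hypothetical bottleneck adapter: the only part that would be trained."""

    def __init__(self, dim, bottleneck, rng):
        # Down-projection starts with small random values.
        self.down = [[rng.uniform(-0.1, 0.1) for _ in range(dim)]
                     for _ in range(bottleneck)]
        # Up-projection starts at zero, so the adapter is initially a no-op.
        self.up = [[0.0] * bottleneck for _ in range(dim)]

    def forward(self, h):
        delta = matvec(self.up, relu(matvec(self.down, h)))
        # Residual connection: frozen features plus a learned correction.
        return [a + b for a, b in zip(h, delta)]


layer = FrozenLayer([[1.0, -1.0], [0.5, 0.5]])
adapter = Adapter(dim=2, bottleneck=1, rng=random.Random(0))
features = adapter.forward(layer.forward([2.0, 1.0]))
```

With the up-projection at zero, the adapted layer returns exactly what the frozen layer produced; fine-tuning would then adjust only the adapter's weights to emphasize whatever cues separate real from generated audio.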

Detect-2B aggregates its prediction scores and compares these to “a carefully tuned threshold” be …
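The article does not say how the scores are aggregated or what the threshold is, so the sketch below is only an assumption-laden illustration of the idea: it averages hypothetical per-sub-model scores and compares the result to a placeholder threshold of 0.5. The function name, the use of a mean, and the threshold value are all invented for the example.

```python
from statistics import mean


def classify_clip(sub_model_scores, threshold=0.5):
    """Combine per-sub-model "fake" scores into a single decision.

    sub_model_scores: floats in [0, 1], one per sub-model (hypothetical).
    threshold: placeholder value; the real one is described only as
    "carefully tuned" and is not given in the article.
    """
    if not sub_model_scores:
        raise ValueError("need at least one sub-model score")
    aggregate = mean(sub_model_scores)  # assumed aggregation: simple average
    label = "fake" if aggregate >= threshold else "real"
    return {"score": aggregate, "label": label}


result = classify_clip([0.25, 0.75])
```

A stricter threshold would flag fewer real clips as fake at the cost of missing more generated ones, which is presumably why the value is tuned rather than fixed at a round number.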
