Security researchers unveil deepfake AI audio attack that hijacks live conversations

The AI was able to manipulate a live conversation between two people without either noticing.

IBM Security researchers recently discovered a “surprisingly and scarily easy” technique to hijack and manipulate live conversations using artificial intelligence (AI).

The attack, called “audio-jacking,” relies on generative AI (a class of AI that includes OpenAI’s ChatGPT and Meta’s Llama 2) and deepfake audio technology.

Audio jacking

In the experiment, researchers instructed the AI to process audio from two sources in a live communication, such as a phone conversation. The AI was further instructed that, upon hearing a specific keyword or phrase, it should intercept the related audio and manipulate it before passing it on to the intended recipient.

According to a blog post from IBM Security, the experiment ended with the AI successfully intercepting a speaker’s audio when the other human speaker asked them for their bank account information. The AI then replaced the authentic voice with deepfake audio giving a different account number. The “victims” in the experiment did not detect the attack.
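The keyword-triggered logic described above can be illustrated with a short text-only simulation. This is a minimal sketch under stated assumptions, not IBM's proof-of-concept: it works on already-transcribed text rather than audio, it leaves out speech recognition and voice cloning entirely, and the names `relay`, `TRIGGER`, `ACCOUNT_PATTERN` and `SUBSTITUTE_ACCOUNT` are hypothetical choices made for illustration.

```python
import re
from typing import Iterable, Iterator

# All values below are hypothetical and chosen for illustration only.
TRIGGER = "bank account"                      # phrase that arms the interception
ACCOUNT_PATTERN = re.compile(r"\b\d{6,12}\b")  # assumed shape of an account number
SUBSTITUTE_ACCOUNT = "000000000"              # placeholder replacement value


def relay(utterances: Iterable[tuple[str, str]]) -> Iterator[tuple[str, str]]:
    """Pass (speaker, text) pairs through unchanged, except that the first
    account number spoken after the trigger phrase is rewritten."""
    armed = False
    for speaker, text in utterances:
        if armed and ACCOUNT_PATTERN.search(text):
            # The trigger was heard earlier: swap the number, then disarm.
            text = ACCOUNT_PATTERN.sub(SUBSTITUTE_ACCOUNT, text)
            armed = False
        elif TRIGGER in text.lower():
            # The trigger phrase itself is relayed untouched.
            armed = True
        yield speaker, text


if __name__ == "__main__":
    conversation = [
        ("A", "Hi, how are you?"),
        ("B", "Fine. Could you give me your bank account number?"),
        ("A", "Sure, it is 123456789."),
        ("B", "Thanks, got it."),
    ]
    for speaker, text in relay(conversation):
        print(f"{speaker}: {text}")
```

Only the reply containing the account number is altered. Every other utterance reaches the listener untouched, which helps explain why neither party in the experiment noticed anything unusual.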

Generative AI

The blog points out that, while executing the attack would require some level of social engineering or phishing, developing the AI system itself posed little challenge:

“Building this PoC (proof-of-concept) was surprisingly and scarily easy. We spent most of the time figuring out how to capture audio from the microphone and feed the audio to generative AI.”

Traditionally, building a system to autonomously intercept specific audio strings and replace them with audio files generated on the fly would have required a multi-disciplinary computer science effort.

But modern generative AI does the heavy lifting itself. “We only need three seconds of an individual’s voice to clone it,” reads the blog, adding that, nowadays, these kinds of deepfakes are done via API.

The threat of audio jacking goes beyond tricking unwitting victims into depositing funds into the wrong account. The researchers also point out that it could function as an invisible form of censorship with the potential to change the content of live news broadcasts or political speeches in real time.
