thewayne | Using AI to hide secret messages via Steganography

Interesting stuff. I especially enjoyed the bits talking about older methods of hiding messages in plain sight, like marking words in print with invisible ink.

Steganography is an interesting art. It's not cryptography as the technically the text is plainly available - if you know how to read it. One method of steganography was encoding messages in photographs and then posting them online. There's lots of wasted bits in photos, so you alter the bits, which doesn't really alter the image, post the photo, the recipient knows how to decode the bits, the message is passed. But the technique is detectable because the image doesn't compress as well as an unaltered photo.

Detecting textual steganography requires that you analyze the message text and develop a word probability distribution. The word 'the' is one of the most commonly occurring words used in written and spoken communications, 'analysis' less so. By comparing normal text to steganographic text, you can make assumptions as to whether or not text contains a hidden message.

The text that the message is hidden IN is called the cover text. It might be something like a visit to a local museum, and then the AI will alter that text to inject your secret message. You can then send the altered message and the recipient can re-process it and extract your secret message.

Now, here's the interesting bit. By using AI, the difference in probability distributions can be reduced to zero. So an enemy - a censor, a hostile state actor, whatever - cannot accurately say that any given message contains stenographic text!

Word probability doesn't tell you what the hidden message is, just the likelihood of whether or not there is a hidden message there, which may mean an increased likelihood of a person or group coming under tighter scrutiny.

The problem that I see with this is they're talking about a "plug-in for an app like WhatsApp or Signal would do the heavy algorithmic lifting". I'm a little confused at this point. If they need to match the probability distribution of the cover text with the PD of the secret message, and it's done by an AI which is a supercomputer or a computer cluster, will you be able to do that with just a plugin on a smart phone? I'd like to see some more solid proof of concept here rather than 'our math models demonstrate' sort of stuff before human rights workers in bad places put themselves at risk with stuff like this.

https://www.quantamagazine.org/secret-messages-can-hide-in-ai-generated-media-20230518/

Threaded | Flat

Dear James Bond,
We have a job for you. LOL!!!!!!!!!!!!!!!!
Hugs, Jon

>> Steganography is an interesting art. It's not cryptography as the technically the text is plainly available - if you know how to read it. <<

If you want to be really secure then you stack your obfuscations. For example, first you put your message into a rare language. Then you put that into code. If you choose something like Morse code, it doesn't even look like a message unless someone knows exactly what to look for. Then you print that encoded message with invisible ink or steganography or whatnot. Then you insert that page somewhere secret, like inside the pasteboard cover of a book. And if you were a complete bastard about it, the original message was all kennings, archaic idioms, or references to memories that only you and your intended recipient shared in the first place.

Even if your enemies suspect there is a message somewhere, they will have one hell of a time locating it and then slogging through all the layers of protection, and honestly by the time they do that, it's probably outdated.

One challenge I've wondered about is embedded messages that have some defense against people trying to block them, e.g., by corrupting plenty of "low" bits in images or audio knowing that it won't perceptibly change how they seem. Of course, if you keep the messages short, it's statistically easier to still pull a damaged message out of the noise. though even then there are questions like if it's worth lengthening them by using an error-correcting code.

It seems like LLMs make it easier to hide things in them, because they are already doing things according to machine patterns, and we tend to use machines to try and analyze large corpuses of material for their irregularities. Human codebreakers might be able to find it, but at that point, it's done the job of being too expensive to crack.

Using AI to hide secret messages via Steganography

no subject

Well ...

Re: Well ...

Re: Well ...

no subject

(no subject)

no subject