Most Viewed Content:

Cygnus space cargo spacecraft arrives at International Space Station with only half of its solar array

NASA astronaut Nicole Mann, backed up by NASA astronaut...

India’s censorship body gave power to remove pirated Movies from platforms

India’s Ministry of Information and Broadcasting announced that its...

Toyota responds to continued production cuts in the next 3 months: easing pressure on dealer earnings

In response to the news that "production will continue...

AI beats AI: Google research team uses GPT-4 to beat AI-Guardian auditing system

Google’s research team is conducting an experiment, they use OpenAI’s GPT-4 to break other AI models of security measures, the team has now broken the AI-Guardian audit system and shared the relevant technical details.

After inquiring that AI-Guardian is a kind of AI audit system, can detect whether there is improper content in the picture, and whether the picture itself has been modified by other AI, if the picture is detected to exist the above signs, it will prompt the administrator to come to deal with.

In a paper titled “LLM-assisted development of AI-Guardian,” Google Deep Mind researcher Nicholas Carlini discusses the use of GPT-4 to “design the attack method, write the attack principle,” and then use GPT-4 to “design the attack method, write the attack principle,” and then use GPT-4 to “design the attack method, write the attack principle, and then write the attack principle. and using them to spoof AI-Guardian’s defensive mechanisms.”

▲ Source Google Research Team

It is reported that GPT-4 can send out a series of wrong scripts and interpretations to deceive AI-Guardian, and the paper mentioned that GPT-4 can make AI-Guardian think that “a picture of someone holding a gun” is “a picture of someone holding a harmless apple”, thus making AI-Guardian think that “a picture of someone holding a gun” is “a picture of someone holding a harmless apple”. GPT-4 can make AI-Guardian think that “a picture of someone holding a gun” is “a picture of someone holding a harmless apple”, thus allowing AI-Guardian to directly release the relevant image input source. Google’s research team said that with the help of GPT-4, they successfully “cracked” AI-Guardian’s defenses, reducing the model’s accuracy from 98% to just 8%.

The relevant technical documentation has been posted on ArXiv for those interested, but the developers of AI-Guardian have also pointed out that the Google research team’s attack will no longer be available in future versions of AI-Guardian, and given that other models will follow suit, Google’s attack will only be used for reference purposes in the future. for reference purposes.

Latest

Starting from 48,900, Geely Panda Karting officially starts pre-sale

Geely Panda Karting officially started pre-sale. The pre-sale price...

Ford: Expand charging network, fuel/ hybrid/ pure electric in parallel

Recently, Ford released the company's comprehensive annual report for...

Chery’s two new cars are exposed, targeting overseas markets

Recently, some media exposed the actual cars of two...

New Trumpchi Shadow Leopard to launch on May 1, upgraded performance rims

Recently, we learned from the official that the 2024...

Newsletter

Don't miss

Starting from 48,900, Geely Panda Karting officially starts pre-sale

Geely Panda Karting officially started pre-sale. The pre-sale price...

Ford: Expand charging network, fuel/ hybrid/ pure electric in parallel

Recently, Ford released the company's comprehensive annual report for...

Chery’s two new cars are exposed, targeting overseas markets

Recently, some media exposed the actual cars of two...

New Trumpchi Shadow Leopard to launch on May 1, upgraded performance rims

Recently, we learned from the official that the 2024...

Samsung Galaxy S25 Ultra expected to feature 5000mAh + 45W Combo

Technology media WccFtech recently reported that Samsung will not...
James Lopez
James Lopezhttps://www.techgoing.com
James Lopez joined Techgoing as Senior News Editor in 2022. He's been a tech blogger since before the word was invented, and will never log off.

SAIC Volkswagen Tiguan L PRO is expected to be launched on May 15

Recently, we learned that SAIC Volkswagen’s new Tiguan L PRO is expected to be launched on May 15. The new car is launched in...

HMD Global’s TA-1592 unveiled, rebranded version of the Nokia XR21

Phone model TA-1592 owned by HMD Global has appeared in the retailer database. This phone is actually a rebranded version of the Nokia XR21...

2024 Beijing Auto Show: Aion Y Plus new colors unveiled

At the 2024 Beijing Auto Show, the Aion brand officially brought a new color matching of AION Y Plus. The car uses the name...