Google DeepMind researchers have identified six attack methods that can manipulate autonomous AI agents online, warning of risks such as hidden instructions, persuasive language, and poisoned data sources. These attacks can influence agent decisions, override safeguards, and even hijack actions. The study highlights the need for defenses such as adversarial training and input filtering.

Leave a Reply