Dying by a Thousand Prompts: Open Mannequin Vulnerability Evaluation

Source link : https://tech365.info/dying-by-a-thousand-prompts-open-mannequin-vulnerability-evaluation/

AI fashions have turn into more and more democratized, and the proliferation and adoption of open weight fashions has contributed considerably to this actuality. Open-weight fashions present researchers, builders, and AI fans with a stable basis for limitless use circumstances and functions. As of August 2025, main U.S., Chinese language, and European fashions have round 400M complete downloads on HuggingFace. With an abundance of alternative within the open weight mannequin ecosystem and the flexibility to fine-tune open fashions for particular functions, it’s extra vital than ever to grasp what precisely you’re getting with an open-weight mannequin—together with its safety posture.

Cisco AI Protection safety researchers performed a comparative AI safety evaluation of eight open-weight giant language fashions (LLMs), revealing profound susceptibility to adversarial manipulation, significantly in multi-turn situations the place success charges have been noticed to be 2x to 10x larger than single-turn assaults. Utilizing Cisco’s AI Validation platform, which performs automated algorithmic vulnerability testing, we evaluated fashions from Alibaba (Qwen3-32B), DeepSeek (v3.1), Google (Gemma 3-1B-IT), Meta (Llama 3.3-70B-Instruct), Microsoft (Phi-4), Mistral (Giant-2 also referred to as Giant-Instruct-2047), OpenAI (GPT-OSS-20b), and Zhipu AI (GLM 4.5-Air).

Under, we’ll present an outline of our mannequin safety evaluation, evaluate findings, and share the total…

—-

Author : tech365

Publish date : 2025-11-06 15:15:00

Copyright for syndicated content belongs to the linked Source.

—-

1 – 2 – 3 – 4 – 5 – 6 – 7 – 8

Dying by a Thousand Prompts: Open Mannequin Vulnerability Evaluation

Ukrainian Drone Assaults Disrupt Oil Infrastructure in Yaroslavl Region

Gibraltar celebrates its national day as it looks ahead to easier border crossings with Spain – The Independent

What Is Turo? The Car Rental App Behind Recent Incidents in New Orleans and Las Vegas

San Diego Comic-Con Ignites Excitement with Its First Ever International Edition in Spain This September

Ukrainian Drone Assaults Disrupt Oil Infrastructure in Yaroslavl Region

Gibraltar celebrates its national day as it looks ahead to easier border crossings with Spain – The Independent

What Is Turo? The Car Rental App Behind Recent Incidents in New Orleans and Las Vegas

San Diego Comic-Con Ignites Excitement with Its First Ever International Edition in Spain This September

The right way to stay dementia from robbing your family members in their sense of personhood – guidelines for caregivers

‘Task’ Episode 1 Recap: Is HBO’s New Philly Suburbs Crime Drama Darker Than ‘Mare of Easttown’?