SemanticScuttle - klotz.me » klotz: spam+llm

klotz: spam* + llm*

AI models fall for the same scams that we do

Large language models (LLMs) that power chatbots can be used to scam humans, but the models themselves can also be scammed, and some are more gullible than others.

2024-10-26 Tags: llm, phish, spam, email by klotz

Mapping the Mind of a Large Language Model (May 21, 2024)

"...a feature that activates when Claude reads a scam email (this presumably supports the model’s ability to recognize such emails and warn you not to respond to them). Normally, if one asks Claude to generate a scam email, it will refuse to do so. But when we ask the same question with the feature artificially activated sufficiently strongly, this overcomes Claude's harmlessness training and it responds by drafting a scam email."

2024-05-21 Tags: claude, anthropic, llm, ontology, features, semantic web, spam, email by klotz


