This piece is missing the most important reason OpenClaw is dangerous: LLMs are ...

cedws · 2026-02-18T23:06:39 1771455999

It feels like everyone is just collectively ignoring this. LLMs are way less useful when you have to carefully review and approve every action it wants to take, and even that’s vulnerable to review exhaustion and human error. But giving LLMs unrestricted access to a bunch of stuff via MCP servers and praying nothing goes wrong is extremely dangerous.

All it takes is a tiny snippet from any source to poison the context and then an attacker has remote code execution AND can leverage the LLM itself to figure out how best to exfiltrate and cause the most damage. We are in a security nightmare and everyone is asleep. Claude Code isn’t even sandboxed by default for christ sakes, that’s the least it could do!

niyikiza · 2026-02-19T01:02:29 1771462949

Right on. Human-in-the-loop doesn't scale at agent speed. Sandboxing constrains tool execution environments, but says nothing about which actions an agent is authorized to take. That gets even worse once agents start delegating to other agents.I've been building a capability-based authz solution: task-scoped permissions that can only narrow through delegation, cryptographically enforced, offline verification. MIT/Apache2.0, Rust Core. https://github.com/tenuo-ai/tenuo

amelius · 2026-02-18T20:04:57 1771445097

Yeah, if a software engineer came up with such vulnerable idea, they would be fired instantly.

Wait a second, LLMs are the product of software engineers.

Legend2440 · 2026-02-18T20:25:54 1771446354

This is just the price of being on the bleeding edge.

Unfortunately, prompt injection does strongly limit what you can safely use LLMs for. But people are willing to accept the limitations because they do a lot of really awesome things that can't be done any other way.

They will figure out a solution to prompt injection eventually, probably by training LLMs in a way that separates instructions and data.

cherioo · 2026-02-18T20:53:17 1771447997

It’s like money laundering, but now responsibility laundering.

Anthropic released Claude saying “hey be careful. But now that enables the masses to build OpenClaw and go “hold my bear”. Now the masses people using OpenClaw had no idea what responsibility they should hold.

I think eventually we will have laws like “you are responsible for your AI’s work”. Much like how driver is (often) responsible for car crashes, not the car companies.

koakuma-chan · 2026-02-18T20:10:25 1771445425

Would they? I don't think anyone cares about security.

jstummbillig · 2026-02-18T20:22:09 1771446129

People have the grandest ideas about standards in software engineering since right about ai started dabbling in software engineering. It's uncanny.

tstrimple · 2026-02-19T03:30:10 1771471810

It seems like folks have forgotten again what the S in IOT stands for. This shit has been terrible for... well since always really.

theahura · 2026-02-18T20:24:58 1771446298

Hey, author here. I don't think that the security vulns are the most important reason OC is dangerous. Security vulnerabilities are bad but the blast radius is limited to the person who gets pwnd. By comparison, OpenClaw has demonstrated potential to really hurt _other_ people, and it is not hard to see how it could do so en masse.

enraged_camel · 2026-02-18T20:32:32 1771446752

>> Security vulnerabilities are bad but the blast radius is limited to the person who gets pwnd

No? Via prompt injection an attacker can gain access to the entire machine, which can have things like credentials to company systems (e.g. env variables). They can also learn private details about the victim’s friends and family and use those as part of a wider phishing campaign. There are dozens of similar scenarios where the blast radius reaches well beyond the victim.

pizlonator · 2026-02-18T20:45:26 1771447526

Agree with author - it's especially scary that even without getting hacked, openclaw did something harmful

That's not to say that prompt injection isn't also scary. It's just that software getting hacked by bad actors has always been a thing. Software doing something scary when no human did anything malicious is worse.

sejje · 2026-02-18T22:05:59 1771452359

No? Because I wouldn't give it access to those things. I wouldn't let it loose on my personal PC.

If I store my wallet on the sidewalk, that would probably be a problem. So I won't.

A prompt injection could exfiltrate an LLM API key, and some ai-generated code.

enraged_camel · 2026-02-19T05:17:20 1771478240

>> No? Because I wouldn't give it access to those things.

Not everyone is like that. In fact, OpenClaw's true "power" is unlocked when the user gives it full access. That's what the overwhelming majority of hype is coming from. Most people who actually get a lot of value out of it don't run it on e.g. docker containers on VPSs that can only be accessed via Tailscale + SSH.

simonw · 2026-02-18T21:40:10 1771450810

I think there is a much higher risk of it hurting the people are using it directly, especially once bad people realize how vulnerable they are.

Not to mention a bad person who takes control of a network of OpenClaw instances via their insecurities can do the other bad things you are describing at a much greater scale.