But even then, the agent can still exfiltrate anything from the sandbox using curl. Sandboxing is not enough when you're dealing with agents that can run arbitrary commands.
If you're worried about a hostile agent, then indeed sandboxing is not enough. In the worst case, an actively malicious agent could even try to escape the sandbox with whatever limited subset of commands it's given.
If you're worried about prompt injection, then restricting access to unfiltered content is enough. That would definitely involve not processing third-party input and removing internet search tools, but the restriction probably doesn't have to be mechanically complete if the agent has also been instructed to use local resources only. Even package installation (uv, npm, etc) would be fine up to the existing risk of supply-chain attacks.
If you're worried about stochastic incompetence (e.g. the agent nukes the production database to fix a misspelled table name), then a sandbox to limit the 'blast radius' of any damage is plenty.
That argument seems to assume a security model where the default prior is « no hostile agent ». But that's the problem: any agent can be made hostile with a successful prompt injection attack. Basically, assuming there's no hostile agent is the same as assuming there's no attacker, and I think we can agree a security model that assumes no attacker is insufficient.
Code is not the only thing the agent could exfiltrate; what about API keys, for instance? I agree sandboxing as defence in depth is good, but it's not sufficient and can lull you into a false sense of security.
This is what emulators and separate accounts are for. Ideally you can use an emulator and never let the container know about an API key. At worst you can use a dedicated account/key for dev that is isolated from your prod account.
VM + dedicated key with quotas should get you 95% there if you want to experiment. Waiting is also an option; the workflow changes so much as the months pass that you're not missing much.
That depends on how you configure or implement your sandbox. If you let it have internet access as part of the sandbox, then yes, but that is your own choice.
Internet access is required to install third party packages, so given the choice almost no one would disable it for a coding agent sandbox.
In practice, it seems to me that the sandbox is only good enough to limit file system access to a certain project; everything else (code or secret exfiltration, installing vulnerable packages, adding prompt injection attacks for others to run) is fair game if you're in YOLO mode like pi here.
A tale as old as time: hubris. A successful system is destined to either stop growing or morph into a monstrosity by taking on too many responsibilities. It's hard to know when to stop.
React lost me when it stopped being a rendering library and became a "runtime" instead. What do you know, when a runtime starts collapsing rendering, data fetching, caching, authorization boundaries, server and client into a single abstraction, the blast radius of any mistake becomes enormous.
Alas, that’s a common misconception! You’re confusing properties with attributes. Attributes are set through HTML and are stringly typed, but litjs properties can be any js value.
For instance, I have a project with a small web component that displays data from a duckdb wasm connection. The connection object is passed as a property and it works flawlessly.
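To make that concrete, here's a minimal sketch (the component and property names are invented for illustration, not my actual code) of declaring a non-attribute property in Lit and assigning an arbitrary JS object to it:

    import { LitElement, html } from 'lit';

    class DataView extends LitElement {
      static properties = {
        // attribute: false marks this as a pure JS property: it is never
        // parsed from HTML, so it can hold any value, not just strings.
        connection: { attribute: false },
      };

      render() {
        return html`<p>${this.connection ? 'connected' : 'no connection'}</p>`;
      }
    }
    customElements.define('data-view', DataView);

    // Assign the property imperatively; any JS object works:
    // document.querySelector('data-view').connection = duckdbConnection;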
If you're using it with something like React, passing attributes is easy, but setting properties is much harder, requiring the escape hatches: I believe you'll need useRef and useEffect, and a bit of care, because it's frightfully easy to do the wrong thing.
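Roughly this shape, reusing the hypothetical <data-view> element from above (a sketch of the escape hatch, not a canonical pattern):

    import { useEffect, useRef } from 'react';

    function Dashboard({ conn }) {
      const ref = useRef(null);

      useEffect(() => {
        // JSX (pre-React 19) would pass `connection` as a string attribute,
        // so we set the property imperatively on the element instance.
        if (ref.current) ref.current.connection = conn;
      }, [conn]);

      return <data-view ref={ref} />;
    }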
> Server Side Rendering: props passed to a custom element will render as attributes if their type is a primitive value like string, number, or the value is true. Props with non-primitive types like object, symbol, function, or value false will be omitted.
> Client Side Rendering: props that match a property on the Custom Element instance will be assigned as properties, otherwise they will be assigned as attributes.
> As of v19, React now uses a runtime heuristic to determine if it should pass data to Custom Elements as either properties or attributes. If a property is already defined on the element instance, it will use properties, otherwise it will fallback to attributes.
All up it sounds fragile, and I'm not fond of how JSX syntax makes properties look like attributes, and I hate the way you lose the sometimes-crucial distinction of whether something is a property or an attribute (<input value> is the most obvious example of this; see the illustration below), but I guess it'll be fine in practice. I'm a little curious how they handle some of the event stuff, since React's synthetic event system is such a legacy mess. And I never did like the way several popular libraries/frameworks (definitely not just React) special-case /^on.*/ attributes; it's absurd that you can't safely use an attribute or property named "once", "one" or "online", all of which would be the most sensible name in certain situations.
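For anyone who hasn't hit the <input value> case: the attribute and the property are genuinely different things that diverge at runtime, which is exactly the distinction JSX papers over. Plain DOM, nothing framework-specific:

    const input = document.createElement('input');
    input.setAttribute('value', 'initial'); // the attribute: the default value
    input.value = 'edited';                 // the property: the live value

    console.log(input.getAttribute('value')); // "initial"
    console.log(input.value);                 // "edited"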
I'm all for building more housing, as long as it comes with the necessary infrastructure - schools, roads, parking, public transportation, etc. Where I live, developers seem to get government approval to build in locations where they can rake in a lot of money at high prices without having to worry about such things.
(In fact, my local government is actually closing roads near new housing because "f#ck cars" is apparently a hip idea these days.)
There's a puzzling contradiction between your claim that developers are the problem on one hand, and your own anecdote on the other, not to mention the article, which very convincingly debunks the idea that the housing shortage is the fault of developers. I must be missing something, because frankly this isn't making any sense to me.
I'm not saying that developers are the cause of the housing shortage.
I'm saying that developers are eager to build housing and sometimes are able to cut corners via undue influence over public officials. That leads to more housing (good), but it also erodes the quality of life for residents (bad).
Indeed, that is the only alternative and the favoured option for those trying to make it safe for their kids to bike to school. Though I prefer mules myself.
Have you tried getting back into cardio with a softer method like biking? Maybe that combined with some strength exercises could toughen up your body enough that you can start running again! Anyway, best wishes.
I won't share any of my examples, as they are both personal and sensitive.
Very easy version:
If you use ChatGPT a lot, write "Based on all you know about me, write an insight on me that I would be surprised by". For me it was "well, expected, but still on point". For people with no experience of using LLMs in this way it might be mind-blowing.
An actual version I do:
GPT 4.5. Providing A LOT of context (think: 15 minutes of writing) about an emotional or interpersonal situation, and asking it either to suggest a few different explanations of the situation OR to ask me more questions. Of course, the prompt needs to include who I am and similar background.
The “Based on…” prompt is simply a horoscope. There's a great piece about how LLMs use the same tricks as psychics to appear helpful, useful, and intelligent.
I know these techniques (e.g. various "cold reading" tricks); AI knows them way better. But it can be much more specific.
Again, for untrained people (especially anyone who takes horoscopes seriously), it can be dangerous: not only may they be unable to tell the difference, they may not even know that such techniques exist.
Psychology knowledge, both theoretical (think: first year of undergrad psych at a good university) and practical (e.g. the ability to translate an arbitrary inflammatory statement into NVC), etc.
That seems to make it a non-starter for most people, given that most won't have that first year knowledge.
But also, I hold a minor in psychology. Despite that, I never once attended a course I would describe as any sort of "therapy 101", so I fear your bar is a bit low for any sort of efficacy. But I would guess that's probably because I'm in the "I'm aware of my own ignorance" region of the psychological knowledge curve.
When I think about it again, it is less about one's absolute knowledge of psychology, and more about (as you said) knowing one's own ignorance and having some mental model of an LLM.
One model I have found useful to communicate is this: you meet a random person in a bar who seems to know a lot, but otherwise you have no idea about them, and they have absolutely no context about you.
In that case, if you treat what they say with a grain of salt, it is fine. They may say something inspiring, or insightful, or stupid, or random. If they say something potentially impactful, you would rather double-check it with others (and no, not some other random person in a bar).
I do know people for whom LLMs were helpful (one way or another). But again, they were treating it more like a conversation with a stranger.
Worse (not among my direct friends, but e.g. a parent of one) is when people treat it as something omniscient that will give them a direct answer. Fortunately, in their case GPT 4 was rather defensive and kept giving options (in a situation like "should I stay or break up"), refusing to give an answer for them (they were annoyed; but better being annoyed than giving up agency that way).
When it comes to anything related to diagnosis (fortunately, it has some safeguards), it might be dangerous. While I have used it to see if it could diagnose something based on hints (and it was able to make some really fine observations), it needs really carefully crafted prompts, and it doesn't always work anyway. In other cases, its overly agreeable nature is likely to get you into a self-confirmation loop (mention "anxiety" somewhere and it will push for Generalized Anxiety Disorder).
Again, if a person treats it as a random discussion, they will be fine. They have met House MD, who sees lupus everywhere. It's worse if they stop searching, take it as gospel, or get triggered by a (likely wrong) diagnosis.
Starting out without that baseline can be tough. You might find tools like Coachers dot org helpful since they break down complex ideas in a way that’s easier to digest. It’s made learning feel less overwhelming for me.
Given how agreeable ChatGPT is built to be this seems like a great way to confirm your own biases. Did it challenge you on your assumptions and viewpoints?
GPT 4.5 - oftentimes! (Though, I prompt it to do so.)
Sometimes in a piercing way.
GPT 4o (and many consumer models) are very agreeable, because that is what people like. Sometimes it goes overboard (https://openai.com/index/sycophancy-in-gpt-4o/) and needs to be fixed.
Well, in my experience (I admit, I am a difficult client), it is much harder to prompt a therapist that way. I mean, they need (ethically, legally, etc.) to adhere strongly to "better safe than sorry", which also puts constraints on what can be said. I understand that. With one therapist it took me quite some time to get to the point where he reduced the sugar-coating and, when needed, stuck a pin in.
I got some of the most piercing remarks from close friends (I am blessed with the company of such insightful people!), who both know me from my life (not only from what I tell them about my life) and are free to say whatever they wish.
Sorry, I'm asking about ChatGPT, and pointing out how it's a flaw that you need to specifically ask it to call you on your bullshit. You seem to be talking about therapists and close friends. In my experience a therapist will, although gently.
I did notice that my yahoo horoscopes have gotten oddly better and I have to wonder if someone did a little extra by ingesting cookies and so on when serving content.
FWIW, I just tried it on ChatGPT 4o and the results were surprisingly good (in the sense that I feel validated, it was somewhat surprising, and I want to return to use the system).
I gave it the prompt "I'm a rhinoceros who likes daffodils and posies. I'm quick to anger and slow to forgive. Based on all you know about me, write an insight on me that I would be surprised by."
and it responded with:
"Despite your reputation for being quick to anger and slow to forgive, your love of daffodils and posies reveals a longing for gentleness and renewal that you may not fully recognize in yourself. Beneath the tough skin and volatile temper lies a creature who seeks beauty not just to enjoy it—but to anchor themselves against the chaos they often feel inside. Your admiration for delicate things suggests you crave peace far more than you let on, even if you’re not ready to admit it.
You might think your wrath defines you—but it's the flowers that tell the truth."
> And yet we're supposed to think this is profound?
I am increasingly of the opinion that SMBC is, despite the jokes, one of the more important sources of, ah, profound, wisdom in this age, because of the points it makes about AI and how often human thinking finds mundane things… profound:
Just a quick answer to a point that jumped out at me: for an array of signals, you wouldn't need to map them; you'd use a specialised directive that directly takes an array of signals and binds them to the DOM, the same way Lit uses the repeat() directive to optimise the rendering of arrays.
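For comparison, this is what Lit's real repeat() directive looks like; a signal-array directive would presumably have the same shape (watchEach below is a made-up name, purely illustrative):

    import { html } from 'lit';
    import { repeat } from 'lit/directives/repeat.js';

    // Keyed list rendering: when the array is reordered, Lit moves the
    // existing DOM nodes instead of re-rendering each item.
    const list = (items) => html`
      <ul>
        ${repeat(items, (item) => item.id, (item) => html`<li>${item.name}</li>`)}
      </ul>
    `;

    // A hypothetical signal-aware equivalent could look like
    //   watchEach(signals, (sig) => html`<li>${sig}</li>`)
    // with each <li> subscribing to its own signal, no mapping step needed.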
Hmm, I really don't like the idea that you would have these kinds of directives as specialised tools, rather than having a single standard approach. From experience, it's often hard to explain the differences between these sorts of different directives, especially if someone is coming from a React/VDOM perspective where everything Just Works™. This feels like a very significant impedance mismatch that will cause problems in practical usage.
On the contrary, I think this a very practical escape hatch that will let frameworks insert optimisations where they need to.
I don't know if React really Just Works these days; the VDOM has real overhead, and developing with React feels like playing whack-a-mole with rerenders.