More

cheeze · 2026-02-24T03:36:24 1771904184

I'm curious why? I do this all the time. Saves me time and lets me pull information quickly.

I'm not running it in a container that has access to my local filesystem or anything...

bandrami · 2026-02-24T04:54:11 1771908851

If it has no access to your filesystem or network services that's better, but you're still giving input from an unknown party to an interpreter, with the extra bonus of that interpreter being non-deterministic by design.

But then again people today will also pipe curl to bash, so I may have lost this battle a while ago...

antisol · 2026-02-24T07:07:56 1771916876

  > But then again people today will also pipe curl to bash

OMG! I'm not alone! Thank you!

munchler · 2026-02-24T05:16:06 1771910166

> "Hey Claude, summarize, this document I downloaded from the Internet"

I think you've created confusion with this example due to its ambiguity. Let's be clear about the difference between a chatbot and an agent: Asking a chatbot (e.g. vanilla Claude) to summarize an unknown document is not risky, since all it can do is generate text. Asking an agent (e.g. Claude Code) to summarize an unknown document could indeed be risky for the reason you state.

esseph · 2026-02-24T06:10:40 1771913440

> Asking a chatbot (e.g. vanilla Claude) to summarize an unknown document is not risky, since all it can do is generate text.

Prompt injection in the document itself is a risk to the LLM/You.

astrange · 2026-02-24T05:23:12 1771910592

Claude has tools and might be connected to your Gmail etc. Usually sandboxed.

esafak · 2026-02-24T14:07:44 1771942064

You would not summarize it deterministically yourself.

cheeze · 2026-02-08T03:33:49 1770521629

> I’d agree for a home computer Linux or macOS are the only sane choices now.

Unless you care about gaming at all. Sure you have the Linux evangelists who talk about how much better support has gotten (it has!) but there are still huge glaring holes.

I run MacOS for everything except gaming. I'm not even that big of a gamer but it's the only sane option there.

sophrosyne42 · 2026-02-08T04:14:27 1770524067

For quite a few years, it has gone from "unless you care about gaming at all" to "unless you care about an extremely specific type of game". You don't have to be an evangelist to see the value linux has for gaming now.

subscribed · 2026-02-08T13:16:57 1770556617

"at all"? No. If you don't have nvidia card or don't play online games with borderline malware-behaving anti cheats, Bazzite solves it all.

For Nvidia it's cachyos right now, apparently.

More to come.

cheeze · 2026-02-06T20:13:25 1770408805

> There would also be a requirement for all playback to actually properly check the private keys

I don't think that's true. Only for someone who wanted to prove authenticity to grab the signature. No private keys would be exposed (except those which were hacked.)

If Netflix and Amazon can't keep their 4k HDR webrips from being leaked (supposedly via extracted licenses from Nvidia Shields), I have no idea how we'd expect all camera manufacturers to do it. Maybe iPhones and flagship Apple devices, but even then we'd find vulns in older devices over time.

munk-a · 2026-02-06T20:34:17 1770410057

I was thinking more about the spread of disinformation at large - but yea, that playback requirement would only be necessary for anything that wanted to be considered a potential source and trying to protect against disinformation platforms is a much larger problem then technology can solve on its own.

cheeze · 2026-02-06T20:11:47 1770408707

How do you find the camera?

"I extracted and added the noise profile to the AI generated video with a goPro to make it look legitimate"

cheeze · 2026-02-06T20:10:44 1770408644

I don't think it was AI either but I don't think that would hold up in court.

cheeze · 2026-02-05T20:45:22 1770324322

It's that Austin Powers clip of the guy slowly getting smooshed by the steam roller.

cheeze · 2026-01-28T07:25:28 1769585128

Still kind of pointless though. Someone has to check ID and im WA state, open the beers for you.

Kind of defeated the purpose of just walk out. Since I couldn't... Just walk out.

cheeze · 2026-01-27T01:12:13 1769476333

FAANG here (service oriented arch, distributed systems) and id say probably 20+ percent of code written on my team is by an LLM. it's great for frontends, works well with test generation, or following an existing paradigm.

I think a lot of people wrote it off initially as it was low quality. But gemini 3 pro or sonnet 4.5 saves me a ton of time at work these days.

Perfect? Absolutely not. Good enough for tons of run of the mill boilerplate tasks? Without question.

zx8080 · 2026-01-27T01:37:51 1769477871

> probably 20+ percent of code written on my team is by an LLM. it's great for frontends

Frontend has always been shitshow since JS dynamic web UIs invented. With it and CSS no one cares what runs page and how many Mb it takes to show one button.

But regarding the backend, the vibecoding still rare, and we are still lucky it is like that, and there was no train crush because of it. Yet.

llbbdd · 2026-01-27T03:49:36 1769485776

Backend has always been easier than frontend. AI has made backend absolutely trivial, the code only has to work on one type of machine in one environment. If you think it's rare or will remain rare you're just not being exposed to it, because it's on the backend.

bopbopbop7 · 2026-01-27T03:58:39 1769486319

Might be a surprise to you, but some backends are more than just a Nextjs endpoint that calls a database.

llbbdd · 2026-01-27T08:24:58 1769502298

No surprise at all and I'd challenge you to find any backend task that LLMs don't improve working on as much they do frontend. And ignoring that the parent comment here is just ignorant since they're talking about the web like it's still 2002. I've worked professionally at every possible layer here and unless you are literally at the leading edge, SOTA, laying track as you go, backend is dramatically easier than anything that has to run in front of users. You can tolerate latency, delays and failures on the backend that real users will riot about if it happens in front of them. The frontend performance envelope starts where the backend leaves off. It does not matter in the slightest how fast your cluster of beefy identical colocated machines does anything at all if it takes more than 100ms to do anything that the user directly cares about, on their shitty browser on a shitty machine on tethered to their phone in the mountains, and the difference is trivially measurable by people who don't work in our field, so the bar is higher.

ivantop · 2026-01-27T05:28:35 1769491715

Honestly, I am also at a faang working on a tier 0 distributed system in infra and the amount of AI generated code that is shipped on this service is probably like 40%+ at this point.

llbbdd · 2026-01-27T08:26:13 1769502373

I'm not surprised at all here, last time I worked in a FAANG there was an enormous amount of boilerplate (e.g. Spring), and it almost makes me weep for lost time to think how easy some of that would be now.

ivantop · 2026-01-29T22:57:59 1769727479

It’s not just boilerplate. This is a low level C++ service where latency and performance is critical (don’t want to get into too much detail since I’ll dox myself). I used to think the same thing as you: “Surely my job is safe because this system is very complex”. I used to think this would just replace front end engineers who write boilerplate react code. 95% of our codebase is not boilerplate. AI has found optimizations in how we store items, AI has alerted us to production issues (with some degree of accuracy, of course). I worry that traditional software engineering as we know it will disappear and these hybrid AI jobs will be what’s left.

halfcat · 2026-01-27T02:19:44 1769480384

I think you’re onto something. Frontend tends to not actually solve problems, rather it’s mostly hiding and showing parts of a page. Sometimes frontend makes something possible that wasn’t possible before, and sometimes the frontend is the product, but usually the frontend is an optimization that makes something more efficient, and the problem is being solved on the backend.

It’s been interesting to observe when people rave about AI or want to show you the thing they built, to stop and notice what’s at stake. I’m finding more and more, the more manic someone comes across about AI, the lower the stakes of whatever they made.

llbbdd · 2026-01-27T03:47:51 1769485671

Spoken like someone deeply unfamiliar with the problem domain since like 2005, sorry. It's an entirely different class of problems on the front end, most of them dealing with making users happy and comfortable, which is much more challenging than any of the rote byte pushing happening on the backend nowadays.

Applejinx · 2026-01-27T13:33:06 1769520786

Is it, though? That sounds very subjective, and from what I can tell 'enshittification' is a popular user term for the result, so I'm not sure it's going that great.

llbbdd · 2026-01-27T22:52:18 1769554338

If you search Google Trends for enshittification, half the results contain Doctorow as well [0]. Normal people have no idea who that is. And that's just Google, which everyone on HN hates to the point of vibrating angrily because there isn't an obvious part of the name to replace derogatorily with a dollar sign. Nobody uses this term outside of Hacker News, and even on HN it's code for "this site doesn't work when I disable Javascript", which is not a real requirement real customers have.

User experience does involve a lot of subjectivity [1] and that's part of what makes it hard. You have to satisfy the computer and the person in front of it, and their wants are often at odds with each other. You have to make them both happy at 60 FPS minimum.

[0] https://trends.google.com/explore?q=enshittification&date=al...

[1] https://emsh.cat/good-taste/

8organicbits · 2026-01-27T01:30:09 1769477409

As someone currently outside FAANG, can you point to where that added productivity is going? Is any of it customer visible?

Looking at the quality crisis at Microsoft, between GitHub reliability and broken Windows updates, I fear LLMs are hurting them.

I totally see how LLMs make you feel more productive, but I don't think I'm seeing end customer visible benefits.

mediaman · 2026-01-27T01:36:25 1769477785

I think much of the rot in FAANG is more organizational than about LLMs. They got a lot bigger, headcount-wise, in 2020-2023.

Ultimately I doubt LLMs have much of an impact on code quality either way compared to the increased coordination costs, increased politics, and the increase of new commercial objectives (generating ads and services revenue in new places). None of those things are good for product quality.

That also probably means that LLMs aren't going to make this better, if the problem is organizational and commercial in the first place.

asadotzler · 2026-01-27T18:58:38 1769540318

Does great for front ends mean considerate A11Y? In the projects I've looked over, that's almost never the case and the A11Y implementation is hardly worthy of being called prototype, much less production. Mock up seems to be the best label. I'll bet you think because the surface looks right that runs down to the roots so you call it good at front ends. This is the problem with LLMs, they do not do the hard work and they teach people that the hard work they cannot do is fine left undone or partially done and the more people "program" like this the worse the situation gets for real human beings trying to live in a world dominated by software.

simonw · 2026-01-27T19:58:22 1769543902

It turns out if you tell a coding agent "make it accessible" you'll get better results than you would from most professional front-end developers.

I'm not satisfied yet: I want coding agents to be able to actively test on screen readers as part of their iteration loop.

I've not found a system that can do that well yet out of the box, but GuidePup is very promising: https://github.com/guidepup/guidepup

cheeze · 2026-01-12T23:05:14 1768259114

Does this... help with chess?

Nition · 2026-01-12T23:59:45 1768262385

I think that might have been the joke.

fragmede · 2026-01-12T23:45:12 1768261512

you can see the possible moves they're thinking of making

DaanDL · 2026-01-13T13:07:37 1768309657

You can look into their minds? Don't think drawing arrows is a thing on the air line chess apps.

4d4m · 2026-01-13T01:24:29 1768267469

Super fun

cheeze · 2026-01-09T18:18:57 1767982737

Can you share more on this?