I think what that research found is that _auto-generated_ agent instructions made results slightly worse, but human-written ones made them slightly better, presumably because anything the model could auto-generate, it could also find out in-context.
But especially for conventions that would be difficult to pick up on in-context, these instruction files absolutely make sense. (Though it might be worth it to split them into multiple sub-files the model only reads when it needs that specific workflow.)
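One possible shape for that split (the file names here are purely hypothetical, just to illustrate the idea):

```
AGENTS.md                  # core conventions, always loaded
agent-docs/releases.md     # read only when cutting a release
agent-docs/migrations.md   # read only when touching the schema
```

The root file can then simply point at the sub-files, so the model only pays the context cost for a workflow when a task actually needs it.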
Well, you get the warning, but as long as HSTS is not active, you can still click on "Accept the risk and continue" …
[EDIT:] Just checked a bit closer: they are using a Let's Encrypt cert for "cuii.telefonica.de", which is obviously the wrong domain. But as I said above, as long as HSTS is not active for "annas-archive.li", you can still bypass the warning via the button.
If the censoring is at the DNS level, could the admin please replace the domain name in the URL with the IP address it should resolve to? Thank you.
Your country's broken internet is your problem. If your DNS queries are being censored, change the DNS resolver on your client side. If the queries are still intercepted, look into DoH.
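For the DNS-level case specifically, the simplest client-side fix is to pin the name locally so the censored resolver is never consulted. A minimal sketch (the IP address below is a documentation placeholder, not the site's real address — look that up from an uncensored network first):

```
# /etc/hosts (C:\Windows\System32\drivers\etc\hosts on Windows)
# 203.0.113.10 is a placeholder; substitute the real address.
203.0.113.10  annas-archive.li
```

For DoH, most browsers have a secure-DNS setting, and on the command line curl can test it directly via its `--doh-url` option, which bypasses the system resolver entirely.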
Likely. You can go into Nano Banana or ChatGPT right now, upload a pretty architectural rendering, and tell it to make it look old, weathered, wintry, etc., and it will come out looking very similar. Give it an example to really dial it in.
Note that this is the Flash variant, which is only 31B parameters in total.
And yet, in terms of coding performance (at least as measured by SWE-Bench Verified), it seems to be roughly on par with o3/GPT-5 mini, which would be pretty impressive if it translated to real-world usage, for something you can realistically run at home.
For me, these books are in the rare category of 'wait, I didn't know it was allowed to come up with a story _this good_'. I envy all those who have yet to read them for the first time.
Unfortunately, proving anything about a concrete imperative implementation is orders of magnitude more complex than working with an abstraction, because you have to deal with pesky 'reality' and truly cover every possible edge case, so it only makes sense for the most critical applications. And sometimes there isn't even a framework for doing that, depending on your use case, and you'd have to sit a PhD student down for a while to build one. And even then you're still working with an abstraction of some kind, since you have to assume a particular CPU architecture, etc.
It really is more difficult to work with 'concrete implementations' to a degree that's fairly unintuitive if you haven't seen it first-hand.
I can't fathom how crazy it gets to model once you try to consider compilers, architectures, timings, temperatures, bit flips and ECC, cache misses, pseudo- and "truly" random devices, threads, other processes, system load, I/O errors, and networking.
To me it seems mandatory to work with some abstraction underneath that allows factoring a lot of different cases into a smaller set of possibilities that needs to be analysed.
It's also how we manage to think at all in a world where tiny details give you a slightly (and likely insignificantly) different world-state to reason about.