More

spenczar5 · 2026-03-14T03:56:04 1773460564

Sure, llms.txt is a convention for this.

Compare https://docs.firetiger.com with https://docs.firetiger.com/llms.txt and https://docs.firetiger.com/llms-full.txt for a realy example.

ghiculescu · 2026-03-14T03:58:20 1773460700

Why does the article say that’s useless?

zeeg · 2026-03-14T04:50:44 1773463844

It’s not useful if it’s never read by agents - that’s the premise of the statement.

bensyverson · 2026-03-14T11:42:57 1773488577

But will agents know to send a "Accept: text/markdown" header?

zeeg · 2026-03-15T06:55:49 1773557749

Did you consider looking to see what they actually already do? There's a reason this works.

spenczar5 · 2026-03-12T17:34:46 1773336886

That's a pretty interesting idea! I guess 160+ is sort of doing some of that for us - it compiles to SQL WHERE clauses, right - but generally, we found good results giving it a SQL dialect directly.

I think some of the reason is that there's so much coverage of writing SQL in its training set.

hwernetti · 2026-03-13T15:07:10 1773414430

Good point, that makes a lot of sense to use a tool that has plenty of sample usage data available.

spenczar5 · 2026-03-12T15:52:13 1773330733

Yes! This works really well from Sonnet 4.5 onwards, in our experience. Sonnet 4.0 was a little rocky - we had to give it tons of documentation - but by now it works without much effort.

One thing that works very well is just giving it one or two example valid programs/statements in the custom language. It usually picks up what you're getting at very quickly.

When it slips up, you get good signal you can capture for improving the language. If you're doing things in a standard agent-y loop, a good error message also helps it course-correct.

mmmehulll · 2026-03-12T16:02:37 1773331357

That’s really interesting. The “one or two examples + good error messages” part feels especially important. It suggests the limiting factor may be less finetuning and more whether the model is given a tight representation and a feedback loop it can recover from.

spenczar5 · 2026-03-12T15:33:02 1773329582

Author here! I am pretty jazzed about these ideas and happy to dig into more detail than a blog post allows.

spenczar5 · 2026-03-10T18:43:17 1773168197

Is this a clone of the Google AIPs? Like https://aep.dev/160/ seems to just copy https://google.aip.dev/160.

rambleraptor · 2026-03-10T20:35:53 1773174953

The AEPs were originally based off Google AIPs, but we did a hard fork and have altered a lot since then. For one thing, the AIPs were entirely protobuf focused, while we're focusing equally on protobuf + OpenAPI.

The CRUD methods are great examples where we deviate from the AIPs.

spenczar5 · 2026-02-25T18:29:03 1772044143

"Cheap" how? I have a friend who works on Seattle's bus planning. Removing a stop is a _lot_ of political work. When an elderly person depends on that bus stop being within a block so they can get to their doctor, and you're proposing to move it six blocks further away, that's essentially a _political_ cost.

It might better in the system throughput, and those benefits may even outweigh the misery put on that one person. But in the US, we largely sort that out by using cool-down times, hearings, and "community input."

Net result, according to my friend at least, is that bus stops feel _very_ sticky and hard to change.

nickorlow · 2026-02-25T18:32:17 1772044337

I think the article means 'cheap' as in it doesn't really require any new/expensive infrastructure and could theoretically be done overnight.

Though, as you mention it's a big political ask (which is unfortunate).

spenczar5 · 2026-02-10T21:04:26 1770757466

Its unexported for that reason. You only change it in tests.

spenczar5 · 2026-02-03T03:31:10 1770089470

It's in the article that you're commenting on, https://www.spacex.com/updates#xai-joins-spacex.

titzer · 2026-02-03T05:21:30 1770096090

Oh, ffs.

spikels · 2026-02-03T05:49:12 1770097752

Haha. It's less than 1,000 words that would take less than 5 minutes to read.

I bet much less than half of the hundreds of HN commenters here bother to read it. Many are clearly unfamiliar with its content.

titzer · 2026-02-03T05:58:39 1770098319

I can't, I don't want it in my head :/

spenczar5 · 2026-01-02T18:22:12 1767378132

I feel like I see an independent low-noise phone project like, every 3 months. Clearly there is some latent demand here. I wonder why the big players (Google, Apple, Samsung, HTC) haven't made a big-corp product for this market.

I am always reluctant to jump on with these independent ambitious projects. The first version is understandably rough, and the company seems to fold before they get to a second or third version.

But maybe advances in manufacturing in China are making high-quality, small-batch products like this more tractable?

jrmg · 2026-01-02T18:26:33 1767378393

I feel like I see an independent low-noise phone project like, every 3 months. Clearly there is some latent demand here.

I don’t know - it feels to me that this is evidence that there _isn’t_ sufficient demand to sustain a successful product like this.

altairprime · 2026-01-02T19:58:09 1767383889

Same reason Acura stopped making small cars like the Integra/RSX: costs scale more slowly than revenue as car size increases, so selling to the small car market segment results in unearned potential profits — even if the small car segment is a majority, it’s better to make a higher profit per unit on fewer unit sales if your most primary goal is to min/max labor/profit.

(Small phones, unlike small cars, also have costs in UI development to maintain their form factor’s OS support, which can create an additional pressure to withhold devices for a viable and profitable market.)

cptskippy · 2026-01-02T18:23:57 1767378237

> I wonder why the big players (Google, Apple, Samsung, HTC) haven't made a big-corp product for this market.

Because it impacts ARPU. It's really not that difficult, you're the product being sold.

rchaud · 2026-01-02T21:34:46 1767389686

Big corps were the ones to move away from Blackbery en masse towards a BYOD system. Before that, Samsung and Nokia both had a series of keyboard phones running Windows Mobile 6 or SymbianOS. I had the Samsung Blackjack II in 2008.

mystifyingpoi · 2026-01-02T19:26:47 1767382007

> Clearly there is some latent demand here

No, there demand is negligible. It's just typical hacker news people who want to suddenly become productive Silicon Valley trope hustle style, or people who want to change their damaging habits in a day, so instead of uninstalling TikTok which takes 15 seconds to do, they will spend money a separate device.

Although the keyboard may be useful.

spenczar5 · 2025-11-07T16:54:08 1762534448

"But accepting the full S3Client here ties UploadReport to an interface that’s too broad. A fake must implement all the methods just to satisfy it."

This isn't really true. Your mock inplementation can embed the interface, but only implement the one required method. Calling the unimplemented methods will panic, but that's not unreasonable for mocks.

That is:

    type mockS3 struct {
        S3Client
    }

    func (m mockS3) PutObject(...) {
        ...
    }

You don't have to implement all the other methods.

Defining a zillion interfaces, all the permutations of methods in use, makes it hard to cone up with good names, and thus hard to read.

skybrian · 2025-11-07T19:45:18 1762544718

While you can do that, having unused methods that don't work is a footgun. It's cleaner if they don't exist at all.

lenkite · 2025-11-08T07:29:21 1762586961

Not to mention, introducing all the permutations of methods as separate interfaces on the "consumer side" means extreme combinatorial explosion of interfaces. It is far better to judge the most common patterns and make single-method interfaces for these on the provider side.

Lots of such frequently-quoted Go "principles" are invalid and are regularly broken within the standard library and many popular Go projects. And if you point them out, you will be snootily advised by the Go gurus on /r/golang or even here on HN that every principle has exceptions. (Even if there are tens of thousands of such exceptions).

the_gipsy · 2025-11-07T17:22:27 1762536147

Is this pattern commonly used? Any drawbacks?

Sounds much better than the interface boilerplate if it's just for the sake of testing.

jgdxno · 2025-11-07T18:18:55 1762539535

At work we use it heavily. You don't really see "a zillion interfaces" after a while, only set of dependencies of a package which is easy to read, and easy to understand.

"makes it hard to cone up with good names" is not really a problem, if you have a `CreateRequest` method you name the interface `RequestCreator`. If you have a request CRUD interface, it's probably a `RequestRepository`.

The benefits outweigh the drawbacks 10 to one. The most rewarding thing about this pattern is how easy it is to split up large implementations, and _keep_ them small.

durbatuluk · 2025-11-07T19:37:38 1762544258

Any method you forget to overwrite from the embed struct gives a false "impression" you can call any method from mockS3. Most of time code inside test will be:

    // embedded S3Client not properly initialized
    mock := mockS3{}
    // somewhere inside the business logic
    s3.UploadReport(...) // surprise

Go is flexible, you can define a complete interface at producer and consumers still can use their own interface only with required methods if they want.