Hacker Newsnew | past | comments | ask | show | jobs | submitlogin
X Engineering Year Retrospective (twitter.com/xeng)
63 points by sirobg on Oct 27, 2023 | hide | past | favorite | 36 comments


I'm inclined to believe that they consolidated their code across a lot of their microservices and simplified their architecture since that was stressed a lot from the moment Musk acquired Twitter. But we also can't really verify or disprove it.

I'm a bit confused about their so-called improvements to video recommendation quality and bot detection. I've seen a lot of sentiment from people that they see more bots, hate speech, and irrelevant content on their timelines. Maybe what I'm hearing is just anecdotal evidence or stories in a bubble?

The Sacramento data center migration to Portland is an entertaining story detailed here[1]. Here's the Hacker News thread on it[2].

They have a GPU supercompute cluster?? It seems like they have the capability to do training and inference with state-of-the-art algorithms at massive scales then. Why have Twitter's recommendations and ad revenue (even before the acquisition) been so poor then?

[1] https://www.cnbc.com/2023/09/11/elon-musk-moved-twitter-serv...

[2] https://news.ycombinator.com/item?id=37470110


> I'm a bit confused about their so-called improvements to video recommendation quality and bot detection.

Let's not dismiss the possibility that they're just lying. I don't think they're trustworthy enough to be given the benefit of the doubt.


They're just running most of the bots in-house on the GPU supercompute cluster now, and billing Putin for it directly, now that the middleman Yevgeny Prigozhin is out of the picture.


> I've seen a lot of sentiment from people that they see more bots, hate speech, and irrelevant content on their timelines. Maybe what I'm hearing is just anecdotal evidence or stories in a bubble?

1. Note that they do not unambiguously define bots; it's "bots and content scrapers" lumped together:

> Blocked bots and content scrapers at a rate +37% greater than 2022. On average, we prevent more than 1M bots signup attacks each day and we’ve reduced DM spam by 95%.

2. I suspect it's a matter of signal-to-noise. Yes, the absolute bot count could be down, but how does it compare to the number of humans using X/Twitter and the volume of content which they are contributing each day? I've tried to find reliable statistics on this, but to no avail. My anecdotal experience is that fewer people I've followed are still regularly using X/Twitter since the Musk acquisition. Some are over at Post.News, some at Threads, some have shifted to Mastodon. It's a very fragmented experience now.


Sometimes I click a link to twitter, get log-in-walled or see the head of a tweet thread that's missing the rest of the thread. I wonder if they measure that as a successful "scrape bot blocked"?


> that they see more bots

My go-to here are the scams advertising support for metamask problems. Those still appear every day and are trivial to identify. I'm not sure I consider Twitter having a bot detection success until trivial issues like that are solved. They're still below the level of "text match and auto ban" solutions.


70000 hard coded references to the data centre in Sacramento...

At Twitter scale, 700 quick hacks would be excusable, 7000 iffy, 70000 just makes the old engineering team look like bozos.

Maybe it's embellished?


>They have a GPU supercompute cluster?? It seems like they have the capability to do training and inference with state-of-the-art algorithms at massive scales then. Why have Twitter's recommendations and ad revenue (even before the acquisition) been so poor then?

In general, assuming you are in some sort of decent state better ML doesn't make your revenue 50% better. It makes it 5% better per year.


> I've seen a lot of sentiment from people that they see more bots, hate speech, and irrelevant content on their timelines. Maybe what I'm hearing is just anecdotal evidence or stories in a bubble?

TBF, this is my year-over-year experience on all social media platforms.


love how those comments in [2] have aged haha


"Shutdown the Sacramento data center and re-provisioned the 5,200 racks and 148,000 servers, which generated more than $100M in annual savings."

Yeah, about that.. https://www.cnbc.com/2023/09/11/elon-musk-moved-twitter-serv...

"Musk turned to his security guard and asked to borrow his pocket knife. Using it, he was able to lift one of the air vents in the floor, which allowed him to pry open the floor panels. He then crawled under the server floor himself, used the knife to jimmy open an electrical cabinet, pulled the server plugs, and waited to see what happened. Nothing exploded. The server was ready to be moved."


This remains a fascinating, Howard Roark-like tale. I mean, it worked.

I don't use Twixter at all. But it seems like he got rid of most of the engineering org without getting rid of most of the technical aspects of Twitter. It's not super stable, but it's not a smoldering wreck. If he hadn't scared off the advertisers with his personality and social behaviors, it might be financially in a much better place. The human side of content moderation was definitely going to cost more, I am steering away from the content impact on Twixter financials.


Roark dynamited his own creation because he had no other recourse and was prepared to face the consequences. Musk dynamited something he bought out of foolhardiness and is expending his energy avoiding consequences. Similar in some regard, but different.


> it worked

Yes, and it would probably work much of the time. The times when it doesn't work and you have to write off equipment worth more than the cost of a professional move, which shifts liability as well, is why more risk-averse companies don't do it.


lol, sure. Dude is so capable, he is the RL MacGyver. Also, people still think he's Iron Man - even though Iron Man was semi-rational.


> Among the changes we made was a shift of all media/blob artifacts out of the cloud, which reduced our overall cloud data storage size by 60%, and separately, we succeeded in reducing cloud data processing costs by 75%.

Huh. I feel like that's the one place to not leave cloud. 90% of why I want to use AWS is S3.


Cloud storage is expensive, I’m willing to bet that’s a lot of money saved at their scale.


Maintaining a massive scale infra with the same availability and durability as S3 is really expensive. It's possible but hard.


But do you need S3 durability for meme gifs that will be seen once and then just waste disk space for decades?


At least temporarily old images weren't available and some people cared: https://www.forbes.com/sites/mattnovak/2023/08/19/twitter-de...


There's a difference between "all images" and "one in 100 million images"

S3 durability is "for every ten million objects stored, you can expect to incur an average loss of a single object once every 10,000 years". They could drop the durability by probably 5 9's and nobody would even notice.


Maybe not, but you could switch to single-zone S3 in that case and cut your costs down ~70%.

Migrating storage off of cloud is just really hard.


That's why Musk is forward thinking, leading the charge for us techies to follow.


Lol, 'forward thinking' by moving one of the most expensive cloud costs internal in an app that clearly doesn't need that storage near their other cloud assets and operates at a scale few other services do? Woooow, he's sooooo smart. Nobody else has thought of this


I hope their Trust & Security team has a roadmap that prioritizes topic and thread-jacking. The advent of GPT and their recent changes to creator compensation has created a massive drive for engagement by any means; the results are fairly predictable. Clicking on any trending topic usually surfaces completely unrelated videos with that topic's hashtags (and all of the other trending topic hashtags) appended to the tweet.

Then you have the "content creators" who use GPT to summarize or add details to a post from a larger content aggregator in hopes of bandwagoning engagement. I see a lot of this type of behavior from popular History-focused accounts and the mega-accounts that post engagement bait content (think canned, "desert island"-type questions and polls). It's less malicious but certainly reinforces cynicism towards the state of Twitter/X and the broader social web.


I've seen this in other places I've worked. Too many people and too much money to spend can be a lot worse than too few and not enough.


Yes, twitter is doing so well currently as a business


It's garbage, yes, but all these companies overhired for years and it was a no-brainer that they all needed to fix that. Hard to criticize that decision in principle. Though the execution of it seemed like a clown show.


If your main focus is growth or potential growth then inefficiencies are fine if they allow increased product velocity. A unified platform for N features is more efficient but communication overheads will cut velocity on improvements for all those N features. Of course if growth is no longer a focus that’s a different story.


Yea, I don’t know anything about cars but I’m pretty sure all these mechanics could work naked. Praise me for saving uniform cost.


"Shrank site traffic by 80%, yielding cost savings and cratering the company as a whole"

I hope everyone there is getting paid in advance for their work...


I’m curious if you have a source for that? There has been lots of speculation on how much X traffic has reduced, but the best data I can find basically says that it might’ve dropped from the 2nd most visited website to the 5th most visited website in the world (according to SimilarWeb).


The numbers I see are declines in traffic of 15-20% with ad revenue declines of more than 50%.

https://variety.com/2023/digital/news/musk-twitter-x-acquisi...

I think for many sites this would be pretty horrible. I feel like the bar for X is that it still runs.


They did this by shipping so many bugs (that I’ve seen) that the feeds only load 20% of the time.

Saves a ton of bandwidth i bet.



That project must be so much fun to work on. I'd pay just to shadow it.




Consider applying for YC's Summer 2026 batch! Applications are open till May 4

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: