X Engineering Year Retrospective

swimwiththebeat · on Oct 27, 2023

I'm inclined to believe that they consolidated their code across a lot of their microservices and simplified their architecture since that was stressed a lot from the moment Musk acquired Twitter. But we also can't really verify or disprove it.

I'm a bit confused about their so-called improvements to video recommendation quality and bot detection. I've seen a lot of sentiment from people that they see more bots, hate speech, and irrelevant content on their timelines. Maybe what I'm hearing is just anecdotal evidence or stories in a bubble?

The Sacramento data center migration to Portland is an entertaining story detailed here[1]. Here's the Hacker News thread on it[2].

They have a GPU supercompute cluster?? It seems like they have the capability to do training and inference with state-of-the-art algorithms at massive scales then. Why have Twitter's recommendations and ad revenue (even before the acquisition) been so poor then?

[1] https://www.cnbc.com/2023/09/11/elon-musk-moved-twitter-serv...

[2] https://news.ycombinator.com/item?id=37470110

JohnFen · on Oct 27, 2023

> I'm a bit confused about their so-called improvements to video recommendation quality and bot detection.

Let's not dismiss the possibility that they're just lying. I don't think they're trustworthy enough to be given the benefit of the doubt.

DonHopkins · on Oct 28, 2023

They're just running most of the bots in-house on the GPU supercompute cluster now, and billing Putin for it directly, now that the middleman Yevgeny Prigozhin is out of the picture.

murphyslab · on Oct 28, 2023

> I've seen a lot of sentiment from people that they see more bots, hate speech, and irrelevant content on their timelines. Maybe what I'm hearing is just anecdotal evidence or stories in a bubble?

1. Note that they do not unambiguously define bots; it's "bots and content scrapers" lumped together:

> Blocked bots and content scrapers at a rate +37% greater than 2022. On average, we prevent more than 1M bots signup attacks each day and we’ve reduced DM spam by 95%.

2. I suspect it's a matter of signal-to-noise. Yes, the absolute bot count could be down, but how does it compare to the number of humans using X/Twitter and the volume of content which they are contributing each day? I've tried to find reliable statistics on this, but to no avail. My anecdotal experience is that fewer people I've followed are still regularly using X/Twitter since the Musk acquisition. Some are over at Post.News, some at Threads, some have shifted to Mastodon. It's a very fragmented experience now.

antifa · on Oct 28, 2023

Sometimes I click a link to twitter, get log-in-walled or see the head of a tweet thread that's missing the rest of the thread. I wonder if they measure that as a successful "scrape bot blocked"?

viraptor · on Oct 28, 2023

> that they see more bots

My go-to example here is the scams advertising support for metamask problems. Those still appear every day and are trivial to identify. I'm not sure I'd consider Twitter's bot detection a success until trivial issues like that are solved. They're still below the level of "text match and auto ban" solutions.

MichaelZuo · on Oct 28, 2023

70000 hard coded references to the data centre in Sacramento...

At Twitter scale, 700 quick hacks would be excusable, 7000 iffy, 70000 just makes the old engineering team look like bozos.

Maybe it's embellished?

marcinzm · on Oct 27, 2023

>They have a GPU supercompute cluster?? It seems like they have the capability to do training and inference with state-of-the-art algorithms at massive scales then. Why have Twitter's recommendations and ad revenue (even before the acquisition) been so poor then?

In general, assuming you are in some sort of decent state, better ML doesn't make your revenue 50% better. It makes it 5% better per year.

antifa · on Oct 28, 2023

> I've seen a lot of sentiment from people that they see more bots, hate speech, and irrelevant content on their timelines. Maybe what I'm hearing is just anecdotal evidence or stories in a bubble?

TBF, this is my year-over-year experience on all social media platforms.

abledon · on Oct 28, 2023

love how those comments in [2] have aged haha

cratermoon · on Oct 27, 2023

"Shutdown the Sacramento data center and re-provisioned the 5,200 racks and 148,000 servers, which generated more than $100M in annual savings."

Yeah, about that.. https://www.cnbc.com/2023/09/11/elon-musk-moved-twitter-serv...

"Musk turned to his security guard and asked to borrow his pocket knife. Using it, he was able to lift one of the air vents in the floor, which allowed him to pry open the floor panels. He then crawled under the server floor himself, used the knife to jimmy open an electrical cabinet, pulled the server plugs, and waited to see what happened. Nothing exploded. The server was ready to be moved."

quantified · on Oct 27, 2023

This remains a fascinating, Howard Roark-like tale. I mean, it worked.

I don't use Twixter at all. But it seems like he got rid of most of the engineering org without getting rid of most of the technical aspects of Twitter. It's not super stable, but it's not a smoldering wreck. If he hadn't scared off the advertisers with his personality and social behaviors, it might be financially in a much better place. The human side of content moderation was definitely going to cost more; I am steering away from the content impact on Twixter financials.

lebean · on Oct 28, 2023

Roark dynamited his own creation because he had no other recourse and was prepared to face the consequences. Musk dynamited something he bought out of foolhardiness and is expending his energy avoiding consequences. Similar in some regard, but different.

cratermoon · on Oct 28, 2023

> it worked

Yes, and it would probably work much of the time. The times when it doesn't work, and you have to write off equipment worth more than the cost of a professional move (which shifts liability as well), are why more risk-averse companies don't do it.

jprd · on Oct 27, 2023

lol, sure. Dude is so capable, he is the RL MacGyver. Also, people still think he's Iron Man - even though Iron Man was semi-rational.

insanitybit · on Oct 27, 2023

> Among the changes we made was a shift of all media/blob artifacts out of the cloud, which reduced our overall cloud data storage size by 60%, and separately, we succeeded in reducing cloud data processing costs by 75%.

Huh. I feel like that's the one place to not leave cloud. 90% of why I want to use AWS is S3.

smaps · on Oct 27, 2023

Cloud storage is expensive; I’m willing to bet that’s a lot of money saved at their scale.

insanitybit · on Oct 28, 2023

Maintaining a massive scale infra with the same availability and durability as S3 is really expensive. It's possible but hard.

kalleboo · on Oct 28, 2023

But do you need S3 durability for meme gifs that will be seen once and then just waste disk space for decades?

johannes1234321 · on Oct 28, 2023

At least temporarily, old images weren't available, and some people cared: https://www.forbes.com/sites/mattnovak/2023/08/19/twitter-de...

kalleboo · on Oct 28, 2023

There's a difference between "all images" and "one in 100 million images"

S3 durability is "for every ten million objects stored, you can expect to incur an average loss of a single object once every 10,000 years". They could drop the durability by probably 5 9's and nobody would even notice.
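To make that arithmetic concrete, here's a rough sketch in Python. It assumes the quoted figure corresponds to S3's advertised eleven nines (99.999999999%) of annual per-object durability and that object losses are independent; the function names are made up for illustration.

```python
def expected_annual_losses(num_objects: int, nines: int) -> float:
    """Expected number of objects lost per year.

    Assumption (not from AWS docs verbatim): losses are independent and the
    annual per-object loss probability is 10**-nines, e.g. nines=11 for
    99.999999999% durability.
    """
    return num_objects * 10.0 ** -nines


def years_per_lost_object(num_objects: int, nines: int) -> float:
    """Average number of years between single-object losses."""
    return 1.0 / expected_annual_losses(num_objects, nines)


if __name__ == "__main__":
    # Compare eleven nines against a hypothetical drop of five nines.
    for nines in (11, 6):
        years = years_per_lost_object(10_000_000, nines)
        print(f"{nines} nines, 10M objects: one loss every {years:g} years")
```

With eleven nines this reproduces the quoted one object per 10,000 years for ten million objects; at six nines the same ten million objects would lose about ten a year.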

insanitybit · on Oct 28, 2023

Maybe not, but you could switch to single-zone S3 in that case and cut your costs down ~70%.

Migrating storage off of cloud is just really hard.

Freedom2 · on Oct 27, 2023

That's why Musk is forward thinking, leading the charge for us techies to follow.

monkaiju · on Oct 27, 2023

Lol, 'forward thinking' by moving one of the most expensive cloud costs in-house, for an app that clearly doesn't need that storage near its other cloud assets and operates at a scale few other services do? Woooow, he's sooooo smart. Nobody else has thought of this.

csteubs · on Oct 27, 2023

I hope their Trust & Security team has a roadmap that prioritizes topic- and thread-jacking. The advent of GPT and their recent changes to creator compensation have created a massive drive for engagement by any means; the results are fairly predictable. Clicking on any trending topic usually surfaces completely unrelated videos with that topic's hashtags (and all of the other trending topic hashtags) appended to the tweet.

Then you have the "content creators" who use GPT to summarize or add details to a post from a larger content aggregator in hopes of bandwagoning engagement. I see a lot of this type of behavior from popular History-focused accounts and the mega-accounts that post engagement bait content (think canned, "desert island"-type questions and polls). It's less malicious but certainly reinforces cynicism towards the state of Twitter/X and the broader social web.

tomohawk · on Oct 27, 2023

I've seen this in other places I've worked. Too many people and too much money to spend can be a lot worse than too few and not enough.

VHRanger · on Oct 27, 2023

Yes, twitter is doing so well currently as a business

add-sub-mul-div · on Oct 27, 2023

It's garbage, yes, but all these companies overhired for years and it was a no-brainer that they all needed to fix that. Hard to criticize that decision in principle. Though the execution of it seemed like a clown show.

marcinzm · on Oct 27, 2023

If your main focus is growth or potential growth then inefficiencies are fine if they allow increased product velocity. A unified platform for N features is more efficient but communication overheads will cut velocity on improvements for all those N features. Of course if growth is no longer a focus that’s a different story.

whatasaas · on Oct 28, 2023

Yea, I don’t know anything about cars but I’m pretty sure all these mechanics could work naked. Praise me for saving uniform cost.

outside1234 · on Oct 27, 2023

"Shrank site traffic by 80%, yielding cost savings and cratering the company as a whole"

I hope everyone there is getting paid in advance for their work...

tymekpavel · on Oct 27, 2023

I’m curious if you have a source for that? There has been lots of speculation on how much X traffic has fallen, but the best data I can find basically says that it might’ve dropped from the 2nd most visited website to the 5th most visited website in the world (according to SimilarWeb).

kenjackson · on Oct 27, 2023

The numbers I see are declines in traffic of 15-20% with ad revenue declines of more than 50%.

https://variety.com/2023/digital/news/musk-twitter-x-acquisi...

I think for many sites this would be pretty horrible. I feel like the bar for X is that it still runs.

yoav · on Oct 28, 2023

They did this by shipping so many bugs (that I’ve seen) that the feeds only load 20% of the time.

Saves a ton of bandwidth, I bet.

rabf · on Oct 27, 2023

Changelog: https://twitter.com/cb_doge/status/1717668534078034024

imperialdrive · on Oct 27, 2023

That project must be so much fun to work on. I'd pay just to shadow it.