A user on the online forum 4chan has leaked a massive 270GB of data purportedly belonging to The New York Times. This leak includes what is claimed to be the source code for the newspaper’s digital operations.
It’s mostly node modules
“send nodes”
270GB of mostly node modules?
You’re right, it would be bigger if it was node
Sounds pretty average
I hate Web 3.0
Node has been around longer than web3
NPM nightmares intensify
Web 3.0 ≠ web3
I also hate making things from smaller pieces, the engineering in software engineering. /s
reminds me of the time someone said “Who is this 4chan?” on tv and it became a meme. good times
He can’t keep getting away with it.
NY Times has a freaking great data visualisations, they are (were?) employing a wizard in this space, doing custom extensions on d3.js.
270GB feels insane for the source code of a single organisation. Is there media assets or backups in there too?
EDIT: yep, multiple subsidiaries and slack Comms which could inflate it by a lot. we post a whole lot of uncompressed shit on our slack
Source code… for a website?
Subscription software. Tracking software. Ad tools. Promotion tools. Tools for journalists.
The website is just what you see.
Yeah, I guess I didn’t consider all the other operational shit that goes into providing content and funding for the website.
It’s why our PCs have gotten insanely fast but websites still load like fucking trash. All the back end spying shit takes up a ton of cpu cycles. If you don’t already have em run ublock origin and no script and the internet is so fucking speedy 😆
I hadn’t noticed but then again I run Ublock Origin on Firefox.
Yeah. You got yourself covered no script helps with JavaScript being pesky. But breaks a lot of shit tbh.
You can still make it work, it’s just more stuff to click on. I used to use NoScript too, but eventually stopped using it.
That’s not what makes websites slow. It’s React.
Retards with React. “I’m optimizing user experience”
Oh haven’t heard of this will check it out.
Removed by mod
Just seeing how something is approached helps.
I sometimes rebuild software from one language to another for practice.
I expect that paywall to be fully useless soon.
I just received an email, is this related?
Critical support
Thats a lot of data but surly its not all their articles cos I’d very much like to train mixtral7x8b on it along with 4chan data and shir from the dark web. Surly there is a project where such a model is public and being trained on literally everything regardless of legality.
EDIT: why am i getting downvoted?