526
submitted 1 month ago by [email protected] to c/[email protected]

A user on the online forum 4chan has leaked a massive 270GB of data purportedly belonging to The New York Times. This leak includes what is claimed to be the source code for the newspaper’s digital operations.

top 50 comments
sorted by: hot top controversial new old
[-] [email protected] 178 points 1 month ago
[-] [email protected] 120 points 4 weeks ago

"send nodes"

[-] [email protected] 25 points 4 weeks ago
[-] [email protected] 41 points 4 weeks ago

Node has been around longer than web3

NPM nightmares intensify

load more comments (5 replies)
load more comments (2 replies)
[-] [email protected] 22 points 4 weeks ago

270GB of mostly node modules?

[-] [email protected] 52 points 4 weeks ago

You're right, it would be bigger if it was node

[-] [email protected] 8 points 4 weeks ago

Sounds pretty average

load more comments (1 replies)
[-] [email protected] 3 points 4 weeks ago

pnpm store would probably save them a lot.of grief if true

[-] [email protected] 4 points 4 weeks ago

Nah, having big node_modules folders is a security feature. It’s like keeping your valuables in a bag of trash.

It also takes a long time to make a dump, so you have a higher chance of noticing it happening.

[-] [email protected] 55 points 4 weeks ago* (last edited 4 weeks ago)

270GB feels insane for the source code of a single organisation. Is there media assets or backups in there too?

EDIT: yep, multiple subsidiaries and slack Comms which could inflate it by a lot. we post a whole lot of uncompressed shit on our slack

[-] [email protected] 53 points 4 weeks ago

NY Times has a freaking great data visualisations, they are (were?) employing a wizard in this space, doing custom extensions on d3.js.

[-] [email protected] 47 points 4 weeks ago

Source code... for a website?

[-] [email protected] 113 points 4 weeks ago

Subscription software. Tracking software. Ad tools. Promotion tools. Tools for journalists.

The website is just what you see.

[-] [email protected] 19 points 4 weeks ago

Yeah, I guess I didn't consider all the other operational shit that goes into providing content and funding for the website.

[-] [email protected] 22 points 4 weeks ago

It’s why our PCs have gotten insanely fast but websites still load like fucking trash. All the back end spying shit takes up a ton of cpu cycles. If you don’t already have em run ublock origin and no script and the internet is so fucking speedy 😆

[-] [email protected] 7 points 4 weeks ago

I hadn't noticed but then again I run Ublock Origin on Firefox.

[-] [email protected] 5 points 4 weeks ago

Yeah. You got yourself covered no script helps with JavaScript being pesky. But breaks a lot of shit tbh.

load more comments (1 replies)
load more comments (2 replies)
[-] [email protected] 66 points 4 weeks ago* (last edited 4 weeks ago)

Anything more complicated than a static website is going to have a significant amount of server-side code.

Also, the article explains that it's not just the website, but ALL of their repos, which would include their smartphone apps, backend tools, etc.

[-] [email protected] 41 points 4 weeks ago* (last edited 4 weeks ago)

I doubt this will affect much ... that's a lot more source code than I'd expect though, dang.

Presumably a lot of it is for internal operations (custom editing software or something of that ilk).

[-] [email protected] 15 points 4 weeks ago

It sounds like it's not all source code, from the article.

[-] [email protected] 39 points 4 weeks ago

Now everyone will get to run Wordle!

[-] [email protected] 8 points 4 weeks ago* (last edited 4 weeks ago)

In case anyone missed the hubbub: [ETA: This is from March 2024; unconnected to this hack/leak]

https://apnews.com/article/new-york-times-wordle-clones-takedown-dmca-35d32b7548f7312ea74a2065b2cd31a6

The Times has filed several Digital Millennium Copyright Act, or DMCA, takedown notices to developers of Wordle-inspired games, which cited infringement on the Times’ ownership of the Wordle name, as well as its look and feel — such as the layout and color scheme of green, gray and yellow tiles.

Numerous impacted developers have also taken to social media to share their frustrations. Many said that their games, which range from Wordle-like offerings in other languages to more guessing games, would be taken down as a result.

Still, Brauneis said he believes the Times’ arguments for Wordle copyright infringement are on “a little bit shaky ground” for several reasons. Rules of a game, for example, are not covered by copyright — and that can include the layout of the game itself, he said.

[-] [email protected] 4 points 4 weeks ago

I prefer Connections.

[-] [email protected] 23 points 1 month ago* (last edited 4 weeks ago)

We still have no legal right to use, change and share its source code, control it both ourselves and in groups. It's still anti-libre software.

[-] [email protected] 59 points 4 weeks ago* (last edited 4 weeks ago)

Anything that may help develop better adblockers/paywall bypasses or exposes how/what of our personal information is collected is a win in my book. And this may very well be none of those things.

load more comments (4 replies)
[-] [email protected] 4 points 4 weeks ago

Just seeing how something is approached helps.

I sometimes rebuild software from one language to another for practice.

load more comments (1 replies)
[-] [email protected] 16 points 4 weeks ago

Did this leak happen before or after NYT published an investigation detailing how Israeli forces were raping and torturing defenseless Palestinian detainees brought in from the Gaza Strip?

[-] [email protected] 14 points 4 weeks ago

I have not read the news in a really long time just cause paywalls are annoying as frick.

[-] [email protected] 9 points 4 weeks ago

Consider paying for the news...?

[-] [email protected] 16 points 4 weeks ago

I'd only do that if you want independent news.

[-] [email protected] 7 points 4 weeks ago

I'm not sure what you're saying here ...

[-] [email protected] 9 points 4 weeks ago

Pay for news if you want it to be independent, and not beholden to sponsors.

I'd go as far as to say that paying for news (if you have the means to do so comfortably), is your duty as a commitment to democracy.

[-] [email protected] 6 points 4 weeks ago

Ahh, yes I agree on all points; thanks for the clarification!

load more comments (3 replies)
[-] [email protected] 4 points 4 weeks ago

He probably means one of these (or both):

  1. New York Times is a huge corporation. The commenter would only support a site which is run by one creator, or with a genuine small team, which is transparent and not an asshole.

  2. New York Times is biased politically or accepting bribery attempts from other corpos to make them look in a better light.

[-] [email protected] 7 points 4 weeks ago
  1. New York Times is a huge corporation. The commenter would only support a site which is run by one creator, or with a genuine small team, which is transparent and not an asshole.

Yeah but good luck chasing multiple stories across the world as a small team.

[-] [email protected] 7 points 4 weeks ago

Jesus Christ, no. It's almost like you're trying to sow distrust in the news and facts.

The NYT isn't perfect, but it's some of the most reliable news the world has.

As of March 2023, The New York Times Company employs 5,800 individuals,[101] including 1,700 journalists according to deputy managing editor Sam Dolnick.[122] Journalists for The New York Times may not run for public office, provide financial support to political candidates or causes, endorse candidates, or demonstrate public support for causes or movements.[123] Journalists are subject to the guidelines established in "Ethical Journalism" and "Guidelines on Integrity".[124] According to the former, Times journalists must abstain from using sources with a personal relationship to them and must not accept reimbursements or inducements from individuals who may be written about in The New York Times, with exceptions for gifts of nominal value.[125] The latter requires attribution and exact quotations, though exceptions are made for linguistic anomalies. Staff writers are expected to ensure the veracity of all written claims, but may delegate researching obscure facts to the research desk.[126] In March 2021, the Times established a committee to avoid journalistic conflicts of interest with work written for The New York Times, following columnist David Brooks's resignation from the Aspen Institute for his undisclosed work on the initiative Weave.[127]

load more comments (5 replies)
load more comments (1 replies)
load more comments (1 replies)
[-] [email protected] 5 points 4 weeks ago

You can go to archive.is and put in the url of a news story you want to read in the second box and it will usually let you bypass the paywall.

[-] [email protected] 10 points 4 weeks ago

Oh. They stopped seeding the torrent at 85%...

[-] [email protected] 10 points 4 weeks ago

I expect that paywall to be fully useless soon.

[-] [email protected] 35 points 4 weeks ago

That's a really silly take ... a Paywall is just an authorization mechanism.

That's like saying the source code of lemmy leaks and you expect your account to be compromised any second.

[-] [email protected] 23 points 4 weeks ago

I can sell you a copy of lemmys source code, are you interested?

[-] [email protected] 6 points 4 weeks ago

I'll sell it for cheaper!

[-] [email protected] 6 points 4 weeks ago

I can give you 25 schmeckles.

load more comments (1 replies)
load more comments
view more: next ›
this post was submitted on 07 Jun 2024
526 points (98.3% liked)

Technology

55744 readers
3701 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS