this post was submitted on 01 Jul 2023
1002 points (96.5% liked)

Technology

59600 readers
4059 users here now

This is a most excellent place for technology news and articles.


Our Rules


  1. Follow the lemmy.world rules.
  2. Only tech related content.
  3. Be excellent to each another!
  4. Mod approved content bots can post up to 10 articles per day.
  5. Threads asking for personal tech support may be deleted.
  6. Politics threads may be removed.
  7. No memes allowed as posts, OK to post as comments.
  8. Only approved bots from the list below, to ask if your bot can be added please contact us.
  9. Check for duplicates before posting, duplicates may be removed

Approved Bots


founded 1 year ago
MODERATORS
 

Elon Musk said verified accounts would be limited to reading 6,000 posts per day while unverified users will be limited to 600.

you are viewing a single comment's thread
view the rest of the comments
[–] [email protected] 23 points 1 year ago (2 children)

No you misunderstand they desperately want them to be trained with their data. They just want them to pay hundreds of thousands to millions of dollars to do so. Twitter is not buckling under the weight of data scraping, Elon is just pissed that companies are data scraping instead paying his exorbitant API fees.

[–] [email protected] 9 points 1 year ago (1 children)

They just want them to pay hundreds of thousands to millions of dollars to do so.

This is the hilarious part to me: some companies might pay these fees, but there will be many more who won’t and will instead use actual web scrapers to get their data anyways. As the number of individuals training LLM models increases in the next couple of years, this will create a much more significant traffic load compared to API calls.

[–] [email protected] 4 points 1 year ago* (last edited 1 year ago)

Yeah he doesn’t seem to understand he’s not selling the data, the data is public, he’s selling convenience. And if the convenience isn’t worth the price you’ve set, people will just take the extra effort and avoid the expense.

[–] [email protected] 4 points 1 year ago

Exactly. I do selenium scripting as my main task for work, and as soon as I heard about how high the api rates were my first through was "Jesus, it might slower than straight api calls, and the dynamic xpaths might suck, but I could write a script that scrapes the website for cheaper." Twitter is hurting for cash right now, and I imagine his effort to raise funds is the end goal here. He instituted the api policy, learned about another side effect, and continues to with the most extreme, devoid of nuance response each time.

All "in my opinion," of course.