#1 | |
Addict
Posts: 363
Karma: 3003003
Join Date: Jul 2023
Device: Scribe, OA2, Glo HD, PRS-350
|
Meta admits to training LLM AI with terabytes of torrented copyrighted works.
https://arstechnica.com/tech-policy/...i-authors-say/
Quote:
|
|
#2 |
Bibliophagist
Posts: 43,345
Karma: 165170674
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Reading that did make me laugh, again, at OpenAI's hypocrisy in complaining that DeepSeek may, in part, have been trained on data distilled from them. Their attitude, that using data they mined from the Internet regardless of rights is okay but someone now mining their data is absolutely horrific, appeals to my sense of humour.
|
#3 |
Custom User Title
Posts: 10,286
Karma: 72663495
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
|
Also pirated work irony.
|
#4 |
Evangelist
Posts: 425
Karma: 8522810
Join Date: Dec 2010
Location: Wisconsin, USA
Device: Kindle PW3
|
The torrented part is pretty damning. I think there's a reasonable argument in favor of them being able to use copyrighted works as fair use if they had legally obtained them.
|
#5 |
Addict
Posts: 273
Karma: 3000000
Join Date: Nov 2015
Device: none
|
On a side note, I find it curious that the complete archive is just 81.7 terabytes. For about 1500 euro you can buy five 18 TB hard drives (90 TB in total), and anyone can have a home backup with space to spare.
|
#6 |
Bibliophagist
Posts: 43,345
Karma: 165170674
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
I'd go for 6 or 7 drives to allow a RAID 5 or RAID 6 array. Our new backup array at work used 18TB drives (Western Digital Red Pro NAS) and we had several failures in the first 6 months. RAID allowed us to simply swap the failed drive, and then the array rebuilt itself. Not exactly fast, but all automated. With its full 12 drives, it gave us about 170TB of usable storage over a 10GB fibre connection.
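For anyone who wants to check the drive counts, here is a rough sketch of the capacity arithmetic. It assumes the textbook rule that RAID 5 gives up one drive's worth of space to parity and RAID 6 gives up two, and it ignores filesystem overhead and the TB versus TiB difference. The function name `usable_tb` is made up for illustration, and treating the 12-drive array as RAID 6 is an assumption, since the post does not say which level it used.

```python
def usable_tb(drives: int, drive_tb: float, parity_drives: int) -> float:
    """Usable capacity in decimal TB for one parity-based array.

    parity_drives: 0 = no redundancy, 1 = RAID 5, 2 = RAID 6.
    """
    if drives <= parity_drives:
        raise ValueError("need more drives than parity drives")
    return (drives - parity_drives) * drive_tb


ARCHIVE_TB = 81.7   # archive size quoted in the thread
DRIVE_TB = 18.0     # drive size quoted in the thread

layouts = {
    "5 drives, no redundancy": usable_tb(5, DRIVE_TB, 0),
    "6 drives, RAID 5": usable_tb(6, DRIVE_TB, 1),
    "7 drives, RAID 6": usable_tb(7, DRIVE_TB, 2),
    "12 drives, RAID 6 (assumed level)": usable_tb(12, DRIVE_TB, 2),
}

for name, tb in layouts.items():
    print(f"{name}: {tb:.0f} TB usable, {tb - ARCHIVE_TB:.1f} TB beyond the archive")
```

Under these assumptions the 6-drive RAID 5 and 7-drive RAID 6 layouts both end up with the same 90 TB as five bare drives, so the extra one or two drives buy fault tolerance, not space.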
|
#7 |
Custom User Title
Posts: 10,286
Karma: 72663495
Join Date: Oct 2018
Location: Canada
Device: Kobo Libra H2O, formerly Aura HD
|
|
#8 |
Still reading
Posts: 13,401
Karma: 102739835
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
I used to use RAID 5 with ultra wide & fast 10,000 rpm 20G SCSI drives, on a HW EISA RAID controller with onboard RAM, an onboard battery and a big UPS. Later it was PCI. Any server needs a UPS. Really, workstations do too; that's the advantage of a laptop. It was very noisy and power hungry. It lived in the attic above the bathroom and the noise baffled visitors.
The RAID 5 rebuild time gets excessive with 250G+ drives. A mirror is simpler. Also, decent HW RAID controllers for modern drives are not everyday things.
The highest-end thing I built was for a college in 1998 or 1999. It had two shelves, each with its own UPS, and each end of each SCSI bus connected via a SCSI buffer to a Pentium Pro with a dual channel HW RAID controller. The two Pentium Pro servers were also on separate UPSes. It ran NT4.0 Enterprise with the 1st non-beta MS Cluster SW, developed by DEC. It was some sort of combo RAID so you could lose a shelf. Maybe RAID 5 per shelf, mirrored?
Now our server lives in a fireproof, waterproof shed with its own UPS, but that is fed from the main Solar + Grid UPS (6000+ Wh). It has just a single drive and backups, because down time no longer matters. No live services. In the old days the server handled proxy for Internet, all mail, Windows Update, print serving, and a VPN server for people away from home to securely do email etc.
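The point about rebuild times growing with drive size can be put in rough numbers. This is only a lower-bound sketch: it assumes a rebuild has to write the whole replacement drive once at a steady rate, and the throughput figures below are illustrative guesses, not measurements from any of the hardware mentioned in the thread. Real rebuilds on a busy array are usually slower. The function name `min_rebuild_hours` is hypothetical.

```python
def min_rebuild_hours(drive_gb: float, write_mb_per_s: float) -> float:
    """Lower bound on rebuild time: write one full drive at a constant rate.

    drive_gb: drive capacity in decimal GB.
    write_mb_per_s: assumed sustained write speed in MB/s (a guess).
    """
    if write_mb_per_s <= 0:
        raise ValueError("write speed must be positive")
    seconds = (drive_gb * 1000) / write_mb_per_s
    return seconds / 3600


# (label, capacity in GB, assumed sustained MB/s); the speeds are assumptions
examples = [
    ("250 GB drive", 250, 60),
    ("18 TB drive", 18_000, 200),
]

for label, gb, rate in examples:
    hours = min_rebuild_hours(gb, rate)
    print(f"{label} at {rate} MB/s: at least {hours:.1f} hours")
```

Capacity has grown much faster than sustained write speed, which is why the best case goes from roughly an hour to about a day under these assumed rates.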
#9 | |
Still reading
Posts: 13,401
Karma: 102739835
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
Quote:
Considering what LLMs do, the idea of fair use, even via a licence, is obnoxious. Most authors I know would never agree. It's a "licence" to let the LLM users plagiarise. Really, ANYTHING other than using PD works is a violation of the rights of ANY content creator, including bloggers and forum posters. It's a use that was never intended. They can't be forbidden from using PD content.
|
#10 | |
eReader Wrangler
Posts: 7,769
Karma: 50741051
Join Date: Mar 2013
Location: Boise, ID
Device: PB HD3, GL3, Tolino Vision 4, Voyage, Clara HD
|
Quote:
|
|
#11 | |
Bibliophagist
Posts: 43,345
Karma: 165170674
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Quote:
|
|
#12 | |
Evangelist
Posts: 425
Karma: 8522810
Join Date: Dec 2010
Location: Wisconsin, USA
Device: Kindle PW3
|
Quote:
If I could get ChatGPT to quote me a full chapter out of Harry Potter word for word, that would be one thing. But I'm pretty sure it can't.
|
#13 | |
Bibliophagist
Posts: 43,345
Karma: 165170674
Join Date: Jul 2010
Location: Vancouver
Device: Kobo Sage, Libra Colour, Lenovo M8 FHD, Paperwhite 4, Tolino epos
|
Quote:
You may also want to take a look at the Fair Use/Fair Dealing laws in your area.
|
#14 | |
Still reading
Posts: 13,401
Karma: 102739835
Join Date: Jun 2017
Location: Ireland
Device: All 4 Kinds: epub eink, Kindle, android eink, NxtPaper
|
Quote:
It's not the same thing. Also, the big corps didn't even buy a copy; they used pirate copies. It's a totally false analogy that also reveals you don't understand how LLMs work, or copyright.
|
#15 |
Grand Sorcerer
Posts: 28,262
Karma: 203719142
Join Date: Jan 2010
Device: Nexus 7, Kindle Fire HD
|
Sorry, but I think what you actually meant to say was "that also reveals that your interpretation of the legal ramifications of how LLMs work does not match my own." Because let's face it: your interpretations of AI and LLMs in general are a bit more philosophical (not to mention semantic) than most people's.
|