• qaz@lemmy.world
    28 days ago

    I visited Civitai a while back and it was full of lewd models and LoRAs. Based on that, I do think AI-generated porn unironically has the most potential of all their AI products so far. It makes sense for them to do this.

    • tal@olio.cafe
      29 days ago

      I suspect that if employees at Meta who are tasked with hoovering up training data from everywhere they can find are just watching porn, it probably won’t go over well on their annual reviews.

      I would give good odds that the human at Meta most closely responsible for the BitTorrent download at issue has never even seen this particular torrent by name or URL. The scope of data involved in training is too large for direct human involvement. They probably did something along the lines of writing a bot in Python or similar to spider websites and feed every torrent it could find into a torrent downloader. That downloader’s output then gets dumped into some massive internal collection of data that some other team uses as part of the training process. The humans just create tools and set them in motion, and never actually see the overwhelming majority of the data that they’re processing.
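
To make the "tools set in motion" idea concrete, here is a minimal sketch of the link-harvesting step of such a bot. It is purely illustrative and is not anyone's actual tooling: the names `TorrentLinkParser`, `find_torrent_links`, and `enqueue_new` are hypothetical, it does no network access or downloading, and it assumes a "torrent" is any link that is either a `magnet:` URI or a path ending in `.torrent`. Nothing in it ever looks at a torrent's name or contents, which is the point being made above.

```python
from collections import deque
from html.parser import HTMLParser
from urllib.parse import urljoin, urlparse


class TorrentLinkParser(HTMLParser):
    """Hypothetical helper: collects hrefs that look like torrent references."""

    def __init__(self, base_url):
        super().__init__()
        self.base_url = base_url
        self.links = []

    def handle_starttag(self, tag, attrs):
        if tag != "a":
            return
        href = dict(attrs).get("href")
        if not href:
            return
        if href.startswith("magnet:"):
            # Magnet URIs are already absolute; keep them unchanged.
            self.links.append(href)
        else:
            # Resolve relative links against the page URL, then check the path.
            absolute = urljoin(self.base_url, href)
            if urlparse(absolute).path.lower().endswith(".torrent"):
                self.links.append(absolute)


def find_torrent_links(html, base_url):
    """Return every torrent-like link found in one page of HTML, in page order."""
    parser = TorrentLinkParser(base_url)
    parser.feed(html)
    parser.close()
    return parser.links


def enqueue_new(links, seen, queue):
    """Append links not seen before to the download queue; return how many were added."""
    added = 0
    for link in links:
        if link not in seen:
            seen.add(link)
            queue.append(link)
            added += 1
    return added
```

In a sketch like this, a separate downloader process would drain `queue` and write its output to shared storage, so the only thing a human ever touches is the code.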