• PityPityBangBang@lemmy.world
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    1
    ·
    edit-2
    7 hours ago

    Perhaps that so many people have quoted that chapter in college and high school papers, book review and film reviews, and cultural criticism that maybe there is a weird “shoot the moon” situation where a “works of origin” begin to look like a “works of derivation” in LLMs.

    • Echo Dot@feddit.uk
      link
      fedilink
      English
      arrow-up
      2
      ·
      2 hours ago

      The problem is it’s not plagiarism detector (it would also be a pretty bad one since it can’t detect quotes) it’s an AI detector.

      It’s saying that a direct quote is AI, Which obviously isn’t true, it’s a quote, which is a different thing.

      If 10% of my thesis is quoting other works that’s not the same as my thesis being 10% AI generated. The distinction needs to be made.

    • Hacksaw@lemmy.ca
      link
      fedilink
      English
      arrow-up
      10
      ·
      5 hours ago

      Yeah, or perhaps there is no need to make up excuses for the Copyright Infringement, world bruning, infinite lying machine lying about what text is real vs generated by it. LLMs lie, LLM based LLM detectors lie about lies.

    • tempest@lemmy.ca
      link
      fedilink
      English
      arrow-up
      13
      ·
      7 hours ago

      Frankenstein is out of copyright.

      I would be unsurprised if you couldn’t tease out the entire book. I wonder if Mary Shelly was a fan of dashes.

      • 8oow3291d@feddit.dk
        link
        fedilink
        English
        arrow-up
        6
        ·
        6 hours ago

        Being out of copyright is kinda irrelevant. There are lawsuits right now, because the AI firms apparently fed the AI’s tons of copyrighted books.

        • tempest@lemmy.ca
          link
          fedilink
          English
          arrow-up
          4
          ·
          edit-2
          5 hours ago

          It is and it isn’t. Those lawsuits mean they at least try to stop it from producing copyrighted work. They won’t make Simpsons characters or produce anything from the house of mouse without major cajoling or some trickery in the prompt.

          For the text from Frankenstein they are not even going to try.

          Incidentally after writing this content I tried to get chatgpt to reproduce the first paragraph of chapter 3. It refused and offered a summary. I “reminded” it that the book is in the public domain and then it reproduced it without issue.

          • OwOarchist@pawb.social
            link
            fedilink
            English
            arrow-up
            2
            ·
            4 hours ago

            I tried to get chatgpt to reproduce the first paragraph of chapter 3. It refused and offered a summary. I “reminded” it that the book is in the public domain and then it reproduced it without issue.

            I bet you could do exactly the same thing for a book that’s still copyrighted.

            • tempest@lemmy.ca
              link
              fedilink
              English
              arrow-up
              2
              ·
              3 hours ago

              I did see posts of someone doing it with Harry Potter but I think it took a little more effort

          • 8oow3291d@feddit.dk
            link
            fedilink
            English
            arrow-up
            2
            ·
            5 hours ago

            They still obviously trained it on the copyrighted text. Which I think is what some claim is illegal without payment?

            Mind you, I don’t think copyright should cover that, for text at least. It is not in society’s interest.