• 0 Posts
  • 14 Comments
Joined 1 year ago
Cake day: July 17th, 2023

  • Couple of things here - what do you do with the open source models already published? There’s terabytes of data encapsulated in those. Some have published corpora, some don’t. How do you plan to determine that a work comes from an unregistered AI?

    Also, with respect to “within the country” - VPNs exist. Tor exists. SD cards exist. What’s your plan to control the flow of trained models without violating civil rights?

    This is a Teflon slope covered in oil. (IMO)


  • I agree that under the current system of “idea ownership” someone needs to be held responsible, but in my opinion it’s ultimately a futile action. The moment that arbitrary individuals are allowed to download these models and use them independently (Hugging Face et al.), all control of whatever is in the model is lost. Shutting down OpenAI or Anthropic doesn’t remove the models from people’s computers, and doesn’t eliminate the knowledge of how to train them.

    I have a gut feeling this is going to change the face of copyright, and it’s going to be painful. We collectively weren’t ready.


  • Sure, but that particular horse has left the barn. There will be cases where identification is easy (or at least easier), but as Oracle v. Google showed, there are only so many ways to express an idea in code.

    For example, I just asked Claude 2 “Write a program in C to count from 1 to some arbitrary number specified on the command line.” Can you tell me the origin of this line from the result?

    for(int i=1; i<=n; i++) {

    I mean, if it’s from a copyrighted work, I certainly don’t want to use it in an open-source project!

    EDIT: Guessing there’s a bug in HTML entity handling.