Yeah, but anything you create automatically has a copyright, so for example this comment is not in the public domain. Its use is limited to the context I am using it in; that is, I expect it to be copied for federation purposes, but I wouldn’t say that AI is covered in this context, just genuine readership, moderation, and bots that are ‘part of the community’.
At least that’s the EU stance afaik. Like if I saw this comment on a billboard somewhere I’d see that as a clear breach of copyright and even privacy.
Well, it’s one thing to say an ‘artificial agent’ looks at someone’s work on deviant art and learns from it. It’s another to use that to make money, as I personally can’t imagine many of the posters would have been on board with that.
I’m fine with that, but let’s put some rules against this.
Any AI models should be able to determine the source of their data to a defined level of accuracy.
There should be a well-defined way to block data from being used by AI. If one of these ways (e.g. robots.txt) has been breached, the model has to be rebuilt without the data, and reparations made to the content owners.
A neural network is basically nothing more than a set of weights. If one word makes a weight go up by 0.0001 and then another word makes it go down by 0.0001, and you do that billions of times for billions of weights, how do you determine what in the data created those weights? Every single thing that’s in the training data had some kind of effect on everything else.
It’s like combining billions of buckets of water together in a pool and then taking out 1 cup from that and trying to figure out which buckets contributed to that cup. It doesn’t make any sense.
Respectfully, I worked for Alexa AI on compositional ML, and we were largely able to do exactly this with customer utterances, so to say it is impossible is simply not true. Many companies have to have some degree of ability to remove troublesome data, and while tracing data inside a model is rather difficult (historically it would be done during the building of datasets or measured at evaluation time) it’s definitely something that most big tech companies will do.
Sorry, I misinterpreted what you meant. You said “any AI models” so I thought you were talking about the model itself should somehow know where the data came from. Obviously the companies training the models can catalog their data sources.
But besides that, if you work on AI you should know better than anyone that removing training data is counter to the goal of fixing overfitting. You need more data to make the model more generalized. All you’d be doing is making it more likely to reproduce existing material because it has less to work off of. That’s worse for everyone.
It’s not impossible lol. All a company would need to do is keep track of where they were getting content. If I use a script to download as much of the internet as possible and end up with a bunch of copyrighted content I could still get in trouble, hell there was even a guy arrested for downloading jstor without authorization.. Stop letting these guys get away with crimes just because you like the idea of the end product
No officer, this is not a pirated movie. It’s generated by an AI model I created and trained with data from the internet and the fact that it’s 99% identical to an existing movie is irrelevant.
Ok… so from now on … when I see a “repackaged” Microsoft product that for some reason… which I don’t care to know… doesn’t ask for a payment… I can use it without restrictions ?!! that’s really nice of you Microsoft … thank you.
Mister/miss, LLMs that can run locally are fine. It’s the infrastructure and the large scale of commercial cloud LLMs that create some issues. You have to read some researches on this topic.
“See I like AI because I’m selfish. Also those bad things are in the past, I’m using an ethical AI system now! But also, who gives a fuck because I only care about myself!”
Yeah you get it guy! Maybe you can be Trumps secretary of technology!
All of the resources and energy spent to get you this product you like. You can’t discount what it took to create something just because the final product is small and efficient. Take a look at the manufacturing footprint of nearly all complex hardware.
I’m not saying you created the AI but you are one of its supporters, without which there would be no AI.
If this was all just pitched as developing a new plain English coding language, I think the hype following it would be far more appropriate, but then the funding wouldn’t follow to support the massive development costs of AI.
Its become a circle of hype chasing money chasing hype.
Its not you that is the problem so to speak though, its the collective “you’s” who think the same way.
I really want phones like this to actually work and to succeed, but there are so many things these companies have to get just right – it’s a huge undertaking.
Releasing a phone that’s admittedly unfinished seems really risky. People are getting sick of unfinished products being tossed at them for full price, with the empty promises from the company that those missing features will be added in later.
I have the Samsung one right now, but the problem with it is that I can’t charge while listening to music and I ain’t gonna sacrifice sound quality with a 2 in 1 dongle.
These specs actually seem really solid for the price point, I’m glad to see decent alternative smartphones popping up that actually have some power.
What’s bugging me is the lack of information about the software. Apparently this is Android with a layer like Hallium to run a Debian userspace on top? And yet they don’t advertise that fact. It’s just a little off putting that this product seems to be aimed at Linux/general tech enthusiasts, yet the company seemed to miss the fact that those customers tend to really like knowing what they’re running under the hood.
I mean, I use maybe 3-4gb at any given time, without limiting myself. I personally don’t need heaps of RAM, 6gb is enough to have some overhead for me.
I haven’t looked at too many prices recently, I’ve had the same phone for a while, but this doesn’t seem to unreasonable imo, especially considering this is the first product from a small, new company.
theregister.com
Hot