Hacker Newsnew | past | comments | ask | show | jobs | submit | vfistri2's commentslogin

I would call it very woke, exaggerating political correctness to the point of being clownish.


While there is only 1 biggest player (openAI) I would say we can, but if it gets to a point where we have multiple players i suppose it would be extremely hard without proper controls already in place.


To me this is quite an interesting idea, I am not sure if now is the correct time to pause it. Or how to spot when it is correct time. What are your thoughts?


I feel like github is not the right place to put a blog, reading any text there that is longer than single line gets me eyestrain.


I agree, and I've seen more "blogs" on GitHub lately. it's very annoying.


Would be interested in art copyright too. But honestly I have no idea how to prove that ML model was trained on my art.


Yes, the proof... Actually, there must be some diff tool to compare models before and after processing some source? I'm not sure, but it must be possible to detect pieces of come ingested data in the model itself. I've seen the famous "wolf misdetection" investigation screenshots, when the AI, apparently labeled a dog as a wolf because there was snow around on the picture.


For Stable Diffusion I think the average number of bits in the model compared to the number of training images is in the order of 6-8 bits per image. There is no "storage" of the training images. It's 250 TB data in, and 1.4 GB in the weight file or so depending on the precision.. I think those 250 TB are compressed as well, so maybe 25,000 TB raw data in distilled down to 1.4 GB. I fairly certain you could never prove an AI saw your image. You'd have to sue the company and by discovery look at their training data.

There are probably pathological cases where a repeating image is more strongly overfit in the training data and could be reproduced in much more detail than this average though. But the systems learn similar to the human brain, they learn the gist of a style or scene and how it relates to words. It's not a search engine, it doesn't copy/paste any block of pixels...

One interesting example is that since SD's original training set included some stock photo watermarked images, it learned that there was a concept of watermark, which can end up in the middle of generated images. Not in an intelligible way, but you can see roughly how it interpreted this detail. And in those cases you DO have a very very repeating similar pixel bitmap in the training data.


yesterday there was a rant about parsing yml and its different versions/datatypes support


One of the most inspiring "small" things a man can do. On a side note It really shows the guy doing this video is fully enjoying what he does.


even though intentions are noble it awfully reminds me of matrix scenes


just wait until amazon starts playing it on their website, (it is doable with autoplay after first click) :)


Of course, of course. This is why supermarkets are so well known for blaring intense, high-octane dubstep at thumping volumes.


My tinny laptop speakers will protect me!


So people start dancing stead of buying?


Here's a preview of how that would work: https://youtu.be/JEq10L7u3SM?t=122 (warning, loud)


If mouse over "buy now" play sub bass


&& if speakers are of sufficient size


would be interested to contribute if its open source


Still early stages but building on top of this. - https://github.com/travisjeffery/jocko

Since, the author has moved onto other projects. decided it would be an interesting challenge


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: