While there is only one dominant player (OpenAI), I would say we can, but if it gets to the point where we have multiple players, I suppose it would be extremely hard without proper controls already in place.
To me this is quite an interesting idea. I am not sure if now is the correct time to pause it, or how we would spot when the correct time is. What are your thoughts?
Yes, the proof... Actually, is there some diff tool to compare models before and after they process some source? I'm not sure, but it must be possible to detect pieces of some ingested data in the model itself. I've seen the famous "wolf misdetection" investigation screenshots, where the AI apparently labeled a dog as a wolf because there was snow in the background of the picture.
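Not a real provenance tool, but as a toy illustration: if you had the same model's weights from before and after it was trained on some source, you could at least see which parameters moved. A minimal sketch, assuming two PyTorch checkpoints saved as plain state dicts under the hypothetical names "before.pt" and "after.pt":

    import torch

    # Hypothetical checkpoint files for the same architecture: one snapshot
    # taken before and one after further training on the data in question.
    before = torch.load("before.pt", map_location="cpu")
    after = torch.load("after.pt", map_location="cpu")

    for name, w_before in before.items():
        # L2 norm of how far each weight tensor moved during that training run
        delta = (after[name].float() - w_before.float()).norm().item()
        if delta > 0:
            print(f"{name}: weights moved by {delta:.6f} (L2 norm)")

That tells you something changed, but tying the change back to one specific image is exactly the hard part.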
For Stable Diffusion, I think the number of bits in the model divided by the number of training images is on the order of 6-8 bits per image. There is no "storage" of the training images. It's 250 TB of data in, and roughly 1.4 GB in the weight file, depending on the precision. I think those 250 TB are compressed as well, so maybe 25,000 TB of raw data distilled down to 1.4 GB. I'm fairly certain you could never prove an AI saw your image. You'd have to sue the company and look at their training data through discovery.
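Rough arithmetic behind that figure; the 1.4 GB is the weight-file size from above, and the image count is my assumption (roughly the 2-billion-image LAION scale SD is reported to have trained on):

    # Back-of-envelope: bits of model capacity per training image.
    model_bytes = 1.4e9   # ~1.4 GB weight file
    num_images = 2e9      # assumed training set size, ~LAION-2B scale

    bits_per_image = model_bytes * 8 / num_images
    print(f"~{bits_per_image:.1f} bits per training image")  # ~5.6 bits

Single-digit bits per image either way, i.e. nowhere near enough to store the images themselves.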
There are probably pathological cases where an image that repeats in the training data is more strongly overfit and could be reproduced in much more detail than this average suggests, though. But these systems learn similarly to the human brain: they learn the gist of a style or scene and how it relates to words. It's not a search engine; it doesn't copy/paste any block of pixels...
One interesting example: since SD's original training set included some watermarked stock photos, it learned that there is a concept of a watermark, which can end up in the middle of generated images. Not in an intelligible way, but you can see roughly how it interpreted this detail. And in those cases you DO have a very similar, frequently repeated pixel bitmap in the training data.