@Hexarei

Hexarei@programming.dev · 2 months ago

Are you objectifying me??

Most people pay for that privilege

Hexarei@programming.dev · 1 year ago

Autism+ADHD life, I can’t stand to have emails in my inbox for more than a day and I also can’t be diligent enough to achieve that

Hexarei@programming.dev · 1 year ago

I’m gonna stop responding to this asinine thread now before you continue to demean us both with your nonsense.

Hexarei@programming.dev · 1 year ago

Simpler language is fine when it’s accurate.

Your simplification is inaccurate and could mislead people into thinking GPTs are just advanced regex matching engines.

They are not. They are closer to autocorrect on steroids.

Hexarei@programming.dev · 1 year ago

Analysis. It uses it, but not by “matching it”. The training data is not included in the final model. No GPT can access its training data at runtime.

Training analyzes the contents of the training data and creates a statistical model representing the likelihoods of various tokens based on a complex series of mathematical transformations that encode various attributes of the tokens making up the training data.

3Blue1Brown has a great series on the actual math behind it, I would highly recommend educating yourself on what GPTs actually do. It’s way more interesting than simple matching.

Hexarei@programming.dev · edit-2 1 year ago

You said it matches text to its training data, which it does not do.

Your single-phrase statement only works for very short, non-repetitive phrases. As soon as your phrase repeats a token more than a few times, the statistics for the tokens change and could result in nonsensical output that repeats through subsections of the training data.

And even then, for that single non-repetitive phrase, the reason you would get the phrase back is not that the model is “matching on” it. It is because the token weights would effectively encode that the statistical likelihood of the “next token” in the generated output is 100% for a given token when the evaluated token precedes it in the training phrase. Or in other words: your training data being a single phrase manipulates the statistics so that the most likely output is that single phrase.

However, that is a far cry from simple “matching” against the training data. Which is what you said it does.
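If it helps, here's a rough sketch of that point using a toy bigram counter. To be clear, this is nowhere near how a real GPT works (no embeddings, no attention, just next-token counts), and the names `train_bigram` and `generate` and both example phrases are made up for illustration. It only shows that the output comes from learned statistics, not from looking anything up in the training text.

```python
from collections import Counter, defaultdict

def train_bigram(tokens):
    """Count which token follows which, then normalise counts to probabilities."""
    counts = defaultdict(Counter)
    for current, nxt in zip(tokens, tokens[1:]):
        counts[current][nxt] += 1
    return {
        tok: {nxt: n / sum(followers.values()) for nxt, n in followers.items()}
        for tok, followers in counts.items()
    }

def generate(model, start, max_tokens=10):
    """Greedy decoding: always emit the most likely next token."""
    out = [start]
    while len(out) < max_tokens and out[-1] in model:
        followers = model[out[-1]]
        out.append(max(followers, key=followers.get))
    return out

# Hypothetical single, non-repetitive training phrase.
phrase = "the quick brown fox jumps".split()
model = train_bigram(phrase)
print(model["quick"])          # every next-token probability is 1.0
print(generate(model, "the"))  # reproduces the phrase from statistics alone

# Hypothetical phrase that repeats a token: the statistics for "the" now split.
repetitive = "the cat saw the dog chase the cat".split()
model2 = train_bigram(repetitive)
print(model2["the"])
print(generate(model2, "the", max_tokens=8))  # loops through a subsection
```

Note that the training text is thrown away after `train_bigram` returns; `generate` only ever sees the probability table.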

Hexarei@programming.dev · 1 year ago

They do not store anything verbatim; they instead store the directions in which various words and related concepts relate to one another in some gigantic multidimensional space.
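To make “directions” a bit more concrete, here's a toy sketch. The 2-D vectors below are hand-invented for illustration (real models learn vectors with thousands of dimensions, and the axes don't have tidy meanings like these), and `cosine` and `analogy` are just helper names I picked.

```python
import math

# Hypothetical 2-D "embeddings": axis 0 ~ royalty, axis 1 ~ gender.
# These numbers are invented, not taken from any real model.
vectors = {
    "man": (0.0, 1.0),
    "woman": (0.0, -1.0),
    "king": (1.0, 1.0),
    "queen": (1.0, -1.0),
    "prince": (0.5, 1.0),
    "princess": (0.5, -1.0),
}

def cosine(a, b):
    """Cosine similarity: how closely two vectors point in the same direction."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.hypot(*a) * math.hypot(*b))

def analogy(a, b, c):
    """Return the word closest to vec(a) - vec(b) + vec(c), excluding the inputs."""
    target = tuple(x - y + z for x, y, z in zip(vectors[a], vectors[b], vectors[c]))
    candidates = (w for w in vectors if w not in (a, b, c))
    return max(candidates, key=lambda w: cosine(vectors[w], target))

# Moving along the "man -> woman" direction from "king" lands nearest "queen".
print(analogy("king", "man", "woman"))
```

Nothing in `vectors` is a stored sentence; the relationships fall out of where the points sit relative to each other.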

I highly suggest you go learn what they actually do before you continue talking out of your ass about them

Hexarei@programming.dev · 1 year ago

That’s not how GPTs work

Hexarei@programming.dev · 1 year ago

“Today I learned learned”

Hexarei@programming.dev · 1 year ago

Bro we promise bro, we’re deleting the data - We know bro, you thought we didn’t collect it but bro we’re deleting it we promise now we’re cool bro just keep using it bro we don’t collect more data bro we promise

Hexarei@programming.dev · 1 year ago

Meanwhile, for my homelab I just use split DNS and a (properly registered+set up) .house domain - But that’s because I have services that I want to have working with one name both inside and outside of my network

Hexarei@programming.dev · 1 year ago

Yep, as someone who just recently set up a hyperconverged mini Proxmox cluster running Ceph for a Kubernetes cluster atop it, storage is hard to do right. Wasn’t until after I migrated my minor services to the new cluster that I realized that volumes from Ceph’s RBD CSI can’t be mounted by multiple pods at once, so having replicas of something like Nextcloud means I’ll have to use object storage instead of block storage. I mean, I can do that, I just don’t want to lol. It also heavily complicates installing apps into Nextcloud.

Hexarei@programming.dev · 1 year ago

Certbot also does DNS challenge, fwiw

Hexarei@programming.dev · 1 year ago

DNS challenge makes it even easier, since you don’t have to go through the process of transferring it yourself

Hexarei@programming.dev · 1 year ago

Worth mentioning: Anyone using TachiyomiJ2K (I use it for Surface Duo dual-screen support) or another fork with support who has some self-hosting prowess, there’s always Suwayomi - It will let you “migrate” to a third-party sources repo even if your app doesn’t support it, since it becomes your device’s only local extension.

Hexarei@programming.dev · 1 year ago

https://youtu.be/oCACBp1OLOw?si=1aBSRWdCE97ggA4p

Hexarei@programming.dev · 1 year ago

Others have addressed the root and trust questions, so I thought I’d mention the “mess” question:

Even the messiest bowl of ravioli is easier to untangle than a bowl of spaghetti.

The mounts/networks/rules and such aren’t “mess”, they are isolation. They’re commoditization. They’re abstraction - Ways to tell whatever is running in the container what it wants to hear, so that you can treat the container as a “black box” that solves the problem you want solved.

Think of Docker containers less like pets and more like cattle, and it very quickly justifies a lot of that stuff because it makes the container disposable, even if the data it’s handling isn’t.

Hexarei@programming.dev · 1 year ago

Ah, neat! I just looked it up and it does look useful.

I’ve never really had any trouble with Dark Reader speed-wise - though it gives one major bonus that no other extension has so far: attempting to match the appearance of darkened websites to my system theme (Catppuccin)

Hexarei@programming.dev · 1 year ago

I can’t tell if you’re agreeing with me, disagreeing with me, or suggesting some alternative

Hexarei@programming.dev · 1 year ago

I highly recommend the Dark Reader extension for your browser