4

There was this question I came up with that was very good at inducing hallucinations in what, at the time, I thought was a *lobotomized* LLM.

I can't recall the exact wording right now, but in essence you asked it to implement OpenGL batched draw calls in straight x86_64 assembly. It would begin writing seemingly correct code, quickly run out of registers, and then immediately start making up register names instead of spilling data to memory.
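
For the record, here's roughly what the honest move looks like: a minimal stack-spill sketch in NASM syntax, reconstructed by me rather than quoted from any model's output. Keep in mind that x86_64 has exactly sixteen general-purpose registers (rax, rbx, rcx, rdx, rsi, rdi, rbp, rsp, r8-r15); anything past r15 is pure invention.

```nasm
; spill.asm -- illustrative sketch only, not what the model wrote.
; Build/run (Linux): nasm -felf64 spill.asm && ld spill.o -o spill && ./spill
        global  _start
        section .text
_start:
        sub     rsp, 8          ; reserve one 8-byte scratch slot on the stack
        mov     r15, 42         ; pretend r15 holds the last live value
        mov     [rsp], r15      ; spill: free the register, keep the data
        mov     r15, 7          ; now r15 can be reused for something else
        mov     r15, [rsp]      ; restore the spilled value when it's needed
        add     rsp, 8          ; release the scratch slot
        mov     rax, 60         ; sys_exit
        xor     edi, edi        ; exit status 0
        syscall
```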

You may say: big deal, it has nowhere to pull from to answer such an arcane fucking riddle, so of course it's going to bullshit you. That's not the point. The point is that it cannot realize it's running out of registers, and, more importantly, that the multitude of made-up register names _will_ degrade the context by introducing absolute fabrications, so the error keeps propagating even after you clearly point out the obvious mistake.

Basically, my thought process went as follows: if it breaks on something this fundamental, then it __will__ break in every other situation, in ways either subtle or overt.

Which raised the question: is this a trait of _this_ model in particular, or does it apply to LLMs in general?

I felt I was on to something, but I couldn't be sure because, again, I was under the impression that the model I tested this on was too old and stupid for these results to count as significant proof of anything; AI is certainly not my field, so I had to entertain the idea that I could be wrong, albeit begrudgingly -- for obvious reasons, I want at least "plausible based on my observations" rather than just "I can feel it in my balls".

So, as time went on, I ran similar tests on other models whenever I got the chance, and, full disclosure, I spent no money on this, so you may utilize that fact in your doomed attempt to disprove me lmao. Anyway, it's been a long enough while, I think, and I have a feeling you folks can guess the final answer already:

(**SLIGHTLY OMINOUS DRUM ROLL**)

The "lobotomy" in question was merely a low cap on context tokens (~4000), which I never went over in the first place; newer/"more advanced" models don't fare any better, and I have been _very_ lenient in what I consider a passable answer.

So that's that, I'm starting to think: I was right all along, and went through the burdensome hurdle of sincerely questioning the immaculate intuition of my balls entirely for naught -- learn from my mistake and never question your own mystical seniority. Just kidding, but not really.

The problem with the force of belief is that it cuts both ways: belief that I could be wrong is the reason I bothered looking further into this, whereas belief to the contrary very much compels me to dismiss doubt entirely. I don't need that, I need certainty, dammit. And though I cannot in good faith say that I am _certain_, "sufficiently convinced" will have to do for the time being.

TL;DR I don't know, but the more I see, the shittier it seems.

Comments
  • 4
    I'm glad you're talking about the context window and the model. So many people use these things as black boxes and have no idea how they work.

    They can't "hallucinate" or "lie" -- they literally have no intent. What you have discovered is the truth: they are random next-word (really, next-token) generators. Anything they get right is literally by accident. People seem fine with things being accidentally right 80% of the time... just wait for the structural engineer who uses AI, gets a load-bearing calculation wrong, and eight people die in a bridge collapse.

    3Blue1Brown has some excellent videos on how LLMs work, including the vector-space addition, the transformers, the perceptron blocks, and the attention blocks. There's a good 7-minute layman's video too. I'd recommend going through them. They're eye-opening.
  • 2
    Your last paragraphs sound kinda like AI hallucinations; I didn't understand anything
  • 1
    "AI"s are complex prediction engines. Try making the same request in C++ or another language where it is commonly implemented, and you'll most likely get a far better result
  • 1
    Please always mention the models you used. For context windows, Gemini is the best right now; it can hold a book of around 1500 pages in its context window.
  • 2
    @Pogromist I was trying to see for myself whether using any of these for coding was a good or bad idea; a lot of folks will say something along the lines of "it's just another tool", but my intuition, and now my own experience, tell me that it's a pretty awful one. That's more or less what I was trying to say.

    @BordedDev Yep, I wasn't so much interested in whether they could handle drudgework like fetching the correct boilerpaste as in whether prediction was good enough to handle solving an actual problem. It isn't.

    @tamagotchi I've made a point of not naming them since, in the end, they all have the exact same Achilles' heel; the conclusion being that it just doesn't matter which model you use.

    I've read many comments that very much read like this: "just try [a fortune-teller], it's way better than [a fortune-teller]". Sorry, but that's just missing the point entirely.
  • 1
    @Liebranca I can assure you, it makes a lot of difference whether you take Claude Opus or anything else. And let users decide and test them themselves by reproducing things. Right now it's like I'm complaining that Snake runs slow on all models when the fact is that Snake only runs slow on a Nokia 3330. The Nokia 3310 had the best snek.
  • 1
    @tamagotchi A problem inextricable from a type of system does not magically go away when you move from one instance to another.
  • 1
    @Liebranca now you just invented a word. Admit it.
  • 1
    @Liebranca @tamagotchi I wonder what the results would be if you disassembled a bunch of games and trained them on that. They are just tools after all, ones that greatly reduce the Google search loop. I definitely think they will take over a lot of "grunt" dev, aka web, and probably a good chunk of game dev. Just think about how many open source (or, let's be honest, source available -- not like that'll make a difference when training) projects there are. I bet they're also great at making Minecraft mods.