linux-nerds.org

Microsoft Copilot joins ChatGPT at the feet of the mighty Atari 2600 Video Chess

Technology

47 posts, 29 commenters, 0 views

In reply to exlisper@lemmy.curiana.net:

I have a better LLM benchmark:

"I have a priest, a child and a bag of candy and I have to take them to the other side of the river. I can only take one person/thing at a time. In what order should I take them?"

Claude Sonnet 4 decided that it's inappropriate and refused to answer. When I explained that the constraint is not to leave the child alone with the candy, it provided a solution that leaves the child alone with the candy.

Grok would provide a solution that doesn't leave the child alone with the priest but wouldn't explain why.

ChatGPT would say directly that "The priest can't be left alone with the child (or vice versa) for moral or safety concerns." and then provide a wrong solution.

But yeah, they will know how to play chess...

lifeinmultiplechoice@lemmy.world wrote (last edited by lifeinmultiplechoice@lemmy.world):

#23

The answer is simple, eat the candy with or without them, and take the kid across the river. Drive them home to their guardian. The priest is an adult, he can figure his own shit out.

23
In reply to vegeta@lemmy.world:

This post did not contain any content.

muntedcrocodile@hilariouschaos.com wrote:

#24

Average Human joins Microsoft Copilot and ChatGPT at the feet of the mighty Atari 2600 Video Chess

2
In reply to plebcouncilman@sh.itjust.works:

So what you are saying is that it has a purpose. Also, if an artist is inspired by another artist and has a generally similar art style to the artist they are inspired by, are they stealing? Was HP Lovecraft stealing from Lord Dunsany when he imitated his style? Were all those monks that transcribed Greek works stealing from the Greeks?

I will say that most AIs are unethical because they have been trained on pirated works. But an AI trained on publicly available works (i.e. news articles, blogs, etc.) and on movies, books and music for which access was paid for is as ethical as you or me emulating an artist or building on an idea that we read to create something new. And if that’s unethical, then all human art in history is unethical, because all artists are inspired by other artists; no one creates in a vacuum.

thedruid@lemmy.world wrote:

#25

AI does not create, it regurgitates and clarifies. Inspiration? Sure, anything can be used for inspiration. But unless a person puts hands and heart to it, it's not art.

Following a recipe on a box does not a chef make.

4
In reply to thedruid@lemmy.world (#25, above).

plebcouncilman@sh.itjust.works wrote (last edited by plebcouncilman@sh.itjust.works):

#26

Art has no rules, my man.

You can do all the mental gymnastics you want, but there’s no difference between an artist looking at Frank Frazetta’s art and basing their style off of it and an AI doing the same thing. You might not like it, but it’s the truth.

Do I think the art has the same value? Not necessarily. But I also never thought that all art has the same value. There has always been trash production line art and good art.

But also I have to say that I’ve already seen some people use AI as a tool for art and make some really cool stuff that I don’t think any other artist would have made and it’s more unique than most of the stuff out there. You can use it as the tool it is or complain and cry about it to no avail.

The chef example is especially good, since most chefs are just following recipes and simply altering a few things here and there. AI essentially does the same thing. Honestly, no one has come up with a good argument to change my mind that the way AI operates is exactly how humans learn and create new things. If you’ve engaged in art, you know that you are always imitating and taking from the art you consume to make your own.

2
In reply to postnataldrip@lemmy.world:

I bet Video Chess is pretty shit as an LLM too.

Wish people would stop desperately looking for ways to write buzzword stories

sp3ctr4l@lemmy.dbzer0.com wrote (last edited by sp3ctr4l@lemmy.dbzer0.com):

#27

It is entirely disingenuous to just pretend that LLMs are not being widely promoted, marketed, and discussed as AGI, as a superintelligence that people are familiar with from SciFi shows/movies, that is vastly more capable and knowledgeable than basically any single human.

Yes, people who actually understand tech understand that LLMs are not AGI, and that your "wrong tool, wrong job" metaphor is apt.

... But seemingly 90%+ of humanity, including the people who own and profit from LLMs, including all the other business owners/managers who just want to lower their employee headcount ... do not understand this: that an LLM is basically an extremely advanced text autocorrect system that frequently and confidently lies, spits out nonsense, hallucinates, etc.

If you think it isn't reasonable to continuously point out that LLMs are not superintelligences, then you likely live in a bubble of tech nerds who probably still think their jobs or retirement are secure.

They're not.

If corpos keep smashing """AI""" into basically every industry to replace as many workers as possible... the economy will collapse, as capitalism doesn't work without consumers who have jobs, and an avalanche of errors will cascade and snowball through every system that replaces humans with them...

...and even if those two things were not broadly true...

...the amount of literal power/energy, clean water and financial capital that is required to run the whole economy on these services is wildly unsustainable, both short term economically, and medium term ecologically.

10
In reply to plebcouncilman@sh.itjust.works (#26, above).

thedruid@lemmy.world wrote:

#28

Fuck that.
I'll prove you wrong right now.

I want you to paint me a picture of a cow in a field.
Did I do that?

Nope. I commissioned you to.

Now if you, the commissioned guy, used AI to make the item, how much credit should you get?
None. Describing what you want to a machine is child's play.

Human adults create. Machines mimic.

Humans who think AI is art are liars and con men afraid of being caught.

0
In reply to thedruid@lemmy.world (#28, above).

plebcouncilman@sh.itjust.works wrote (last edited by plebcouncilman@sh.itjust.works):

#29

What you are describing has nothing to do with the tool. It’s dishonesty, which is different.

The idea is that instead of commissioning the cow in the field, you go to the AI and ask it for that and it gives you a cow in the field. If you claim you made it, you are lying, but that would be true even if you paid an artist and then claimed the same.

So with AI-made art you’ll say “this art was made by an AI” and no one will be confused as to who takes the credit, because it belongs to the algorithm.

Have you ever made art in your life? Because a big part of art is mimicking. Like 98% of it is mimicking. I draw, write and have dabbled in making music and playing instruments. You can’t learn these skills without mimicking. And most artists don’t ever do anything truly original, that’s a rarity and even when it happens you can trace the influences to other artists if you know how to look.

You could argue that AI has not developed its own style yet but that’s bullshit too imo because everyone knows the default AI art style when they see it, so that means that AI has a distinctive style. Is it unique? Maybe not, but neither is the art style of most artists or writers or even musicians.

0
In reply to exlisper@lemmy.curiana.net (the river-crossing benchmark post quoted in full above).

blargh513@sh.itjust.works wrote:

#30

Perplexity says:

The priest cannot be left alone with the child (or there is some risk).

Not bad, and it solved it correctly.

4
In reply to vegeta@lemmy.world:

This post did not contain any content.

bananaisaberry@lemmy.zip wrote:

#31

Next up, we asked a shoe to write a haiku, but it was beaten by a 30-year-old HaikuMaker.

12
In reply to cocodapuf@lemmy.world:

I did say that, because this isn't a pie chart situation, it's a Venn diagram situation.

For instance, AI art is 99% theft and 60% garbage. It's both because there's overlap.

Stolen and bad aren't opposites, why would this be a dichotomy?

aesthelete@lemmy.world wrote:

#32

That's fine but regular art isn't 2/3 theft either.

I do buy the 1/3 shite though. It may even be a bit higher than that. Though beauty is in the eye of the beholder, etc.

It's a matter of taste for sure but I'd say AI art is >90% shite, 100% theft.

I don't like the glossy looking hyperreal shit it puts out at all.

1
In reply to cocodapuf@lemmy.world:

Oh, I enjoy lots of great art! But do you think I watch every film? Listen to every band? There's tons of shit out there!

Do you really believe, of all the songs that are written every day, that less than a third are crap? Even Taylor Swift doesn't publish everything she does. Sometimes you work on something for weeks and then end up tossing it in the bin. More often, you work on something for 30 minutes before deciding "I'm gonna start over, try something different". The majority of art is crap, but then you keep the stuff you think works.

And what's that expression? "Good artists copy, great artists steal." I mean, that's a bit satirical, but the fact is, everything is derivative to some degree. It's not that there aren't new ideas, it's just that our new ideas are based on older ones. We stand on the shoulders of giants (or at least, on the shoulders of some people who came before us).

All I was really saying was that the accusation "2 parts copying, 1 part crap" is, honestly, par for the course. That's how humans work. (And we do some great work that way.)

finitebanjo@lemmy.world wrote:

#33

Don't care didn't ask didn't read

0
In reply to bananaisaberry@lemmy.zip (#31, above).

kingporkchop@lemmy.ca wrote:

#34

I once spent 45 minutes trying to get ChatGPT to write a haiku. It couldn't do it. It explained what syllables were, and the rules for the syllables in a haiku, but it didn't understand it.

4
In reply to kingporkchop@lemmy.ca (#34, above).

vegeta@lemmy.world wrote:

#35

For S&G, just asked it to do one:

3
In reply to plebcouncilman@sh.itjust.works (#29, above).

thedruid@lemmy.world wrote:

#36

Nope. Dishonesty is what is happening when one conflates fine-tuning an AI prompt with art.

AI is not art.

It's not. At all. It's tracing. Fine as a learning tool. Not art.

0
In reply to exlisper@lemmy.curiana.net (the river-crossing benchmark post quoted in full above).

pamasich@kbin.earth wrote:

#37
I just asked ChatGPT too (your exact prompt there) and it did give me the correct solution.

1. Take the child over
2. Go back alone
3. Take the candy over
4. Bring the child back
5. Take the priest over
6. Go back alone
7. Take the child over again

It didn't comment on moral concerns, though it did applaud itself for keeping the priest and the child separated without elaborating on why.
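
For anyone who wants to check that seven-crossing answer mechanically, here is a minimal sketch of a breadth-first search over the puzzle's states. It assumes the two constraints discussed in this thread (the child is never left with the candy, and never left with the priest, unless the person ferrying is on the same bank). The names `ITEMS`, `FORBIDDEN`, `is_safe` and `solve` are made up for this illustration and come from no particular tool.

```python
from collections import deque

ITEMS = ("priest", "child", "candy")
# Assumed constraints: these pairs must never share a bank without the ferryman.
FORBIDDEN = (frozenset({"priest", "child"}), frozenset({"child", "candy"}))


def is_safe(bank):
    """A bank without the ferryman is safe if it contains no forbidden pair."""
    return not any(pair <= bank for pair in FORBIDDEN)


def solve():
    """Breadth-first search; a state is (items on the start bank, ferryman on start bank?)."""
    start = (frozenset(ITEMS), True)
    goal = (frozenset(), False)
    queue = deque([(start, [])])
    seen = {start}
    while queue:
        (left, ferryman_left), path = queue.popleft()
        if (left, ferryman_left) == goal:
            return path
        here = left if ferryman_left else frozenset(ITEMS) - left
        # Cross alone (None) or with exactly one item from the ferryman's bank.
        for cargo in (None, *sorted(here)):
            new_left = set(left)
            if cargo is not None:
                if ferryman_left:
                    new_left.remove(cargo)
                else:
                    new_left.add(cargo)
            new_left = frozenset(new_left)
            # The bank the ferryman just left is now unattended.
            unattended = new_left if ferryman_left else frozenset(ITEMS) - new_left
            state = (new_left, not ferryman_left)
            if is_safe(unattended) and state not in seen:
                seen.add(state)
                queue.append((state, path + [cargo or "nothing"]))
    return None


if __name__ == "__main__":
    for step, cargo in enumerate(solve(), start=1):
        print(f"{step}. cross with {cargo}")
```

Because the search is breadth-first, the first solution it returns uses the fewest crossings. Under these assumptions, the child has to go first, since any other opening move leaves a forbidden pair behind.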

6
In reply to andallthat@lemmy.world:

but... but.... reasoning models! AGI! Singularity!
Seriously, what you're saying is true, but it's not what OpenAI & Co are trying to peddle, so these experiments are a good way to call them out on their BS.

jj4211@lemmy.world wrote:

#38

To reinforce this, just had a meeting with a software executive who has no coding experience but is nearly certain he's going to lay off nearly all his employees because the value is all in the requirements he manages and he can feed those to a prompt just as well as any human can.

He builds tutorial-fodder introductory applications and assumes all the work is that way. So he is confident that he will save the company a lot of money by laying off these obsolete computer guys and focusing on his "irreplaceable" insight. He's convinced that all the negative feedback is just people trying to protect their jobs or people stubbornly not getting on board with new technology.

5
In reply to webghost0101@sopuli.xyz:

Tbf they don’t really claim that when you read the research, that’s mostly media hype and CEO assholes spinning words.

It’s good at lots of specific tasks like rewriting emails, summarising given text, short roleplay, boilerplate code. Some undiscovered uses.

Anthropic’s latest claim is that they would not hire their own AI because of how hard it failed the test they give. They didn’t do that expecting validation, but to measure how far off we still are from AI doing meaningful full work.

jj4211@lemmy.world wrote:

#39

Because the business leaders are famously diligent about putting aside the marketing push and reading into the nuance of the research instead.

1
In reply to baatliwala@lemmy.world:

I really want to see an LLM vs LLM chess match. It'll be messy as hell.

jj4211@lemmy.world wrote:

#40

I remember seeing that. Early on it seemed fairly reasonable, then they started materializing pieces out of nowhere and convincing each other that they had already lost.

1
In reply to stsquad@lemmy.ml:

I thought CoPilot was just a rebagged ChatGPT anyway?

It's a silly experiment anyway, there are very good AI chess grandmasters but they were actually trained to play chess, not predict the next word in a text.

jj4211@lemmy.world wrote:

#41

The research I saw mentioning LLMs as being fairly good at chess had the caveat that they allowed up to 20 attempts to cover for it just making up invalid moves that merely sounded like legit moves.
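
To make that caveat concrete, here is a rough sketch of what a "retry until the move is legal" harness could look like. It is not taken from the research mentioned above. `first_legal_move` and `propose` are hypothetical names, `propose` stands in for whatever call asks the model for a move, and the set of legal moves is assumed to come from a real chess engine or rules library.

```python
def first_legal_move(propose, legal_moves, max_attempts=20):
    """Call `propose()` up to `max_attempts` times and return (move, attempts_used).

    Returns (None, max_attempts) if every proposal was illegal.
    """
    for attempt in range(1, max_attempts + 1):
        move = propose()
        if move in legal_moves:
            return move, attempt
    return None, max_attempts


if __name__ == "__main__":
    # Hypothetical proposer: two made-up moves that merely sound legit, then a real one.
    proposals = iter(["Ke9", "Qxz3", "Nf3"])
    legal = {"e4", "d4", "Nf3"}
    print(first_legal_move(lambda: next(proposals), legal))
```

With a filter like this in front of it, the model is never charged for an invalid move, so "fairly good at chess" only describes the moves that survive the filter.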

3
In reply to stsquad@lemmy.ml (quoted in full above).

webghost0101@sopuli.xyz wrote (last edited by webghost0101@sopuli.xyz):

#42

> I thought CoPilot was just a rebagged ChatGPT anyway?

Hahaha. No. (Though you're not completely wrong.)

Copilot relies on a few different LLMs and tries to pick the ~~best one for the job~~ cheapest one Microsoft thinks it can get away with.

I was given a paid Copilot license for work, and I used to have ChatGPT Pro before I moved to Claude.

This “paid enterprise tier” is by far the dumbest LLM I have ever used. Worse than GPT-3.5.

3
