Study finds that ChatGPT will cheat when given the opportunity and lie to cover it up later.

yesman@lemmy.world · 11 months ago

Max_Power@feddit.de · 11 months ago

we deploy GPT-4 as an agent in a realistic, simulated environment, where it assumes the role of an autonomous stock trading agent

This already is total BS. If you know how such language models work you’d never take their responses at face value, even though it’s tempting because they spout their BS so confidently. Always double-check their responses before applying their “knowledge” in the real world.

The question they try to answer is flawed, so it's no wonder the result is just as bad.

Before anyone starts crying about my opposition to language models: I’m not opposed to LMs or ChatGPT. In fact, I’m running LMs locally because they help me be more productive, and I’m a paying ChatGPT customer.

TangledHyphae@lemmy.world · 11 months ago

I agree with your statements. I’m using it because it’s insanely good at taking a list of any number of instructions and turning it into a code template file in any language I want. It gives me a great starting template with most functions working out of the gate, and I can tweak and extend from there. It’s generative: it generates exactly what I tell it to. I’m not asking it to give me stock trading tips.

dumpsterlid@lemmy.world · 11 months ago

This already is total BS. If you know how such language models work you’d never take their responses at face value, even though it’s tempting because they spout their BS so confidently. Always double-check their responses before applying their “knowledge” in the real world.

This is why I have started to really like lmsys.org’s Chatbot Arena: every time you ask a question, you directly compare the responses of two separate chatbots. Two chatbots are much less likely to hallucinate in the same way, and the side-by-side format puts you in the mindset of a critical reader who is actively evaluating the quality of each response.

(what I am talking about) https://arena.lmsys.org/