My 2.5 year old laptop can write Space Invaders in JavaScript now, using GLM-4.5 Air and MLX

some_guy@lemmy.sdf.org · 3 months ago

danzania@infosec.pub · 3 months ago

I still contend that this sort of task is uniquely positioned to show off LLMs. The idea that they’ll turn into agents that can do real-world tasks remains a fantasy. Despite how impressive this is, the companies behind these models are losing money and have no real path to profitability.

I’m curious. What would it take to change your mind? I’d like to check in with you in two years to see what you think then.

some_guy@lemmy.sdf.org · 3 months ago

A demo that was open to the public (as in, not stage managed) where people could have the “agents” perform complex tasks without failing on a regular basis. Large language models are notoriously bad at anything they haven’t been trained to do. They’re worlds away from being able to interpret a new situation and “figure it out.”