AI worse than humans in every way at summarising information, government trial finds

Stopthatgirl7@lemmy.world · 4 months ago

AI worse than humans in every way at summarising information, government trial finds

UnderpantsWeevil@lemmy.world · edit-2 4 months ago

Are we talking 10% worse and 95% cheaper? Or 50% worse and 10% cheaper? Or 90% worse and 95% cheaper?

Because that last one is good enough for fiscal conservatives. Hell, the second one is good enough for fiscal conservatives.

dreaddynaughty@lemmynsfw.com · 4 months ago

The linked pdf lists the deficiencies of the LLM responses. They are varied and it sometimes misses the mark completely or cant grasp vital context.

Still pretty useless comparison, they testet 10 university level humans against Llama2-70B. The model has fallen out of use completely by now and was never really great at summarization. The study didnt fine tune it either, so this isnt really representative of the current situation.

There are far better models out, that were either especially trained for summarization or can be easily fine tuned to excel at it. Not to mention the Llama3 and 3.1 series, with the crazy 405B model.

loonsun@sh.itjust.works · 4 months ago

Knowing this it seems like a very low quality study. They should probably redo this with multiple conditions.

Base Llama 3
Tuned Llama 3
Untrained human summarizer
trained/professional human summarizer

UnderpantsWeevil@lemmy.world · 4 months ago

There are far better models out

I’ve heard this refrain a few times. Still waiting for it to pan out.