Apple study exposes deep cracks in LLMs’ “reasoning” capabilities
arstechnica.com
Irrelevant red herrings lead to “catastrophic” failure of logical inference.
Are the uncensored models more capable tho?
Given the use cases they were benchmarking, I would be very surprised if they were any better.