0 bookmark(s) - Sort by: Date ↓ / Title /
This article discusses the limitations of Large Language Models (LLMs) in classification tasks, focusing on their lack of uncertainty and the need for more accurate performance metrics. New benchmarks and a metric named OMNIACCURACY have been introduced to assess LLMs' capabilities in both scenarios with and without correct labels.
First / Previous / Next / Last
/ Page 1 of 0