This paper studies the cultural evolution of cooperation among LLM agents using a variant of the Donor Game, and finds that cooperative behavior differs significantly across base models and initial strategies.
Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models".