SOCK: A Benchmark for Measuring Self-Replication in Large Language Models

arXiv – cs.AI Original
Anzeige

Ähnliche Artikel