Informatik / Digitales
LLMpedia – Materializing an LLM’s Encyclopedic Knowledge at Scale
What does a large language model actually know, and how reliable is that knowledge in long-form text? Benchmarks such as MMLU suggest that modern language models are highly factual, but they only test questions that researchers thought to ask. In the LLMpedia project, researchers generate and evaluate encyclopedia-style articles directly from a model’s parametric memory, making it possible to study what the model knows beyond fixed benchmarks.
Beginn
17:00
Uhr
Ende
00:00
Uhr