jueves, 25 de junio de 2026

LifeSciBench: Evaluating Language Models on Realistic, Expert-Level Tasks in the Life Sciences

https://cdn.openai.com/pdf/b4299379-0a97-4ffa-8b9b-c3fbb299caa9/lifescibench_preprint.pdf?utm_medium=email&_hsenc=p2ANqtz-_AY5V0qtmvggGfGqg0uLsTyg12MRDb48nsJM1keqS6T0pDbzf10fTMew9f8TDIV9SC7azeMrZcV4zuNu9sQlZ0LOTs1A&_hsmi=139221746&utm_content=139230715&utm_source=hs_email

No hay comentarios:

Publicar un comentario