Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models
Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models
Summary
Research Article: Dan Boneh, et al. arxiv.org, 12 April 2025. https://doi.org/10.48550/arXiv.2408.08926.
May
2025
Published : May 20th, 2025 at 04:51 pm
Updated : May 20th, 2025 at 05:09 pm