CROP crop-science multiple-choice question (benchmark ID 5093; difficulty: difficult). The model must select the single correct option (A-D); scored 1 if correct.
Subject outcomes:
- gpt-3.5-turbo-0125 incorrect
- gpt-4-turbo-2024-04-09 incorrect
- claude-3-opus-20240229 incorrect
- qwen-max incorrect
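
The scoring rule stated in the record (1 for selecting the single correct option) can be sketched as below. This is a minimal illustration, not the benchmark's actual scorer: the function name `score_choice`, the normalization of case and whitespace, and the score of 0 for an incorrect answer are assumptions. The answer key is not part of this record, so the example uses placeholder letters.

```python
VALID_OPTIONS = {"A", "B", "C", "D"}


def score_choice(predicted: str, correct: str) -> int:
    """Return 1 if the predicted option matches the correct option, else 0.

    Assumption: an incorrect or unparseable prediction scores 0.
    """
    gold = correct.strip().upper()
    if gold not in VALID_OPTIONS:
        raise ValueError(f"answer key must be one of A-D, got {correct!r}")
    pred = predicted.strip().upper()
    return 1 if pred == gold else 0


# Placeholder letters only; the real answer key is not given in this record.
example_correct = score_choice("b", "B")
example_incorrect = score_choice("C", "B")

# Outcomes as listed in the record: every subject model answered incorrectly.
subject_scores = {
    "gpt-3.5-turbo-0125": 0,
    "gpt-4-turbo-2024-04-09": 0,
    "claude-3-opus-20240229": 0,
    "qwen-max": 0,
}
accuracy = sum(subject_scores.values()) / len(subject_scores)
```

Under these assumptions, all four subject models score 0 on this item, so accuracy across the listed subjects is 0.0.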
