Science & Engineering

VLM4Bio

VLM4Bio: evaluating pretrained vision-language models for trait discovery from biological images (Bird/Butterfly/Fish). This build covers the multiple-choice (selection) species-classification task, where the model picks the correct scientific name from four options given an organism image.

1,242 items
1 subject
100% observed
Apache-2.0 license
biology domain
image modality
text modality

Response matrix

Every model, scored item by item.

Each row is an AI model and each column an item, ordered so the strongest models and easiest items gather toward one corner. 1 subject × 1,242 items, 100% of cells evaluated.
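The ordering described above can be sketched in a few lines. This is a minimal illustration, assuming the matrix is sorted by plain row means (model accuracy) and column means (item solve rate), with the strongest models and easiest items placed at the top-left; the page does not state the exact ordering method or which corner is used, and `order_response_matrix` is a hypothetical helper name.

```python
def order_response_matrix(matrix, subjects, items):
    """Sort rows by mean accuracy and columns by solve rate, both descending.

    `matrix` is a non-empty rectangular list of rows of 0/1 scores,
    one row per subject and one column per item (assumed layout).
    """
    n_rows, n_cols = len(matrix), len(items)
    # Mean score per subject (row) and solve rate per item (column).
    row_means = [sum(row) / n_cols for row in matrix]
    col_rates = [sum(matrix[r][c] for r in range(n_rows)) / n_rows
                 for c in range(n_cols)]
    # sorted() is stable, so ties keep their original relative order.
    row_order = sorted(range(n_rows), key=lambda r: -row_means[r])
    col_order = sorted(range(n_cols), key=lambda c: -col_rates[c])
    ordered = [[matrix[r][c] for c in col_order] for r in row_order]
    return (ordered,
            [subjects[r] for r in row_order],
            [items[c] for c in col_order])


# Toy example with made-up subjects and items (not data from this benchmark).
ordered, ordered_subjects, ordered_items = order_response_matrix(
    [[0, 1, 0], [1, 1, 0]],
    ["model_a", "model_b"],
    ["q1", "q2", "q3"],
)
```

With a single subject, as in this build, the row order is trivial and only the column ordering has any effect.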

VLM4Bio response matrix: AI models (rows) against items (columns)
Correct (1) · Incorrect (0) · Unobserved

Scale: 1 = correct · 0 = incorrect

Sample items

What the questions look like — and how subjects answer.

A spread of items across the difficulty range, each shown with a few subjects' actual answers.
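Most of the sample prompts in this section share one template: a question naming the organism type, four lettered options, and an instruction to write the answer after a fixed phrase. The sketch below reconstructs that template from the prompts shown on this page. `build_prompt` and its parameters are hypothetical names, and the benchmark's own prompt-generation code may differ.

```python
def build_prompt(organism, options, preamble=""):
    """Build a four-option species-classification prompt.

    `organism` is e.g. "bird" or "fish"; `options` is a list of four
    scientific names; `preamble` is an optional leading sentence.
    """
    letters = "ABCD"
    if len(options) != len(letters):
        raise ValueError("expected exactly four options")
    listed = ", ".join(f"{letter}) {name}"
                       for letter, name in zip(letters, options))
    question = (
        f"What is the scientific name of the {organism} in the image? "
        f"Options: {listed}. "
        "Write the answer after writing 'The answer is: '."
    )
    return f"{preamble} {question}" if preamble else question


prompt = build_prompt(
    "bird",
    ["Setophaga discolor", "Setophaga citrina",
     "Setophaga castanea", "Setophaga pensylvanica"],
)
```

The optional `preamble` covers prompts such as Item 9, which prepend an explanatory sentence about scientific names before the question.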

Item 1 · 0% solve rate · answer: C) Setophaga castanea

What is the scientific name of the bird in the image? Options: A) Setophaga discolor, B) Setophaga citrina, C) Setophaga castanea, D) Setophaga pensylvanica. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b incorrect

    The answer is: A) Setophaga discolor.
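Because the prompt asks the model to write its choice after 'The answer is: ', a response like the one above can be scored by extracting the option letter and comparing it with the answer key. The following is a rough sketch of that idea, not the benchmark's actual scoring rule, which this page does not describe. `extract_choice` and `score` are hypothetical helpers.

```python
import re

# Matches the option letter that follows the fixed answer phrase,
# e.g. "The answer is: A) Setophaga discolor."
ANSWER_PATTERN = re.compile(r"The answer is:\s*\(?([A-D])\)", re.IGNORECASE)


def extract_choice(response):
    """Return the chosen letter (A-D), or None if no answer is found."""
    match = ANSWER_PATTERN.search(response)
    return match.group(1).upper() if match else None


def score(response, correct_letter):
    """Return 1 if the extracted letter matches the key, else 0."""
    return int(extract_choice(response) == correct_letter)


item_1 = score("The answer is: A) Setophaga discolor.", "C")  # incorrect
item_9 = score("The answer is: B) Esox americanus", "B")      # correct
```

Responses that omit the phrase or the letter yield `None` and are scored 0 here; a stricter or more lenient policy is a design choice this sketch leaves open.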

Item 2 · 0% solve rate · answer: D) Notropis dorsalis

What is the scientific name of the fish in the image? Options: A) Lepomis auritus, B) Percina phoxocephala, C) Alepes djedaba, D) Notropis dorsalis. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b incorrect

    The answer is: A) Lepomis auritus.

Item 3 · 0% solve rate · answer: A) Notropis aguirrepequenoi

What is the scientific name of the fish in the image? Options: A) Notropis aguirrepequenoi, B) Caranx sexfasciatus, C) Catostomus catostomus, D) Notropis braytoni. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b incorrect

    The answer is: B) Caranx sexfasciatus.

Item 4 · 0% solve rate · answer: D) Esox lucius

What is the scientific name of the fish in the image? Options: A) Crenimugil crenilabis, B) Atrosalarias holomelas, C) Notropis percobromus, D) Esox lucius. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b incorrect

    The answer is: A) Crenimugil crenilabis.

Item 5 · 0% solve rate · answer: C) Cottus bairdii

What is the scientific name of the fish in the image? Options: A) Notropis heterodon, B) Notropis buchanani, C) Cottus bairdii, D) Notropis buccatus. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b incorrect

    The answer is: D) Notropis buccatus.

Item 6 · 0% solve rate · answer: D) Morone chrysops

What is the scientific name of the fish in the image? Options: A) Fundulus zebrinus, B) Neogobius melanostomus, C) cyprinella whipplei, D) Morone chrysops. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b incorrect

    The answer is: B) Neogobius melanostomus

Item 7 · 0% solve rate · answer: A) Parablennius yatabei

What is the scientific name of the fish in the image? Options: A) Parablennius yatabei, B) Acropoma hanedai, C) Notropis cummingsae, D) Rhinichthys obtusus. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b incorrect

    The answer is: D) Rhinichthys obtusus.

Item 8 · 0% solve rate · answer: A) Cyprinella formosa

What is the scientific name of the fish in the image? Options: A) Cyprinella formosa, B) Takifugu alboplumbeus, C) Lepomis cyanellus, D) Notropis nazas. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b incorrect

    The answer is: C) Lepomis cyanellus

Item 9 · 100% solve rate · answer: B) Esox americanus

Each biological species has a unique scientific name composed of two parts: the first for the genus and the second for the species within that genus. What is the scientific name of the fish in the image? Options: A) Notropis hudsonius, B) Esox americanus, C) Notropis buccatus, D) Lepomis humilis. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b correct

    The answer is: B) Esox americanus

Item 10 · 100% solve rate · answer: D) Praealticus tanegasimae

What is the scientific name of the fish in the image? Options: A) Carassius carassius, B) Cottus asperrimus, C) Ilisha , D) Praealticus tanegasimae. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b correct

    The answer is: D) Praealticus tanegasimae.

Item 11 · 100% solve rate · answer: A) Notropis bifrenatus

What is the scientific name of the fish in the image? Options: A) Notropis bifrenatus, B) Lepomis symmetricus, C) Lepomis F2, D) Ameiurus melas. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b correct

    The answer is: A) Notropis bifrenatus.

Item 12 · 100% solve rate · answer: B) Lepomis macrochirus

What is the scientific name of the fish in the image? Options: A) Percopsis transmontana, B) Lepomis macrochirus, C) Sardinella gibbosa, D) Thryssa baelama. Write the answer after writing 'The answer is: '.

How subjects answered

  • llava-v1.5-7b correct

    The answer is: B) Lepomis macrochirus

Subjects

The models, agents, and reward models evaluated.

1 subject, ranked by mean response (accuracy) across this benchmark's items.

  1. llava-v1.5-7b · 0.376
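The ranking score is the mean of a subject's 0/1 responses over all items. As a small worked sketch: the count of 467 correct items used below is an inference, not a figure stated on this page. It is the only whole number of correct answers out of 1,242 that rounds to the reported 0.376. `mean_accuracy` is a hypothetical helper name.

```python
def mean_accuracy(responses):
    """Mean of a subject's 0/1 item scores."""
    if not responses:
        raise ValueError("no responses to average")
    return sum(responses) / len(responses)


# Assumed split: 467 correct and 775 incorrect items (467 + 775 = 1,242).
# The 467 figure is inferred from the reported 0.376, not read from the page.
responses = [1] * 467 + [0] * 775
accuracy = round(mean_accuracy(responses), 3)
```

With only one subject in this build, each item's solve rate is the subject's own score on it, which is why every sample item shows either 0% or 100%.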