it is, however, recognised that the downside risks to growth remain high
How subjects answered
- Llama-3-70b-chat incorrect
ModelResponse(id='nscCssn-4yUbBN-93a2d192afe02d0a', created=1746307397, model='together_ai/meta-llama/Llama-3-70b-chat-hf', object='chat.completion', system_fingerprint=None, choices=[Choices(finish_reason='stop', index=0, message=Message(content='{\n"label": "uncertain",\n"justification": "The sentence contains the phrase 'remain high', which implies a degree of uncertainty about the extent of the downside risks to growth."\n}', role='assistant', tool_calls=None, function_call=None, provider_specific_fields={'refusal': None}))], usage=Usage(completion_tokens=39, prompt_tokens=446, total_tokens=485, completion_tokens_details=None, prompt_tokens_details=None), service_tier=None, prompt=[])
- FinMA-7B-full incorrect
- gemini-2.0-flash incorrect
{ "label": "uncertain", "justification": "The phrase \"downside risks to growth remain high\" indicates a possibility of negative outcomes, reflecting uncertainty about future economic performance." } - DeepSeek-V3 incorrect
ModelResponse(id='nqNqY6M-zqrih-931733956f66c943', created=1744843405, model='together_ai/deepseek-ai/DeepSeek-V3', object='chat.completion', system_fingerprint=None, choices=[Choices(finish_reason='stop', index=0, message=Message(content='
json\n{\n"label": "uncertain",\n"justification": "The use of \'remain high\' indicates a lack of definitive certainty about the extent of the downside risks."\n}\n', role='assistant', tool_calls=None, function_call=None, provider_specific_fields={'refusal': None}))], usage=Usage(completion_tokens=40, prompt_tokens=151, total_tokens=191, completion_tokens_details=None, prompt_tokens_details=None, cached_tokens=0), service_tier=None, prompt=[]) - Llama-4-Maverick incorrect
ModelResponse(id='nq4jkzR-zqrih-9303aafdcb79adc3', created=1744638581, model='together_ai/meta-llama/Llama-4-Maverick-17B-128E-Instruct-FP8', object='chat.completion', system_fingerprint=None, choices=[Choices(finish_reason='length', index=0, message=Message(content='To classify the given sentence, we need to analyze its content and tone to determine whether it conveys certainty or uncertainty.\n\nThe sentence is: "it is, however, recognised that the downside risks to growth remain high."\n\nLet's break it down:\n\n1. Label Assignment: The sentence implies that there are acknowledged risks associated with growth, which suggests a level of uncertainty about future outcomes.\n\n2. …
- Llama-4-Scout incorrect
ModelResponse(id='nq3zcm4-3NKUce-9302dd7cc95a44e1', created=1744630163, model='together_ai/meta-llama/Llama-4-Scout-17B-16E-Instruct', object='chat.completion', system_fingerprint=None, choices=[Choices(finish_reason='stop', index=0, message=Message(content='Here is the classification:\n\n
\n{\n "label": "uncertain",\n "justification": "The sentence expresses a possibility of downside risks to growth remaining high, indicated by the word \'recognised\' and \'remain high\', which implies uncertainty."\n}\n', role='assistant', tool_calls=None, function_call=None, provider_specific_fields={'refusal': None}))], usage=Usage(completion_tokens=54, prompt_tokens=161, total_tokens=215, …
