A study has found that multimodel AI models perform poorly at giving safe responses when users give multimodal inputs such as an image and text together. The new SIUO benchmark was made as a result.
A study has found that multimodel AI models perform poorly at giving safe responses when users give multimodal inputs such as an image and text together. The new SIUO benchmark was made as a result.