Knowledge Quiz
Test your understanding of this article
1.What is the primary limitation of Vision-Language Models (VLMs) like CLIP that Omni-NegCLIP aims to address?
2.Omni-NegCLIP improves CLIP's understanding of negation by modifying which component of CLIP?
3.Which of the following best describes 'presence-based negation' as defined in the article?
4.Based on the article's observation, which part of the CLIP text encoder has a stronger learning ability for negated text?
