paper
Modeling Complex Behaviors: Multi-Personality Composition and Dynamic Switching in Vision-Language Models
paperactiveprovisional
modeling-complex-behaviors-multi-personality-composition-and-dynamic-switching-in-vision-language-models-99128d23·1 events·first seen 7d agoAliases: Modeling Complex Behaviors: Multi-Personality Composition and Dynamic Switching in Vision-Language Models
More like this (12)
Vision-Language ModelsVision-Language-Action modelsvisual language modelTempoVLA: Learning Speed-Controllable Vision-Language-Action PoliciesVision-Language-Action modelLabVLA: Grounding Vision-Language-Action Models in Scientific LaboratoriesMulti-Faceted Interactivity Alignment in Full-Duplex Speech Modelsmulti-turn language modelsUnified Multimodal Models (UMMs)Language Modeling LossMultimodal Large Language ModelsExploring Adversarial Robustness and Safety Alignment in Multilingual Multi-Modal Large Language Models
Recent events (1)
Systematic evaluation of multi-personality conditioning and dynamic switching in vision-language models
This paper introduces explicit personality conditioning for multimodal large language models (MLLMs) and proposes an evaluation framework covering single-personality induction, multi-personality composition, and dynamic personality switching. Experiments reveal that personality induction improves image captioning but degrades performance on precise reasoning tasks like VQA. The authors find balancing and residual effects during multi-trait composition and switching, and show that existing prompt-based personality induction methods transfer poorly to multimodal settings.