technique
multimodal pretraining
techniqueactiveprovisional
multimodal-pretraining-a2a75271·1 events·first seen 20d agoAliases: multimodal pretraining
Co-occurring entities
More like this (12)
Multimodal Learningmultimodal classification modelsmultimodal agentsmultimodal embeddingtemporally ordered pre-trainingMultimodal GainUnified Multimodal Models (UMMs)multimodal neuronsSelf-Supervised PretrainingMulti-Task Learningmultimodal meta-verificationLatent World Recovery for Multimodal Learning with Missing Modalities
Recent events (1)
VLMs May Not Globally Enhance Human Alignment over LLMs During Natural Reading
This paper compares matched LLM and VLM pairs in a text-only setting to isolate the effect of multimodal training history on human-like language processing. Using whole-cortex fMRI and eye-tracking data from natural reading, the authors find that multimodal pretraining does not confer a uniform global advantage in human alignment. However, VLMs show selective advantages when sentences contain stronger visual semantic content, with converging evidence from both neural and behavioral measures. The findings suggest language-internal representations remain the primary driver of human text processing alignment.