Multimodal learning: Difference between revisions

Content deleted Content added
m uniformly
Tags: Visual edit Mobile edit Mobile web edit Advanced mobile edit
Line 5:
{{tone|date=June 2015}}
}}
{{machine learning}}
<!--- Don't mess with this line! ---><!--- Write your article below this line --->
'''Multimodal learning''' attempts to model the combination of different [[Modality (human–computer interaction)|modalities]] of data, often arising in real-world applications. An example of multi-modal data is data that combines text (typically represented as discrete word count vectors) with imaging data consisting of [[pixel]] intensities and annotation tags. As these modalities have fundamentally different statistical properties, combining them is non-trivial, which is why specialized modelling strategies and algorithms are required.