Multimodal learning: Difference between revisions

Content deleted Content added
Grammatical errors correction.
Legolag (talk | contribs)
Phrasing to make concept easier to understand for those not aware of the idea.
Line 7:
{{machine learning}}
<!--- Don't mess with this line! ---><!--- Write your article below this line --->
'''Multimodal learning''', in the context of [[machine learning]], is a type of [[deep learning]] fromusing a combination of various [[Modality (human–computer interaction)|modalities]] of data, often arising in real-world applications. An example of multi-modal data is data that combines text (typically represented as [[feature vector]]) with imaging data consisting of [[pixel]] intensities and annotation tags. As these modalities have fundamentally different statistical properties, combining them is non-trivial, which is why specialized modelling strategies and algorithms are required. The model is then trained to able to understand and work with multiple forms of data.
 
==Motivation==