Revision as of 18:44, 21 March 2024 edit Mikeblas (talk \| contribs) Administrators 82,195 edits remvoe tag; I don't see a list of general references here ← Previous edit		Revision as of 10:31, 3 April 2024 edit undo Mppria (talk \| contribs) 5 edits m Clarified concepts, making them more concrete and accessible. Tags: Visual edit Newcomer task Newcomer task: copyedit Next edit →
Line 6: {{machine learning}} <!--- Don't mess with this line! ---><!--- Write your article below this line ---> '''Multimodal learning''', in the context of [[machine learning]], is a type of [[deep learning]] using a combination of various [[Modality (human–computer interaction)\|modalities]] of data, ~~often~~such ~~arising~~as text, audio, or images, in order to create a more robust model of the real-world ~~applications~~phenomena in question. AnIn ~~example~~contrast, ofsingular ~~multi-~~modal ~~data~~learning iswould ~~data that combines~~analyze text (typically represented as [[feature vector]]) ~~with~~or imaging data (consisting of [[pixel]] intensities and annotation tags) independently. AsMultimodal ~~these~~machine ~~modalities~~learning ~~have~~combines these fundamentally different statistical ~~properties, combining them is non-trivial, which is~~analyses ~~why~~using specialized ~~modelling~~modeling strategies and algorithms, ~~are~~resulting ~~required.~~in ~~The~~a model isthat ~~then~~comes ~~trained~~closer to ~~able~~representing tothe ~~understand~~real ~~and work with multiple forms of data~~world. ==Motivation==

Multimodal learning: Difference between revisions