<!--- Don't mess with this line! ---><!--- Write your article below this line --->
Information in the real world usually comes in different modalities. For example, images are usually associated with tags and text explanations, and text contains images to more clearly express the main idea of the article. Different modalities are characterized by very different statistical properties. For instance, images are usually represented as [[pixel]] intensities or outputs of [[Feature extraction|feature extractors]], while texts are represented as discrete word-count vectors. Because of the distinct statistical properties of different information sources, it is important to discover the relationships between modalities. '''Multimodal learning''' is a good model for representing the joint representations of different modalities. A '''multimodal learning model''' is also capable of supplying a missing modality based on the observed ones. One such model combines two [[Deep Boltzmann Machines|deep Boltzmann machines]], each corresponding to one modality. An additional hidden layer is placed on top of the two Boltzmann machines to produce the joint representation.
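The architecture described above can be illustrated with a minimal sketch. This is not a trained deep Boltzmann machine: it shows only a single feed-forward pass in which an image pathway and a text pathway are each mapped to modality-specific hidden units, whose concatenation feeds a shared joint layer. All weights, layer sizes, and input vectors here are random placeholders chosen for illustration.

```python
import math
import random

random.seed(0)

def sigmoid(x):
    """Logistic activation used for Boltzmann-machine-style hidden units."""
    return 1.0 / (1.0 + math.exp(-x))

def layer(inputs, weights):
    """Activations of one hidden layer: sigmoid of each weighted sum."""
    return [sigmoid(sum(w * v for w, v in zip(row, inputs)))
            for row in weights]

def rand_weights(n_out, n_in):
    """Placeholder random weight matrix (no training is performed)."""
    return [[random.uniform(-0.1, 0.1) for _ in range(n_in)]
            for _ in range(n_out)]

# Hypothetical dimensions: 8 pixel intensities, 6 word-count features.
W_img = rand_weights(4, 8)    # image-specific pathway
W_txt = rand_weights(4, 6)    # text-specific pathway
W_joint = rand_weights(3, 8)  # joint layer over the concatenated codes

image = [random.random() for _ in range(8)]  # e.g. pixel intensities
text = [random.random() for _ in range(6)]   # e.g. word counts

h_img = layer(image, W_img)          # modality-specific representation
h_txt = layer(text, W_txt)           # modality-specific representation
joint = layer(h_img + h_txt, W_joint)  # shared joint representation
```

In an actual multimodal deep Boltzmann machine, the connections are undirected and the joint layer is used both to combine the modalities and, by running inference the other way, to infer a missing modality from the observed one; the sketch above shows only the combining direction.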