Multimodal learning
==Application==
Multimodal deep Boltzmann machines have been used successfully for classification and missing-data retrieval. In classification accuracy, multimodal deep Boltzmann machines outperform [[support vector machine]]s, [[latent Dirichlet allocation]] and [[deep belief network]]s when tested on data with paired image-text modalities or with a single modality. A multimodal deep Boltzmann machine can also predict a missing modality from the observed ones with reasonably good precision.
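The missing-modality retrieval described above can be illustrated with a minimal sketch: once a multimodal model maps both modalities into a shared latent space, a missing modality can be approximated by retrieving the nearest training example to the observed modality's latent vector. The latent vectors and helper function below are synthetic illustrations, not the output of an actual deep Boltzmann machine.

```python
import numpy as np

# Toy stand-in for a multimodal model's shared latent space: each
# training example has an image latent vector paired with a text.
# These vectors are hand-made for illustration, not learned.
train_image_latents = np.array([[1.0, 0.0],
                                [0.0, 1.0],
                                [0.7, 0.7]])
train_texts = ["a red apple", "a blue car", "a purple flower"]

def predict_missing_text(image_latent):
    """Retrieve the text paired with the nearest training example in
    the shared latent space -- a simple nearest-neighbour proxy for
    conditioning the missing modality on the observed one."""
    dists = np.linalg.norm(train_image_latents - image_latent, axis=1)
    return train_texts[int(np.argmin(dists))]

# An image latent close to the first training example.
print(predict_missing_text(np.array([0.9, 0.1])))  # a red apple
```

In a real multimodal deep Boltzmann machine the missing modality is sampled from the conditional distribution rather than retrieved, but the nearest-neighbour view conveys the role of the shared representation.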
Self-supervised learning has yielded more powerful multimodal models. OpenAI's CLIP and DALL-E are prominent examples trained with such methods.
 
==See also==