Data engineering: Difference between revisions

Content deleted Content added
m v2.05 - Fix errors for CW project (Link equal to linktext)
Line 31:
==== Data lakes ====
 
A [[Data lake|data lake]] is a centralized repository for storing, processing, and securing large volumes of data. A data lake can contain [[structured data]] from [[Relational database|relational databases]], [[semi-structured data]], [[unstructured data]], and [[binary data]]. A data lake can be created on premises or in a cloud-based environment using the services from [[Cloud computing|public cloud]] vendors such as [[Amazon (company)|Amazon]], [[Microsoft]], or [[Google]].
 
==== Files ====