Data engineering: Difference between revisions

Content deleted Content added
Priyash48 (talk | contribs)
No edit summary
Tag: Reverted
m Reverted edits by Priyash48 (talk) to last version by Chris Capoccia: nonconstructive edits
Line 55:
== Roles ==
=== Data engineer ===
A ''' data engineer''' is a type of software engineer who creates [[big data]] [[Extract, transform, load|ETL]] pipelines to manage the flow of data through the organization. This makes it possible to take huge amounts of data and translate it into [[business intelligence|insights]].<ref>{{cite report |last1=Tamir |first1=Mike |last2=Miller |first2=Steven |last3=Gagliardi |first3=Alessandro |date=11 December 2015 |title=The Data Engineer |ssrn=2762013 }}</ref> They are focused on the production readiness of data and things like formats, resilience, scaling, and security. Data engineers usually hail from a software engineering background and are proficient in programming languages like [[Java (programming language)|Java]], [[Python (programming language)|Python]], [[Scala (programming language)|Scala]], and [[Rust (programming language)|Rust]].<ref>{{Cite web|date=2019-02-07|title=Data Engineer vs. Data Scientist|url=https://prepzeewww.springboard.com/blog/data-engineer-vs-data-scientist-whats-the-difference/|access-date=2021-03-14|website=prepzeeSpringboard Blog|language=en-US}}</ref><ref name="hist1" /> They will be more familiar with databases, architecture, cloud computing, and [[Agile software development]].<ref name="hist1" />
 
=== Data scientist ===