Content deleted Content added
m Disambiguating links to Data protection (link changed to Information privacy) using DisamAssist. |
m Open access bot: url-access updated in citation with #oabot. |
||
Line 55:
== Roles ==
=== Data engineer ===
A ''' data engineer''' is a type of software engineer who creates [[big data]] [[Extract, transform, load|ETL]] pipelines to manage the flow of data through the organization. This makes it possible to take huge amounts of data and translate it into [[business intelligence|insights]].<ref>{{Cite journal|last1=Tamir|first1=Mike|last2=Miller|first2=Steven|last3=Gagliardi|first3=Alessandro|date=2015-12-11|title=The Data Engineer|url=https://papers.ssrn.com/abstract=2762013|language=en|___location=Rochester, NY|doi=10.2139/ssrn.2762013|ssrn=2762013|s2cid=113342650|url-access=subscription}}</ref> They are focused on the production readiness of data and things like formats, resilience, scaling, and security. Data engineers usually hail from a software engineering background and are proficient in programming languages like [[Java (programming language)|Java]], [[Python (programming language)|Python]], [[Scala (programming language)|Scala]], and [[Rust (programming language)|Rust]].<ref>{{Cite web|date=2019-02-07|title=Data Engineer vs. Data Scientist|url=https://www.springboard.com/blog/data-engineer-vs-data-scientist/|access-date=2021-03-14|website=Springboard Blog|language=en-US}}</ref><ref name="hist1" /> They will be more familiar with databases, architecture, cloud computing, and [[Agile software development]].<ref name="hist1" />
=== Data scientist ===
|