Content deleted Content added
No edit summary |
No edit summary |
||
Line 1:
{{refimprove|date=May 2016}}
Continuous Analytics is a process for releasing analytics code in a manner similar to [http://martinfowler.com/bliki/ContinuousDelivery.html Continuous
Line 8:
Analytics is the application of mathematics and statistics to big data. Data scientists write analytics programs to look for solutions to business problems, like forecasting demand or setting an optimal price.
Traditionally data scientists have not been part of IT development teams, like regular Java programmers. This is because their skills set them apart in their own department not normally related to IT
Continuous Analytics then is the extension of the Continuous
To make this work means getting data scientists to write their Scala, Python, and R code in the same code repository that regular programmers use, like Git or Subversion, so that software like Jenkins can pull it from there and run it through the build process. It also means saving the configuration of the the big data cluster (sets of virtual machines) in some kind of repository as well, like Docker. That facilitates sending out analytics code and big data software and objects in the same automated way as the Continuous Integration process.
<ref>{{cite web|url=http://southernpacificreview.com/2016/05/17/continuous-analytics-defined/|title=Continuous Analytics
<ref>{{cite web|last1=Pushkarev|first1=Stepan|title=Tear down the Wall between Data Science and DevOps|url=https://www.linkedin.com/pulse/tear-down-wall-between-data-science-devops-stepan-pushkarev?trk=prof-post|website=LinkedIN|publisher=LinkedIN|accessdate=17 May 2016}}</ref>
|