Chapter 15 References

Ackley, David H, Geoffrey E Hinton, and Terrence J Sejnowski. 1985. “A Learning Algorithm for Boltzmann Machines.” Cognitive Science.

“AI and Compute.” 2019.

“Apache Solr.” 2019.

“Apache Spark and CERN Open Data Analysis, an Example.” 2017.

“Apache Spark Officially Sets a New Record in Large-Scale Sorting.” 2014.

“AutoML: Automatic Machine Learning.” 2019.

“Azure Wikipedia.” 2018.

“Big Compute Nimbix.” 2019.

“Big Compute Vs Big Data.” 2013.

“Big Data Wikipedia.” 2019.

“Bioinformatics Applications on Apache Spark.” 2018.

Ceruzzi, Paul E. 2012. Computing: A Concise History. MIT Press.

Chang, Winston. 2012. R Graphics Cookbook: Practical Recipes for Visualizing Data. O’Reilly Media, Inc.

Chollet, Francois, and J.J. Allaire. 2018. Deep Learning with R. Manning Publications.

Cleveland, William S. 2001. “Data Science: An Action Plan for Expanding the Technical Areas of the Field of Statistics?”

“Cloudera Wikipedia.” 2018.

Codd, Edgar F. 1970. “A Relational Model of Data for Large Shared Data Banks.” Communications of the ACM 13 (6): 377–87.

Cook, Darren. 2016. Practical Machine Learning with H2O: Powerful, Scalable Techniques for Deep Learning and AI. O’Reilly Media, Inc.

“CRAN - Package Sparklyr.” 2019.

“Databricks Community Edition.” 2019.

“Databricks Documentation.” 2018.

“Databricks Wikipedia.” 2018.

“Dataproc Wikipedia.” 2018.

Dean, Jeffrey, and Sanjay Ghemawat. 2004. “MapReduce: Simplified Data Processing on Large Clusters.” In USENIX Symposium on Operating Systems Design and Implementation (OSDI).

Ghemawat, Sanjay, Howard Gobioff, and Shun-Tak Leung. 2003. “The Google File System.” In Proceedings of the Nineteenth ACM Symposium on Operating Systems Principles. New York, NY, USA: ACM.

Greenacre, Michael. 2017. Correspondence Analysis in Practice. Chapman and Hall/CRC.

Group, World Bank. 2016. The Data Revolution. World Bank Publications.

“Higgs Boson Machine Learning Challenge.” 2019.

Hinton, Geoffrey E, Simon Osindero, and Yee-Whye Teh. 2006. “A Fast Learning Algorithm for Deep Belief Nets.” Neural Computation 18 (7): 1527–54.

“Hortonworks Microsoft.” 2018.

“Hortonworks Wikipedia.” 2018.

“Human Genome.” 2019.

“IBM Cloud Wikipedia.” 2018.

Kim, Albert Y, and Adriana Escobedo-Land. 2015. “OKCupid Data for Introductory Statistics and Data Science Courses.” Journal of Statistics Education 23 (2).

Krizhevsky, Alex, Ilya Sutskever, and Geoffrey E Hinton. 2012. “ImageNet Classification with Deep Convolutional Neural Networks.” In Advances in Neural Information Processing Systems, 1097–1105.

Kuhn, Max, and Kjell Johnson. 2019. Feature Engineering and Selection: A Practical Approach for Predictive Models. Chapman and Hall/CRC.

Laudon, Kenneth C, Carol Guercio Traver, and Jane P Laudon. 1996. “Information Technology and Systems.” Cambridge, MA: Course Technology.

“MapR Wikipedia.” 2018.

“Maven Repository: Repositories.” 2019.

Minsky, Marvin, and Seymour A Papert. 2017. Perceptrons: An Introduction to Computational Geometry. MIT Press.

“Netflix at Spark.” 2018.

“Profvis.” 2018.

Rosenblatt, Frank. 1958. “The Perceptron: A Probabilistic Model for Information Storage and Organization in the Brain.” Psychological Review.

“RSparkling — H2O Sparkling Water 2.3.31 Documentation.” 2019.

“RStudio Connect.” 2019.

“RStudio Profiler.” 2018.

“RStudio Server Pro.” 2019.

“Running Spark on Mesos.” 2018.

“Running Spark on YARN.” 2018.

Samuel, Arthur L. 1959. “Some Studies in Machine Learning Using the Game of Checkers.” IBM Journal of Research and Development 3 (3): 210–29.

Silge, Julia, and David Robinson. 2017. Text Mining with R: A Tidy Approach. O’Reilly Media, Inc.

“Sort Benchmark.” 2014.

“Spark-Solr Spark Package.” 2019.

“Spark Wins CloudSort Benchmark as the Most Efficient Engine.” 2016.

“Spark with R in Gitter.” 2019.

“The History of R’s Predecessor, S, from Co-Creator Rick Becker.” 2016.

Webster, Merriam. 2006. “Merriam-Webster Online Dictionary.” Webster, Merriam.

Wickham, Hadley, and Garrett Grolemund. 2016. R for Data Science: Import, Tidy, Transform, Visualize, and Model Data. O’Reilly Media, Inc.

Wu, C.F. Jeff. 1997. “Statistics = Data Science?”

Xie, Yihui, J.J. Allaire, and Garrett Grolemund. 2018. R Markdown: The Definitive Guide. 1st ed. CRC Press.

Zaharia, Matei, Mosharaf Chowdhury, Michael J Franklin, Scott Shenker, and Ion Stoica. 2010. “Spark: Cluster Computing with Working Sets.” HotCloud 10 (10-10): 95.

Zou, Hui, and Trevor Hastie. 2005. “Regularization and Variable Selection via the Elastic Net.” Journal of the Royal Statistical Society: Series B (Statistical Methodology) 67 (2): 301–20.