Data Science with Java: Practical Methods for Scientists and Engineers

Data Science is booming thanks to R and Python, but Java brings the robustness, convenience, and ability to scale critical to today’s data science applications. With this practical book, Java software engineers looking to add data science skills will take a logical journey through the data science pipeline. Author Michael Brzustowicz explains the basic math theory behind each step of the data science process, as well as how to apply these concepts with Java.You’ll learn the critical roles that data IO, linear algebra, statistics, data operations, learning and prediction, and Hadoop MapReduce play in the process. Throughout this book, you’ll find code examples you can use in your applications.Examine methods for obtaining, cleaning, and arranging data into its purest formUnderstand the matrix structure that your data should takeLearn basic concepts for testing the origin and validity of dataTransform your data into stable and usable numerical valuesUnderstand supervised and unsupervised learning algorithms, and methods for evaluating their successGet up and running with MapReduce, using customized components suitable for data science algorithms

Author: Michael R. Brzustowicz PhD

Learn more