Data Algorithms by Mahmoud Parsian

Data Algorithms by Mahmoud Parsian

Regular price
Checking stock...
Regular price
Checking stock...
Zusammenfassung

If you're ready to dive into the MapReduce framework for processing large datasets, this practical book takes you step by step through the algorithms and tools you need to build distributed MapReduce applications.

The feel-good place to buy books
  • Free delivery in the UK
  • Supporting authors with AuthorSHARE
  • 100% recyclable packaging
  • B Corp - kinder to people and planet
  • Buy-back with World of Books - Sell Your Books

Data Algorithms by Mahmoud Parsian

If you are ready to dive into the MapReduce framework for processing large datasets, this practical book takes you step by step through the algorithms and tools you need to build distributed MapReduce applications with Apache Hadoop or Apache Spark. Each chapter provides a recipe for solving a massive computational problem, such as building a recommendation system. You'll learn how to implement the appropriate MapReduce solution with code that you can use in your projects. Dr. Mahmoud Parsian covers basic design patterns, optimization techniques, and data mining and machine learning solutions for problems in bioinformatics, genomics, statistics, and social network analysis. This book also includes an overview of MapReduce, Hadoop, and Spark. Topics include: Market basket analysis for a large set of transactions Data mining algorithms (K-means, KNN, and Naive Bayes) Using huge genomic data to sequence DNA and RNA Naive Bayes theorem and Markov chains for data and market prediction Recommendation algorithms and pairwise document similarity Linear regression, Cox regression, and Pearson correlation Allelic frequency and mining DNA Social network analysis (recommendation systems, counting triangles, sentiment analysis)
Mahmoud Parsian, Ph.D. in Computer Science, is a practicingsoftware professional with 30 years of experience as a developer, designer, architect, and author. For the past 15 years, he hasbeen involved in Java server-side, databases, MapReduce, anddistributed computing. Dr. Parsian currently leads Illumina'sBig Data team, which is focused on large-scale genome analyticsand distributed computing. He leads and develops scalableregression algorithms; DNA sequencing and RNA sequencing pipelinesusing Java, MapReduce, Hadoop, HBase, and Spark; and open sourcetools. He is also the author of JDBC Recipes and JDBC Metadata (bothfrom Apress).
SKU Nicht verfügbar
ISBN 13 9781491906187
ISBN 10 1491906189
Titel Data Algorithms
Autor Mahmoud Parsian
Buchzustand Nicht verfügbar
Bindungsart Paperback
Verlag O'Reilly Media
Erscheinungsjahr 2015-07-28
Seitenanzahl 778
Hinweis auf dem Einband Die Abbildung des Buches dient nur Illustrationszwecken, die tatsächliche Bindung, das Cover und die Auflage können sich davon unterscheiden.
Hinweis Nicht verfügbar