Tuning Apache Spark: Powerful Big Data Processing Recipes

Tuning Apache Spark: Powerful Big Data Processing Recipes
199.99 USD
Buy Now

Video Learning Path OverviewA Learning Path is a specially tailored course that brings together two or more different topics that lead you to achieve an end goal. Much thought goes into the selection of the assets for a Learning Path, and this is done through a complete understanding of the requirements to achieve a goal. Today, organizations have a difficult time working with large datasets. In addition, big data processing and analyzing need to be done in real time to gain valuable insights quickly. This is where data streaming and Spark come in. In this well thought out Learning Path, you will not only learn how to work with Spark to solve the problem of analyzing massive amounts of data for your organization, but youll also learn how to tune it for performance. Beginning with a step by step approach, youll get comfortable in using Spark and will learn how to implement some practical and proven techniques to improve particular aspects of programming and administration in Apache Spark. Youll be able to perform tasks and get the best out of your databases much faster. Moving further and accelerating the pace a bit, Youll learn some of the lesser known techniques to squeeze the best out of Spark and then youll learn to overcome several problems you might come across when working with Spark, without having to break a sweat. The simple and practical solutions provided will get you back in action in no time at all! By the end of the course, you will be well versed in using Spark in your day to day projects. Key FeaturesFrom blueprint architecture to complete code solution, this course treats every important aspect involved in architecting and developing a data streaming pipelineTest Spark jobs using the unit, integration, and end-to-end techniques to make your data pipeline robust and bulletproof. Solve several painful issues like slow-running jobs that affect the performance of your application. Author BiosAnghel Leonard is currently a Java chief architect. He is a member of the Java EE Guardians with 20+ years experience. He has spent most of his career architecting distributed systems. He is also the author of several books, a speaker, and a big fan of working with data. Tomasz Lelek is a Software Engineer, programming mostly in Java and Scala. He has been working with the Spark and ML APIs for the past 5 years with production experience in processing petabytes of data. He is passionate about nearly everything associated with software development and believes that we should always try to consider different solutions and approaches before solving a problem. Recently he was a speaker at conferences in Poland, Confitura and JDD (Java Developers Day), and at Krakow Scala User Group. He has also conducted a live coding session at Geecon Conference. He is a co-founder of initlearn, an e-learning platform that was built with the Java language. He has also written articles about everything related to the Java world.