iX report: Apache projects for analyzing large amounts of data

Hadoop, Storm, Kafka, Calcite, Zeppelin, Hive, Spark, Sqoop… The Apache Software Foundation brings together a continuously growing repertoire of open-source Big Data software. Some projects provide quite similar functions for similar tasks. Developers and Data Scientists are increasingly faced with the challenge of finding the right Apache-software for their purposes. Stephanie Fischer und Dr. Christian Winkler from mgm give an overview of current Apache-projects in the June Edition of the German IT-magazine iX. In a functional map of the Big Data World, the projects are first classified into different categories. Subsequently, the differences between projects with similar functionalities are discussed.