iX report: Apache projects for analyzing large amounts of data

Hadoop, Storm, Kafka, Calcite, Zeppelin, Hive, Spark, Sqoop… The Apache Software Foundation brings together a continuously growing repertoire of open-source Big Data software. Some...

Dataworks Summit: Text Classification with R, Apache Solr and D3.js

From April 5 to 6, Hortonworks hosts the Dataworks Summit Europe 2017 in Munich. Stephanie Fischer and Christian Winkler from mgm are on site and give a talk...

Big Data Summit 2017: GfK and mgm give a talk about the classification of...

Day by day the Internet amasses more text in the form of unstructured data. How can we process these large amounts of data automatically?

Classifying unstructured text – mgm presentation at the Apache Big Data

With their presentation “Classifying unstructured text – deterministic and machine learning approaches” at the Apache Big Data Europe conference in Seville...

mgm in the Bitkom report: “Germany – Excellence in Big Data”

mgm has been listed as one of 60 technology providers in the recently published report “Germany – Excellence in Big Data”. The report from...

TDWI Conference 2016 – A controversial dialogue about Big Data

What are the cornerstones of successful Big Data projects? Technical aspects like scalable data storage and fast access times? Or organizational aspects like...

Article: Data-driven innovation with Big Data prototypes

mgm Big Data experts Stephanie Fischer and Dr. Christian Winkler discuss the importance of data-driven decisions and innovation for companies with regard to technical and organizational matters.

Big Data between technology and organizational culture

The introduction of Big Data is not merely a technical challenge. Our presentation at the TDWI conference dispels that myth and sketches out a more comprehensive picture.

mgm at Big Data conference in Vancouver

“Data Science with News Headlines – Analyzing and Visualizing a Whole Decade”. The presentation demonstrates how Apache tools can be used to “dig through” unstructured text and to analyze and visualize the data.

GeoMesa vs. GeoWave: A Benchmark for Geotemporal Point Data

This benchmark compares GeoMesa and GeoWave, two Hadoop-based technologies that specialize in the efficient storage and retrieval of geotemporal data. Both use Apache Accumulo as their backend (a key-value store following the BigTable Design (PDF)) and GeoTools for handling geodata. Although the technologies are also able to deal with complex [...]