Open Source Initiatives and Frameworks Addressing Distributed Real-time Data Analytics
2016 (English)In: 2016 IEEE 30TH INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW), 2016, 1481-1484 p.Conference paper (Refereed)
The continuous evolution of digital services, is resulting in the generation of extremely large data sets that are created in almost real time. Exploring new opportunities for improving the quality of these digital services, as well as providing better-personalized experiences to digital users are two major challenges to be addressed. Different methods, tools, and techniques existed today to generate actionable insights from digital services data. Traditionally, big data problems are handled on historical data-sets. However, there is a growing demand on real-time data analytics to offer new services to users and to provide pro-active customers' care, personalized ads, emergency aids, just to give a few examples. Spite of the fact that there are few existing frameworks for real-time analytics, however, utilizing those for solving distributed real-time big data analytical problems stills remains a challenge. Existing real-time data analytics (RTDA) frameworks are not covering all the features that requires for distributed computation in real-time. Therefore, in this paper, we present a qualitative overview and analysis on some of the mostly used existing RTDA frameworks. Specifically, Apache Spark, Apache Flink, Apache Storm, and Apache Samza are covered and discussed in this paper.
Place, publisher, year, edition, pages
2016. 1481-1484 p.
IEEE International Symposium on Parallel and Distributed Processing Workshops, ISSN 2164-7062
Real-time, data analytics, big data, streaming data, data analytics framework, distributed real-time data analysis
Media and Communication Technology
Research subject Computer and Information Sciences Computer Science, Media Technology
IdentifiersURN: urn:nbn:se:lnu:diva-60583DOI: 10.1109/IPDPSW.2016.152ISI: 000391253600182ISBN: 978-1-5090-3682-0 OAI: oai:DiVA.org:lnu-60583DiVA: diva2:1072623
30th IEEE International Parallel and Distributed Processing Symposium (IPDPS), MAY 23-27, 2016, Illinois Inst Technol, Chicago, IL