HADOOP INFRASTRUCTURE FOR EDUCATION
Darko Marjanović
Miloš Milovanović
Božidar Radenković
University of Belgrade
Faculty of Organizational Sciences
Laboratory for E-business
Laboratory for E-business
• Exists within the Faculty of Organizational Sciences, University of Belgrade
• Has organized e-learning courses since 2001, using the Moodle LMS and a blended learning concept
• More than 1000 students take our courses each year
• Research areas:
• E-business
• Internet and mobile technologies
• Big Data
• Cloud Computing
• E-education
• Adaptive e-services
• Internet of things
• Social media
Overview
• Introduction
• Hadoop model for education
• Implementation
• Cluster organization
• Conclusion
Introduction
• Educational institutions need access to relevant information in order to offer high-quality education to students.
• Main problem: information arrives at organizations
• from a variety of sources
• at a rapidly increasing speed
• in a variety of types.
• Hadoop is a possible solution to this problem
Hadoop
• Apache Hadoop is an open-source software framework for storage and large-scale processing of data-sets on clusters of commodity hardware.
• All the modules in Hadoop are designed with a fundamental assumption that hardware failures are common and thus should be automatically handled in software by the framework.
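Hadoop's processing layer is built around the MapReduce model. The sketch below only imitates that model in a single Python process. It does not use Hadoop, and the log format and course names are hypothetical. It counts activity records per course in three steps: map, shuffle, and reduce.

```python
from collections import defaultdict

# Hypothetical activity log lines: "<student_id>,<course>,<action>"
LOG_LINES = [
    "s1,e-business,view",
    "s2,e-business,submit",
    "s1,big-data,view",
    "s3,big-data,view",
    "s2,big-data,submit",
]


def map_phase(line):
    """Emit one (course, 1) pair for a single log line."""
    _student, course, _action = line.split(",")
    yield course, 1


def shuffle(pairs):
    """Group emitted values by key, as the framework does between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Sum the counts collected for one course."""
    return key, sum(values)


def run_job(lines):
    """Run map, shuffle, and reduce in sequence over the input lines."""
    pairs = (pair for line in lines for pair in map_phase(line))
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())


print(run_job(LOG_LINES))
```

In a real cluster the map and reduce functions run on different nodes, and the framework reschedules a task if the node running it fails.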
Big Data
• Big data is a blanket term for any collection of data sets so large and complex that it becomes difficult to process using on-hand database management tools or traditional data processing applications.
Hadoop model for education
Guidelines used for deploying the Hadoop model:
• Efficient data import
• Reliable manipulation
• Flexible output
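One way to read the three guidelines is as stages of a small pipeline. The sketch below is an illustration under assumptions, not the model from the slides: the record fields, the two input formats, and the function names are all hypothetical. It imports records from a CSV and a JSON source, validates them, and writes them out in either format.

```python
import csv
import io
import json

# Hypothetical inputs: the same kind of record arriving in two formats.
CSV_SOURCE = "student,course,grade\ns1,big-data,9\ns2,big-data,7\n"
JSON_SOURCE = '[{"student": "s3", "course": "e-business", "grade": "10"}]'

FIELDS = ["student", "course", "grade"]


def import_records(csv_text, json_text):
    """Import: read both sources into one list of plain dicts."""
    records = list(csv.DictReader(io.StringIO(csv_text)))
    records.extend(json.loads(json_text))
    return records


def manipulate(records):
    """Manipulate: normalize types and skip incomplete or malformed rows."""
    clean = []
    for rec in records:
        try:
            clean.append({
                "student": rec["student"],
                "course": rec["course"],
                "grade": int(rec["grade"]),
            })
        except (KeyError, TypeError, ValueError):
            continue
    return clean


def export(records, fmt):
    """Output: render the same records as JSON or CSV text."""
    if fmt == "json":
        return json.dumps(records)
    if fmt == "csv":
        buf = io.StringIO()
        writer = csv.DictWriter(buf, fieldnames=FIELDS)
        writer.writeheader()
        writer.writerows(records)
        return buf.getvalue()
    raise ValueError(f"unsupported format: {fmt}")


records = manipulate(import_records(CSV_SOURCE, JSON_SOURCE))
print(export(records, "json"))
```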
![Page 8: Hadoop infrastructure for education](https://reader033.vdocuments.mx/reader033/viewer/2022052622/55942b701a28abc55a8b4751/html5/thumbnails/8.jpg)
Model for managing Big Data in educational institutions
Implementation
• Three node cluster
• Integration with Moodle LMS
• Distributed storage
• While running, a Hadoop cluster consumes a significant amount of resources, so monitoring and controlling them is essential.
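Resource control on a three-node cluster can be pictured as a periodic threshold check. The following is a minimal sketch with made-up numbers: the node names, metrics, and limits are assumptions, and a real deployment would read these values from its monitoring tools rather than from a hard-coded list.

```python
from dataclasses import dataclass


@dataclass
class NodeUsage:
    name: str
    cpu_percent: float
    memory_percent: float
    disk_percent: float


# Hypothetical limits; suitable values depend on the hardware and workload.
LIMITS = {"cpu_percent": 90.0, "memory_percent": 85.0, "disk_percent": 80.0}


def over_limit(node, limits=LIMITS):
    """Return the metrics on which the node exceeds its configured limit."""
    return [m for m, limit in limits.items() if getattr(node, m) > limit]


# Hypothetical usage snapshot of the three nodes.
snapshot = [
    NodeUsage("master", 45.0, 88.0, 60.0),
    NodeUsage("worker1", 95.0, 70.0, 82.0),
    NodeUsage("worker2", 30.0, 40.0, 50.0),
]

# Keep only the nodes that exceed at least one limit.
alerts = {n.name: over_limit(n) for n in snapshot if over_limit(n)}
print(alerts)
```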
![Page 10: Hadoop infrastructure for education](https://reader033.vdocuments.mx/reader033/viewer/2022052622/55942b701a28abc55a8b4751/html5/thumbnails/10.jpg)
Implemented Hadoop e-learning infrastructure
Cluster organization
• The Master node plays the central role and is responsible for Hadoop’s performance.
• To make better use of hard disk storage, the implementation described here also runs a Data Node on the Master node.
• The mechanism adopted to prevent data loss between nodes is constant monitoring of the network infrastructure.
• Data replication is used as a mechanism for preserving data within the cluster.
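The effect of replication on a three-node cluster can be sketched with a toy placement function. This is a simplified illustration only: the round-robin placement, the node names, and the replication factor of 2 are assumptions, and HDFS uses its own placement policy. The master appears in the node list because it also stores data in the setup described above.

```python
# Hypothetical node names; the master also acts as a data node.
NODES = ["master", "worker1", "worker2"]


def place_blocks(num_blocks, nodes, replication):
    """Assign each block to `replication` distinct nodes, round-robin."""
    if replication > len(nodes):
        raise ValueError("replication factor cannot exceed the number of nodes")
    placement = {}
    for block in range(num_blocks):
        start = block % len(nodes)
        placement[block] = [
            nodes[(start + i) % len(nodes)] for i in range(replication)
        ]
    return placement


def lost_blocks(placement, failed):
    """Return blocks that have no surviving replica once `failed` nodes are down."""
    failed = set(failed)
    return [b for b, holders in placement.items() if set(holders) <= failed]


placement = place_blocks(num_blocks=6, nodes=NODES, replication=2)
print(lost_blocks(placement, failed={"worker1"}))             # one node down
print(lost_blocks(placement, failed={"worker1", "worker2"}))  # two nodes down
```

With two replicas per block, the loss of any single node leaves every block readable. Losing two nodes at once removes the blocks whose replicas were both stored on those nodes.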
![Page 12: Hadoop infrastructure for education](https://reader033.vdocuments.mx/reader033/viewer/2022052622/55942b701a28abc55a8b4751/html5/thumbnails/12.jpg)
Hadoop cluster organization in Laboratory for E-business
Conclusion
• A scalable, Hadoop-based platform that brings Big Data to the e-learning environment is presented.
• The main contribution of the paper is an environment for manipulating data generated from a variety of sources in educational activities.
• The primary objective is to improve the e-learning process.
• Future research is directed toward:
• optimizing integration with e-learning services
• integration with a cloud platform