TY  - JOUR
AU  - Etyang, Lennah
AU  - Nderu, Lawrence
AU  - Mwangi, Waweru
PY  - 2021/04/25
Y2  - 2024/03/28
TI  - Parameter Settings Optimization in MapReduce Big Data processing using the MOPSO Algorithm
JF  - International Journal of Advances in Scientific Research and Engineering (IJASRE), ISSN:2454-8006, DOI: 10.31695/IJASRE
JA  - IJASRE
VL  - 7
IS  - 4
SE  - Articles
DO  - 10.31695/IJASRE.2021.33923
UR  - https://ijasre.net/index.php/ijasre/article/view/1144
SP  - 31-43
AB  - Big data is a highly valued commodity worldwide. Experts regard it not merely as data but as a source from which intelligence can be derived. Because of its characteristics (variety, value, volume, and velocity) and the growing need to handle it, organizations face difficulties in ensuring optimal and affordable processing and storage of large datasets. One existing model for rapid processing and storage of big data is Hadoop MapReduce. MapReduce is used for large-scale data processing in a parallel and distributed computing environment, while Hadoop is used for running applications and storing data on clusters of commodity hardware. However, the Hadoop MapReduce framework exposes more than 190 configuration parameters, most of which are tuned manually. Because the parameters interact in complex ways and span a large search space, manual tuning is not effective. Even worse, these parameters must be tuned every time a Hadoop MapReduce application is run. The main goal of this research is to create an algorithm that improves efficiency by automatically optimizing parameter settings when MapReduce jobs are running. The algorithm employs the Multi-Objective Particle Swarm Optimization (MOPSO) technique, which uses two objective functions to search for a Pareto-optimal solution while optimizing the parameters. Experimental results show that the algorithm markedly improves MapReduce job performance compared with the default settings.
ER  -