TY  - JOUR
TI  - A comparison of Apache Solr and Elasticsearch technologies in support of large-scale data analysis
AB  - In the era of big data, data has never been more important because it contains hidden insights. Extracting usable information from enormous volumes of data is both necessary and challenging. Developers of data-intensive systems have consequently met several challenges when attempting to perform data processing and analytics in a variety of domains. Full-text search is one of the most significant components of big data processing and analytics, because it locates fragments of required data among large volumes of data. Given the importance of the subject, this article begins with an examination of the characteristics, capabilities, and technical comparisons of full-text search technologies, followed by a systematic comparison of Apache Solr and Elasticsearch in terms of indexing times and queries on three separate datasets. According to our findings, under the default configuration, Apache Solr achieves better indexing times as measured on three machines with different hardware specifications. Likewise, Apache Solr outperforms Elasticsearch in seven out of ten search queries. Based on our results, we recommend utilizing Apache Solr instead of Elasticsearch on computers with restricted hardware resources. In addition, this study provides researchers and developers of data-intensive systems with a complete comparison and suggestions for choosing the most effective full-text search engine for their task.
AU  - DENİZ, AYSENUR
AU  - ELÖMER, Muhammed Mehdi
AU  - Aydin, Ahmet Arif
DO  - 10.17714/gumusfenbil.1213317
PY  - 2023
JO  - Gümüşhane Üniversitesi Fen Bilimleri Dergisi
VL  - 13
IS  - 2
SN  - 2146-538X
SP  - 386
EP  - 404
DB  - TRDizin
UR  - http://search/yayin/detay/1187343
ER  -