Please use this identifier to cite or link to this item:
https://hdl.handle.net/11499/46478
Title: | A framework for investigating search engines' stemming mechanisms: A case study on Bing | Authors: | Senturk, Fatmana Gunduz, Gurhan |
Keywords: | big data Bing indexing mechanism information retrieval search engine stemming Quality Web Big |
Publisher: | Wiley | Abstract: | Big data attracts the attention of governments and a lot of companies today. The developments in technology and the Internet make it one of the important sources of big data. It is easy to get lost in the enormous amount of information contained on the Internet if there were no search engines. Knowing how the search engines work will be helpful to access the desired information. This work aims to be a guide for accessing the right information and also to help to understand search engine stemming and indexing algorithm for interested parties. In this article, we have developed a framework that could be used to investigate the stemming mechanisms of search engines. Our framework also uses Word2vec to analyze semantic relations. We have used our framework to investigate the stemming algorithm of the search engine Bing for English language. In order to achieve that we have used this framework to select words, create queries, send them to Bing, and finally analyze the millions of returned results. We have discussed the results in the context of our article. The results indicate that our framework is useful for analyzing the stemming mechanisms of search engines. | URI: | https://doi.org/10.1002/cpe.6562 https://hdl.handle.net/11499/46478 |
ISSN: | 1532-0626 1532-0634 |
Appears in Collections: | Mühendislik Fakültesi Koleksiyonu Scopus İndeksli Yayınlar Koleksiyonu / Scopus Indexed Publications Collection WoS İndeksli Yayınlar Koleksiyonu / WoS Indexed Publications Collection |
Show full item record
CORE Recommender
SCOPUSTM
Citations
1
checked on Nov 16, 2024
WEB OF SCIENCETM
Citations
1
checked on Nov 21, 2024
Page view(s)
46
checked on Aug 24, 2024
Google ScholarTM
Check
Altmetric
Items in GCRIS Repository are protected by copyright, with all rights reserved, unless otherwise indicated.