Scholarly open access journals, Peer-reviewed, and Refereed Journals, Impact factor 8.14 (Calculate by google scholar and Semantic Scholar | AI-Powered Research Tool) , Multidisciplinary, Monthly, Indexing in all major database & Metadata, Citation Generator, Digital Object Identifier(DOI)
Developing a concurrent crawler using Python and Go and comparing its performance with a single-threaded crawler Due to the issues that Python has with concurrency due to GIL (Global Interpreter Lock), the default Python scrapers, and crawlers are relatively slow. The first version will have no concurrency and will just request each website one at a time. The second version makes use of concurrent futures’ thread pool executor, allowing me to send concurrent requests by making use of threads. This paper will solve these issues by using asyncio and aiohttp, which will allow concurrent requests to be made via an event loop. Then see if altering the implementation language has any effect on the speed of retrieval, internal processing, or memory needs for conducting the crawl. Google introduced GO to allow for increased concurrency.
Keywords:
Web Crawler, GIL (Global Interpreter Lock), asyncio, aiohttp, Python scrapers.
Cite Article:
"Performance analysis of a Non-Concurrent Crawler, a Concurrent Crawler (Python and Go), and a script using asyncio and aiohttp", International Journal for Research Trends and Innovation (www.ijrti.org), ISSN:2455-2631, Vol.7, Issue 6, page no.559 - 566, June-2022, Available :http://www.ijrti.org/papers/IJRTI2206093.pdf
Downloads:
000205401
ISSN:
2456-3315 | IMPACT FACTOR: 8.14 Calculated By Google Scholar| ESTD YEAR: 2016
An International Scholarly Open Access Journal, Peer-Reviewed, Refereed Journal Impact Factor 8.14 Calculate by Google Scholar and Semantic Scholar | AI-Powered Research Tool, Multidisciplinary, Monthly, Multilanguage Journal Indexing in All Major Database & Metadata, Citation Generator