UACEE International Journal of Computer Science and its Applications - Volume 2: Issue 3 [ISSN 2250 - 3765] 248 Blogs Search Engine Using RSS Syndication and Fuzzy Parameters Athraa Jasim Mohammed School of Computing College of Arts and Sciences Universiti Utara Malaysia Sintok, Kedah, Malaysia s808175@student.uum.edu.my Husniza Husni School of Computing College of Arts and Sciences Universiti Utara Malaysia Sintok, Kedah, Malaysia husniza@.uum.edu.my AbstractThe rapid development of the internet eventually increases the number of internet users triggering the need for an intelligent search engine that is able to minimize the search on world wide web (WWW) and find relevant information as requested. To overcome the issue of finding relevant information as well as minimizing the search on WWW, this paper proposes a search engine that is specifically designed and built using RSS syndication and fuzzy Parameters to search for information contained in blogs. The blogs search engine consists of three main phases: 1) crawling using RSS feeds algorithm; 2) indexing weblogs algorithm; and 3) searching technique using fuzzy logic. In RSS crawling process, the RSS feeds need to be gathered to extract useful information such as title, links, time published, and description. Next, indexing weblogs uses the links to retrieve the blog sites for text processing and for constructing the indexing database. In order to retrieve such information requested or queried by any user, an interface is provided to enable the blog search based on keyword with associated degree of importance. The density of keyword is then computed from the indexing database. The rank of the pages is computed by using fuzzy weighted average. The experiment resulted in mean average precision of 81.7% of total system performance. KeywordsRss feeds, blog ssearch engine, fuzzy weighted average, keyword density. I. INTRODUCTION In the last few years, the large population of Internet communities have caused massive quantities of web data, which led to the development and consumption of information. This situation also increases and somehow motivates more people to use blogs. A weblog or blog is a ―frequently updated Web page with dated entries in reverse chronological order, usually containing links with commentary‖ [1]. Instead of encountering blogs by chance during navigating the Internet, a search engine becomes necessary to be able to actively find interested blogs [2]. A search engine is simply "a web site used to easily locate internet resources". Search engines have facilitated the information retrieval process by adopting techniques such as Artificial Intelligence [3]. In this paper, we proposed to use two techniques RSS technique in crawling phase and fuzzy logic in search phase. RSS (Really Simple Syndication, a web content syndicate format) is being used extensively to describe the content and related information of weblogs and news sites. The RSS data published by websites as abstract of its latest contents [4]. RSS is an XML file format designed for web content syndication whereas fuzzy logic employs Fuzzy weighted average (FWA) calculated by two parameters: the importance of query and query density. Figure 1 depicts an example of RSS feed extracted from a weblog site, which exemplifies the main parts of a RSS feed. Figure 1 : RSS feed example Figure 1 : example of RSS feed <?xml version="1.0" encoding="UTF-8"?> <rss version="2.0"> <channel> <title>Robotics Zeitgeist</title> <link>http://robotzeitgeist.com</link> <description>Artificial Intelligence and Robotics blog</description> <lastBuildDate>Tue, 23 Aug 2011 05:22:19 +0000</lastBuildDate> <item> <title>MABEL two-legged robot fastest in the world</title> <link>http://feedproxy.google.com/~r/ArtificialIntelligence AndRobotics/~3/5u4C_ssd790/mabel-two-legged- robot-fastest-in-the-world.html</link> <pubDate>Tue, 23 Aug 2011 05:22:19 +0000</pubDate> <description>It would appear that we have a new champion in the &#8220;what robot can run fastest race&#8221;. The two-legged robot MABEL under development for several years at the University of Michigan was recently revealed to reach a top running speed of 6.8 miles per hour or roughly 11 kilometers per hour. This means that MABEL is significantly faster than the previous record holder which was Toyota&#8217;s humanoid robot</a> with a top speed of 7 kilometers per hour; Honda&#8217;s ASIMO is now in 3rd place with a top speed of 6 kilometers per hour.</p></description> </item>