Abstract— The amount of information on the web is increasing day by day at an exponential rate, so there is a need to retrieve the information that a particular user actually requires. The required information varies from user to user (for example, between a geek and the geek's mother). Personalized search, in which the results vary with the user's interests, is therefore needed. This paper discusses the ongoing research in this area and also serves as a guide to designing a personalized search engine efficiently and effectively using ConceptNet, with a detailed architecture.

... considered and what to follow when designing a system for personalized search.

The rest of the paper is organized as follows. Section II discusses the past and ongoing research related to this field, and Section III proposes a system for the better design of personalized search, with explanations and the architecture. Section IV contains the conclusion of this paper. Section V discusses the future work related to this area.
IRACST – Engineering Science and Technology: An International Journal (ESTIJ), ISSN: 2250-3498,
Vol.2, No. 2, April 2012
B. Personalizing Search based on Folksonomy
The Delicious website [3] provides a tagging interface that returns personalized results based on the search tag. The problem is that the search tag must exactly match a tag in the folksonomy (dataset), i.e. it does not consider the semantic aspect of searching. Peter and Stock (2007) [7] proposed natural language processing methods that process the sentences provided as a search query to identify the entities referenced by the tags, which could be useful for retrieval. Heung-nam Kim et al. (2011) [9] propose a method that uses user-tag, item-tag and user-item matrices, in which tag similarities and item similarities are identified by computing the cosine similarity between tags and between items. The results are personalized based on the tag used by the user, and the similarity of that tag to the other tags is used to identify the items.

... gun". Then the concept identified from ConceptNet will be "Robbery" (judging from the data). The search is thus performed for the keyword "robbery", which will yield accurate results. More details about ConceptNet can be found in [11]. Let us now take an example from Fig. 1. If the query contains "bank, wallet, ATM", then the concept identified from ConceptNet is "money". This can be seen in the figure, where "money" is the root node of the three concepts.
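The concept-identification example with ConceptNet ("bank, wallet, ATM" leading to "money") can be illustrated with a minimal sketch. It assumes a small hand-written relation table, `TOY_RELATIONS`, which is hypothetical and is not taken from the real ConceptNet data, and a hypothetical helper `common_concept` that picks the concept linked to the largest number of query terms. It is meant only to show the idea of mapping several query terms to one shared concept, not the procedure actually used by the proposed system.

```python
from collections import Counter

# Hypothetical toy relations, hand-written for illustration only;
# they are NOT taken from the real ConceptNet data set.
TOY_RELATIONS = {
    "bank": {"money", "building", "loan"},
    "wallet": {"money", "pocket", "leather"},
    "atm": {"money", "machine", "card"},
}


def common_concept(query_terms, relations=TOY_RELATIONS):
    """Return the concept linked to the most query terms, or None."""
    counts = Counter()
    for term in query_terms:
        # Terms missing from the table contribute nothing.
        for concept in relations.get(term.lower(), ()):
            counts[concept] += 1
    if not counts:
        return None
    # Sort key (-count, name) makes tie-breaking deterministic.
    best, _ = min(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return best


print(common_concept(["bank", "wallet", "ATM"]))  # prints: money
```

The search would then be issued for the returned concept rather than for the literal query terms, which is how the exact-match limitation of plain tag search is avoided in this sketch.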
C. Page Ranking
After identifying the personalized results, it is also important that the results of interest come first, as everyone looks for the resources they are interested in among the top 10 results. To bring these results to the top, page ranking using Folksonomy Boosted Rank [9] serves the purpose better, as it builds the user profile from the tags assigned by the user and returns the results based on that profile, i.e. the resources (items) are ranked according to the user's interests.

E. Architecture
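For the page-ranking step described under Page Ranking, the idea of ordering resources by a tag-based user profile can be sketched as follows. This is not the Folksonomy Boosted Rank method of [9]; it is a simplified stand-in that ranks items by the cosine similarity between the user's tag counts and each item's tag counts. The function names `cosine` and `rank_items` and the sample data are assumptions made for illustration.

```python
import math
from collections import Counter


def cosine(a, b):
    """Cosine similarity between two sparse tag-count vectors (dict-like)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_items(user_tags, item_tags, top_k=10):
    """Order items by similarity between the user's tag profile and item tags."""
    profile = Counter(user_tags)  # user profile = counts of tags the user assigned
    scored = [
        (cosine(profile, Counter(tags)), item)
        for item, tags in item_tags.items()
    ]
    # Highest score first; the item name breaks ties deterministically.
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [item for _, item in scored[:top_k]]


# Hypothetical sample data, for illustration only.
user_tags = ["python", "python", "search", "datamining"]
item_tags = {
    "page_a": ["python", "search"],
    "page_b": ["cooking", "recipes"],
    "page_c": ["datamining", "python", "python"],
}
print(rank_items(user_tags, item_tags))  # prints: ['page_c', 'page_a', 'page_b']
```

The `top_k` default of 10 mirrors the observation that users mostly look at the top 10 results.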
V. FUTURE WORK
The system as designed requires explicit user intervention, i.e. a user must tag resources, and the profile built from those tags drives the personalization. Personalizing on the basis of the user's historic data over the web would serve better, since the personalization would be done implicitly and the user would not need to understand the concepts of tagging and folksonomy. Another direction is to integrate the folksonomy structure into the taxonomy, which would overcome some of the disadvantages of a taxonomy, such as updating it and adding extra tags to it, as well as some of the disadvantages of a folksonomy, such as unstructured and ambiguous tags. The integration may thus help in maintaining a common, consistent vocabulary.

Rohith Krishna V is currently pursuing his final year of MS (Software Engineering) at VIT University and is interested in the area of data mining, mainly personalizing search.
REFERENCES