Professional Documents
Culture Documents
An Effective Approach For Discovering Relevant Semantic Associations Based On User Specified Relationships
An Effective Approach For Discovering Relevant Semantic Associations Based On User Specified Relationships
An Effective Approach For Discovering Relevant Semantic Associations Based On User Specified Relationships
(1)
Where l denotes number of user selected relationships in S, L denotes total number of relationships in S, length(S)
denotes length of S which is total number of components in S (Excluding first and last entities) and c is a constant used to
fine tune the relationship weight (Rel
s
).
4.2 Computing the Overall Ranking Score of Semantic Associations
The overall ranking score of semantic associations is computed in terms of other weights as described below;
Context Weight Cs
Consider the scenario where user wishes to find semantic associations in the domain of Politics. Then concepts such as
Politician, Political Organization and Legislation are considered to be more relevant and the concepts such as Financial
Organization and Terrorist Organization are considered to be less relevant. So, user is provided a facility to define his
context and based on this context the associations are ranked.
Subsumption Weight Ss
In an RDF graph, entities that occur at lower level in the hierarchy are treated as more specialized entities and convey more
meaning than the entities that occur at higher levels. So Associations that consists these entities are more relevant.
Path length Weight L
s
In some applications, user might be interested in finding shorter associations, yet in other applications he may wish to find
longer associations.
Popularity Weight P
s
The number of incoming and outgoing edges of an entity is defined as its popularity. In some cases associations that contain
many popular entities are considered to be more relevant.
Rarity Weight R
s
In some applications, user might be interested in rarely occurring events and in other applications he might be interested in
commonly occurring events. For example, in money laundering officials wish to find associations consisting commonly
occurring events because money launderers perform several common transactions to escape from law. So user is allowed to
select favor rare or common associations according to his interest.
Trust Weight T
s
The entities and relationships in a semantic association come from different sources. Some sources may be more trusted and
some sources may be less trusted. For example in India The Hindu is considered to be more trusted source. Thus trust value
is assigned to components in an association based on the source from which it is coming.
The overall ranking score of a semantic associations, S, is given as;
527 Elsevier Publications, 2013
An Effective Approach for Discovering Relevant Semantic Associations Based on User Specified Relationships
(2)
In this, k1+k2+k3+k4+k5+k6+k71and is required to fine-tune the ranking of Semantic Associations. In this work
more weight is given to context and Relationship weight.
5. Experimental Results
The ranking approach presented in this work has been implemented and tested on SWETO (Semantic Web Technology
Evaluation ontology) [11] test-bed. Using a Web-based application, user can enter two entities between which he wish to
discover semantic associations. To define the context user can select his interested concepts from the ontology using a touch
graph like User Interface and the relationships from the list of relationships. Optionally, user can customize the ranking
criteria by assigning weights to each individual criterion. Prior to ranking, the associations are stored in Oracle database.
These unranked associations are then ranked based on the criteria defined in Section 4. Ranked top 10 semantic association
results are shown in Table 1.
To demonstrate the effectiveness of the proposed method, we have compared the system ranking with the user-human
ranking between two entities George W. Bush and Mitt Romney based on promoted law and opposed law relationships which
is shown in Figure 2. The x-axis shows ranking of semantic associations by the proposed method and y-axis shows user-
human ranking which is assigned by the users manually. The Spearmans foot rule [19] distance measure is used to measure
the similarity between the proposed system ranking and the user-human ranking.
Spearmans Foot rule distance measure is given as
(3)
Spearmans Foot rule Coefficient is defined as;
(4)
Based on the results, the average correlation coefficient between the proposed system ranking and the user-human ranking is
0.63. Since the average correlation coefficient between the proposed system ranking and the user-human ranking is greater
than 0.50, the proposed system ranking and the user-human ranking are highly correlated. In information retrieval context,
precision represents the fraction of retrieved associations that are relevant to the search. Figure 3 show that the precision rate
of the proposed method is superior to other existing methods.
Table 1. Top Ranked Semantic Associations
Rank Semantic Association
Ranking
Score
1 George W. Bush-promoted law-PATRIOT Act-opposed law-Edward Kennedy-represents-
Massachusetts-represents-Mitt Romney
0.65
2 George W. Bush-signed by-PATRIOT Act-opposed law-Edward Kennedy-represents-Massachusetts-
represents-Mitt Romney
0.65
3 George W. Bush-signed by-No Child Left Behind Act-promoted law-Edward Kennedy-represents-
Massachusetts-represents-Mitt Romney
0.63
4 George W. Bush-promoted law-PATRIOT Act-passed law-U.S. Senate-member of-Elizabeth Dole-
member of-Republican Party-member of-Mitt Romney
0.62
5 George W. Bush-signed by-PATRIOT Act-passed law-U.S. Senate-member of-Elizabeth Dole-
member of-Republican Party-member of-Mitt Romney
0.62
6 George W. Bush-relative of-Jeb Bush-member of-National Governors Association-member of-Mitt
Romney
0.62
7 George W. Bush-promoted law-PATRIOT Act-passed law-U.S. Senate-member of-Mel Martinez-
member of-Republican Party-member of-Mitt Romney
0.61
8 George W. Bush-promoted law-PATRIOT Act-passed law-U.S. Senate-member of-Pete Domenici-
member of-Republican Party-member of-Mitt Romney
0.61
9 George W. Bush-promoted law-PATRIOT Act-passed law-U.S. Senate-member of-Norm Coleman-
member of-Republican Party-member of-Mitt Romney
0.61
10 George W. Bush-promoted law-PATRIOT Act-passed law-U.S. Senate-member of-Johnny Isakson-
member of-Republican Party-member of-Mitt Romney
0.61
528 Elsevier Publications, 2013
S. Narayana
1
,
B.S. Kumar
2
, Lakshmana Phaneendra Maguluri
3,
G P S Varma
4
and A Govardhan
5
Fig 2. Comparison of system ranking with user-human ranking
Fig 3. Comparison of precision rate between the proposed method and Aleman Meza et al. Method
6. Conclusion
Finding Semantic Associations between two entities is very useful, especially in applications such as National Security,
Business Intelligence, Genetics and Pharmaceutical research. Given two entities, there exist a huge number of associations.
Hence, new techniques are required to find relevant Semantic Associations. This paper proposes a method to define the
context both using user interested concepts and relationships to discover more relevant Semantic Associations. The ranking
of Semantic Associations of the proposed method with existing methods is compared using Spearmans Foot rule. The
average correlation coefficient of proposed method is 0.63. Since the average correlation coefficient is greater than 0.5, the
proposed system ranking is highly correlated with user-human ranking. In addition, the average precision rate of the proposed
method is compared with Aleman Meza et al. method. The average precision rate of the proposed method is higher than
Aleman Meza et al. and Viswanathan et al. methods which describes that the proposed method discovers the relevant
associations more efficiently. When size of RDF graph grows, it is difficult for the user to specify context through the user
interface. Hence, in the coming years, a methodology is to be developed to make the system learn the above said features and
assign ranks to semantic associations.
u
1
2
S
4
S
6
7
8
9
1u
1 2 S 4 S 6 7 8 9 1u
U
s
e
r
-
H
u
m
a
n
R
a
n
k
i
n
g
System Ranking
01
02
0S
04
0S
u%
1u%
2u%
Su%
4u%
Su%
6u%
7u%
8u%
9u%
1uu%
Su 1uu 1Su 2uu
P
r
e
c
i
s
i
o
n
R
a
t
e
Top k-associations
Aleman Neza et al.
viswanathan et al.
Piposeu Nethou
529 Elsevier Publications, 2013
An Effective Approach for Discovering Relevant Semantic Associations Based on User Specified Relationships
References
|1| Aleman-Meza Bonerges, Halaschek-Wiener Christian, Arpinar IB, Ramakrishnan Cartic, Sheth Amit (2005). Ranking Complex
Relationships on the Semantic Web. IEEE Internet Computing 9(3); 37-44. Doi:10.1109/MIC.200.63.
|2| Aleman-Meza, B., Halaschek, C., Arpinar, I. B., and Sheth, A. "Context-Aware Semantic Association Ranking." First International
Workshop on Semantic Web and Databases, Berlin, Germany, 33-50.
|3| Viswanathan V, Ilango K. Ranking semantic relationships between two entities using personalization in contest speciIication.
InIormain Sciences, Elsevier, 207 (2012) 35-49.
|4| Anyanwu KemaIor, Angela Maduko, Sheth Amit. SemRank: ranking complex relationship search results on the Semantic Web, in:
Proc. oI the 14th International World Wide Web ConIerence, ACM Press, 2005, pp. 117127.
|5| Myungjin Lee, Wooju Kim. Semantic association search and rank method based on spreading activation Ior the Semantic Web, in:
IEEE International ConIerence on Industrial Engineering and Engineering Management, 2009, pp. 15231527.
|6| Shahdad Shariatmadari, Ali Mamat, Ibrahim Hamidah, Mustapha Norwati (2008). SwSim:Discovering semantic similarity association
in semantic web. Proceedings oI International Symposium on ITSim 2008, 1-4.
|7| Anyanwu KemaIor, Sheth Amit. p-operator: Discovering and Ranking Semantic Associations on the Semantic Web, ACM SIGMOD
Record, v. 31 n.4, December 2002.
|8| Berners-Lee, T., Hendler, J., and Lassila, O. (2001). "The Semantic Web - A new Iorm oI Web content that is meaningIul to computers
will unleash a revolution oI new possibilities." ScientiIic American, 284(5), 34.
|9| Lassila Ora and Swick R. Resource Description Framework (RDF) Model and Syntax SpeciIication, W3C Recommendation. 1999.
|10| Brickley D and Guha RV. Resource Description Framework (RDF) Schema SpeciIication 1.0, W3C Candidate Recommendation.
2000.
|11| Aleman-Meza, B., Halaschek, C., Sheth, A., Arpinar, I. B., and Sannapareddy, G. "SWETO: Large-Scale Semantic Web Test-bed."
16th International ConIerence on SoItware Engineering and Knowledge Engineering (SEKE2004): Workshop on Ontology in Action,
BanII, Canada, 490-493.
|12| Guha, R. V., and McCool, R. (2003). "TAP: A Semantic Web Test-bed." Journal oI Web Semantics, 1(1), 81-87.
|13| Opencyc. http://sw.opencyc.org
|14| Sheth, A. P., Arpinar, I. B., and Kashyap, V. (2003). "Relationships at the Heart oI Semantic Web: Modeling, Discovering and
Exploiting Complex Semantic Relationships." Enhancing the Power oI the Internet Studies in Fuzziness and SoIt Computing, M.
Nikravesh, B. Azvin, R. Yager, and L. A. Zadeh, eds., Springer-Verlag.
|15| Stumme, G., Hotho, A., Berendt, B.: Semantic Web Mining: State oI the art andIuture directions. Web Semantics: Science, Services
and Agents on the World WideWeb 4(2) (2006) 124 143.
|16| Berendt, B., Hotho, A., Mladenic, D., van Someren, M., Spiliopoulou, M., Stumme,G.: A Roadmap Ior Web Mining: From Web to
Semantic Web. Web Mining: FromWeb to Semantic Web Volume 3209/2004 (2004) 122.
|17| Halaschek, C., Aleman-Meza, B., Arpinar, I. B., and Sheth, A. P. "Discovering and Ranking Semantic Associations over a Large RDF
Metabase." 30th International ConIerence on Very Large Data Bases, Toronto, Canada.
|18| Sheth, A. P., Aleman-Meza, B., Arpinar, I. B., Halaschek, C., Ramakrishnan, C., Bertram, C., Warke, Y., Avant, D., Arpinar, F. S.,
Anyanwu, K., and Kochut, K. (2005a). "Semantic Association IdentiIication and Knowledge Discovery Ior National Security
Applications." Journal oI Database Management, 16(1), 33-53.
|19| P. Diaconis, R. Graham, Spearman`s Iootrule as a measure oI disarray, Journal oI the Royal Statistical Society Series B 39 (2) (1977)
262268.
|20| Narayana S, Govardhan A, Varma GPS. Discovering and Ranking Semantic Associations on the Semantic web, International Journal
oI Computer Science and Management Research, Vol 1 Issue 5 December 2012, pp. 1092-1102.
|21| Narayana S, Govardhan A, Varma GPS, 'An RDF Approach to Discover Relevant Semantic Associations, International Journal oI
Advanced Research in Computer Science and SoItware Engineering, Volume 3, Issue 7, July.
|22| Fabrizio Lamberti, Member, IEEE, Andrea Sanna, and Claudio Demartini, A Relation-Based Page Rank Algorithm Ior Semantic Web
Search Engines, IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, VOL. 21, NO. 1, JANUARY 2009, pp.
123-136.
530 Elsevier Publications, 2013
Index
C
Context weight Cs, 527
P
Path length weight Ls, 527
Popularity weight Ps, 527
R
-operator, 526
Ranking semantic associations, 526527
Rarity weight Rs, 527
S
Semantic Web (SW), 525
data model, 525
semantic association, 524526
SemRank, 526
Subsumption weight Ss, 527
System ranking, 529
T
Trust weight Ts, 527528