International Journal of Advances in Electronics and Computer Science ( IJAECS )
A highly rated peer reviewed monthly International Journal
Editor-in-Chief : Dr. P. Suresh
Journal Impact Factor : 2.68

Journal Info
Publisher: IRAJ
ISSN (p): 2394-2835
Issues/Year: 12

Paper Detail


Paper Title
Performance Comparison of Similarity Functions For Document Retrieval System

Abstract
Measuring the similarity of documents plays an important role in text-related research and applications such as document clustering, plagiarism detection, information retrieval, machine translation, and automatic essay scoring. Many methods have been proposed for this problem. They can be grouped into three main approaches: string-based, corpus-based, and knowledge-based similarity. The string-based approach is further divided into the character-based approach and the term-based approach. Some existing similarity measures cannot properly decide the similarity of a document pair in some circumstances. This paper therefore proposes a new similarity approach, called KSD (Keyword Similarity Distance), based on a term-based similarity function, to properly decide the similarity score of each document pair. The KSD function takes the keyword similarity distance between each pair of documents and then computes average similarity scores for all documents. In the paper, the proposed function gives a more correct list of related documents than the existing similarity functions. Three similarity functions (cosine, overlap, and the proposed KSD) are applied to evaluate the performance of the similarity scores. The keyword extraction process and the similarity calculation are implemented in C#. According to the experimental results, the proposed function outperforms the other similarity functions.

Keywords: Similarity function, KSD, Cosine, Overlap.
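
For orientation, the two baseline measures named in the abstract can be sketched as follows. This is a minimal illustration in Python (the paper's own implementation is in C#), using the standard textbook definitions of cosine similarity over term-frequency vectors and of the overlap coefficient over term sets. The whitespace tokenizer and the function names `tokenize`, `cosine_similarity`, and `overlap_coefficient` are assumptions made for this sketch; the paper's keyword extraction step and the KSD function itself are not specified in the abstract and are not reproduced here.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace (an assumed, deliberately simple tokenizer)."""
    return text.lower().split()


def cosine_similarity(doc_a, doc_b):
    """Cosine of the angle between the term-frequency vectors of two documents."""
    tf_a, tf_b = Counter(tokenize(doc_a)), Counter(tokenize(doc_b))
    # Only terms present in both documents contribute to the dot product.
    dot = sum(tf_a[t] * tf_b[t] for t in tf_a.keys() & tf_b.keys())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)


def overlap_coefficient(doc_a, doc_b):
    """Shared distinct terms divided by the size of the smaller term set."""
    set_a, set_b = set(tokenize(doc_a)), set(tokenize(doc_b))
    if not set_a or not set_b:
        return 0.0
    return len(set_a & set_b) / min(len(set_a), len(set_b))
```

With these definitions, the pair "a b c" and "a b" has an overlap coefficient of 1.0 but a cosine similarity of about 0.82, a small example of how two term-based measures can score the same document pair differently.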


Author - Su Mon Phyo, Lai Lai Win Kyi

Published : Volume-3,Issue-8  ( Aug, 2016 )


DOIONLINE Number - IJAECS-IRAJ-DOIONLINE-5450

Published on 2016-09-23
   
   