Paper Title
Content Based Document Retrieval Using Relevance Feature Discovery

In data mining and knowledge engineering schema, the main focus is on the identification of text documents' feature extraction, which illustrates the user preferences' in terms of huge data patterns'. Lots of approaches are proposed earlier for data mining and text classification schemes but all are compacted with only term based methodologies. As well as all this kind of schemes are highly affected from the problem of polysemy and synonymy. Throughout the years, there has been frequently held the speculation that example based strategies ought to perform superior to anything term based ones in portraying client inclinations; yet, how to successfully utilize extensive scale designs remains a difficult issue in content mining. To make an achievement in this testing issue, this paper displays an imaginative model for pertinence highlight revelation. It finds both positive and negative designs in content reports as larger amount highlights and conveys them over low level components [terms]. It additionally orders terms into classes and upgrades term weights taking into account their specificity and their dispersions in examples. Considerable tests utilizing this model on RCV1, TREC points and Reuters- 21578 demonstrate that the proposed display altogether beats both the best in class term-based strategies and the example based techniques. Keywords— Data Mining, Data Classification, Feature Extraction, Text Mining’ and Classification.