Classifier Based Text Mining for Neural Network

Text Mining is around applying knowledge discovery techniques to unstructured text is termed knowledge discovery in text (KDT), or Text data mining or Text Mining. In Neural Network that address classification problems, training set, testing set, learning rate are considered as key tasks. That is collection of input/output patterns that are used to train the network and used to assess the network performance, set the rate of adjustments. This paper describes a proposed back propagation neural net classifier that performs cross validation for original Neural Network. In order to reduce the optimization of classification accuracy, training time. The feasibility the benefits of the proposed approach are demonstrated by means of five data sets like contact-lenses, cpu, weather symbolic, Weather, labor-nega-data. It is shown that , compared to exiting neural network, the training time is reduced by more than 10 times faster when the dataset is larger than CPU or the network has many hidden units while accuracy ('percent correct') was the same for all datasets but contact-lences, which is the only one with missing attributes. For contact-lences the accuracy with Proposed Neural Network was in average around 0.3 % less than with the original Neural Network. This algorithm is independent of specify data sets so that many ideas and solutions can be transferred to other classifier paradigms.




References:
[1] Guobin Ou,Yi Lu Murphey, "Multi-class pattern classification using
neural networks", Pattern Recognition 40 (2007).
[2] Jiawei Han, Micheline Kamber "Data Mining - Concepts and
Techniques" Elsevier, 2003, pages 303 to 311, 322 to 325.
[3] Intrusion Detection: Support Vector Machines and Neural Networks,
Srinivas Mukkamala, Guadalupe Janoski, Andrew Sung {srinivas,
silfalco, Department of Computer Science, New Mexico Institute of
Mining and Technology, Socorro, New Mexico 87801, 2002, IEEE.
[4] N. Jovanovic, V. Milutinovic, and Z. Obradovic, Member, IEEE,
"Foundations of Predictive Data Mining" (2002).
[5] Yochanan Shachmurove, Department of Economics, The City College
of the City, University of New York and The University of
Pennsylvania, Dorota Witkowska, Department of Management,
Technical University of Lodz "CARESS Working Paper #00-11Utilizing
Artificial Neural Network Model to Predict Stock Markets" September
2000.
[6] Bharath, Ramachandran. Neural Network Computing. McGraw-Hill,
Inc., New York, 1994. pp. 4-43.
[7] Luger, George F., and Stubblefield, William A. Artificial Intelligence:
Structures and Strategies for Complex Problem Solving, (2nd Edition).
Benjamin/Cummings Publishing Company, Inc., California, 1993, pp. 516-527.
[8] Off-line Handwriting Recognition Using Artificial Neural Networks
Andrew T. Wilson.
[9] Skapura, David M., Building Neural Networks. ACM Press, New York. pp. 29-33.
[10] Bhavit Gyan, University of Canterbury, Kevin E. Voges, University of
Canterbury Nigel K. Ll. Pope, Griffith University "Artificial Neural
Networks in Marketing from 1999 to 2003: A Region of Origin and Topic Area Analysis".
[11] Margaret H.Dunham, "Data Mining- Introductory and Advanced
Topics" Pearson Education, 2003, pages 106-112.