Knowledge Extraction with Non-Negative Matrix Factorization for Text Classification

Silva, Catarina; Ribeiro, Bernardete

http://hdl.handle.net/10400.8/13155

Use this identifier to reference this record.

Name:	Description:	Size:	Format:
Knowledge extraction with non-negative matrix factorization for text classification.pdf	Text classification has received increasing interest over the past decades for its wide range of applications driven by the ubiquity of textual information. The high dimensionality of those applications led to pervasive use of dimensionality reduction methods, often black-box feature extraction non-linear techniques. We show how Non-Negative Matrix Factorization (NMF), an algorithm able to learn a parts-based representation of data by imposing non-negativity constraints, can be used to represent and extract knowledge from a text classification problem. The resulting reduced set of features is tested with kernel-based machines on Reuters-21578 benchmark showing the method's performance competitiveness.	281.59 KB	Adobe PDF	Download

Send Feedback

Authors

Silva, Catarina

Ribeiro, Bernardete

Abstract(s)

Text classification has received increasing interest over the past decades for its wide range of applications driven by the ubiquity of textual information. The high dimensionality of those applications led to pervasive use of dimensionality reduction methods, often black-box feature extraction non-linear techniques. We show how Non-Negative Matrix Factorization (NMF), an algorithm able to learn a parts-based representation of data by imposing non-negativity constraints, can be used to represent and extract knowledge from a text classification problem. The resulting reduced set of features is tested with kernel-based machines on Reuters-21578 benchmark showing the method's performance competitiveness.

Description

10th International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2009, 23 September 2009 through 26 September 2009 - Code 79260

Keywords

Support Vector Machine Semantic Feature Nonnegative Matrix Factorization Positive Matrix Factorization Knowledge Extraction

URI

http://hdl.handle.net/10400.8/13155

Citation

Silva, C., Ribeiro, B. (2009). Knowledge Extraction with Non-Negative Matrix Factorization for Text Classification. In: Corchado, E., Yin, H. (eds) Intelligent Data Engineering and Automated Learning - IDEAL 2009. IDEAL 2009. Lecture Notes in Computer Science, vol 5788. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04394-9_37.