Text Mining Concepts, Implementation, and Big Data Challenge /

Jo, Taeho.

Text Mining Concepts, Implementation, and Big Data Challenge / [electronic resource] : by Taeho Jo. - 2nd ed. 2024. - XV, 447 p. 368 illus., 147 illus. in color. online resource. - Studies in Big Data, 45 2197-6511 ; . - Studies in Big Data, 45 .

Part I: Foundation -- Introduction -- Text Indexing -- Text Encoding -- Text Association -- Part II: Text Categorization -- Text Categorization: Conceptual View -- Text Categorization: Approaches -- Text Categorization: Implementation -- Text Categorization: Evaluation -- Part III: Text Clustering -- Text Clustering: Conceptual View -- Text Clustering: Approaches -- Text Clustering: Implementation -- Text Clustering: Evaluation -- Part IV: Advanced Topics -- Text Summarization -- Text Segmentation -- Taxonomy Generation -- Dynamic Document Organization -- References -- Index.

This popular book, updated as a textbook for classroom use, discusses text mining and different ways this type of data mining can be used to find implicit knowledge from text collections. The author provides the guidelines for implementing text mining systems in Java, as well as concepts and approaches. The book starts by providing detailed text preprocessing techniques and then goes on to provide concepts, the techniques, the implementation, and the evaluation of text categorization. It then goes into more advanced topics including text summarization, text segmentation, topic mapping, and automatic text management. The book features exercises and code to help readers quickly learn and apply knowledge. Presents an array of updated techniques for preprocessing texts into structured forms, geared for classroom use; Outlines concepts of text categorization and clustering, their algorithms, and implementation guides; Includes advanced topics such as text summarization, text segmentation, topic mapping, and automatic text management.

9783031759765


Computational intelligence.
Telecommunication.
Data mining.
Information storage and retrieval systems.
Quantitative research.
Computational Intelligence.
Communications Engineering, Networks.
Data Mining and Knowledge Discovery.
Information Storage and Retrieval.
Data Analysis and Big Data.

Q342

006.3

Con tecnología Koha