Machine Learning for Text

Aggarwal, Charu C.

Machine Learning for Text [electronic resource] / by Charu C. Aggarwal. - 2nd ed. 2022. - XXIII, 565 p. 92 illus., 5 illus. in color. online resource.

1 An Introduction to Text Analytics -- 2 Text Preparation and Similarity Computation -- 3 Matrix Factorization and Topic Modeling -- 4 Text Clustering -- 5 Text Classification: Basic Models -- 6 Linear Models for Classification and Regression -- 7 Classifier Performance and Evaluation -- 8 Joint Text Mining with Heterogeneous Data -- 9 Information Retrieval and Search Engines -- 10 Language Modeling and Deep Learning -- 11 Attention Mechanisms and Transformers -- 12 Text Summarization -- 13 Information Extraction and Knowledge Graphs -- 14 Question Answering -- 15 Opinion Mining and Sentiment Analysis -- 16 Text Segmentation and Event Detection.

This second edition textbook covers a coherently organized framework for text analytics, which integrates material drawn from the intersecting topics of information retrieval, machine learning, and natural language processing. Particular importance is placed on deep learning methods. The chapters of this book span three broad categories: 1. Basic algorithms: Chapters 1 through 7 discuss the classical algorithms for text analytics such as preprocessing, similarity computation, topic modeling, matrix factorization, clustering, classification, regression, and ensemble analysis. 2. Domain-sensitive learning and information retrieval: Chapters 8 and 9 discuss learning models in heterogeneous settings such as a combination of text with multimedia or Web links. The problem of information retrieval and Web search is also discussed in the context of its relationship with ranking and machine learning methods. 3. Natural language processing: Chapters 10 through 16 discuss various sequence-centric and natural language applications, such as feature engineering, neural language models, deep learning, transformers, pre-trained language models, text summarization, information extraction, knowledge graphs, question answering, opinion mining, text segmentation, and event detection. Compared to the first edition, this second edition textbook (which targets mostly advanced level students majoring in computer science and math) has substantially more material on deep learning and natural language processing. Significant focus is placed on topics like transformers, pre-trained language models, knowledge graphs, and question answering.

9783030966232

10.1007/978-3-030-96623-2 doi


Machine learning.
Data mining.
Information storage and retrieval systems.
Machine Learning.
Data Mining and Knowledge Discovery.
Information Storage and Retrieval.

Q325.5-.7

006.31
© 2024 IIIT-Delhi, library@iiitd.ac.in