Coverart for item
The Resource Survey of text mining II : clustering, classification, and retrieval, Michael W. Berry, Malu Castellanos, editors

Survey of text mining II : clustering, classification, and retrieval, Michael W. Berry, Malu Castellanos, editors

Label
Survey of text mining II : clustering, classification, and retrieval
Title
Survey of text mining II
Title remainder
clustering, classification, and retrieval
Statement of responsibility
Michael W. Berry, Malu Castellanos, editors
Contributor
Subject
Genre
Language
eng
Summary
The proliferation of digital computing devices and their use in communication has resulted in an increased demand for systems and algorithms capable of mining textual data. Thus, the development of techniques for mining unstructured, semi-structured, and fully-structured textual data has become increasingly important in both academia and industry. This second volume continues to survey the evolving field of text mining - the application of techniques of machine learning, in conjunction with natural language processing, information extraction and algebraic/mathematical approaches, to computational information retrieval. Numerous diverse issues are addressed, ranging from the development of new learning approaches to novel document clustering algorithms, collectively spanning several major topic areas in text mining. Features: - Acts as an important benchmark in the development of current and future approaches to mining textual information - Serves as an excellent companion text for courses in text and data mining, information retrieval and computational statistics - Experts from academia and industry share their experiences in solving large-scale retrieval and classification problems - Presents an overview of current methods and software for text mining - Highlights open research questions in document categorization and clustering, and trend detection - Describes new application problems in areas such as email surveillance and anomaly detection Survey of Text Mining II offers a broad selection in state-of-the art algorithms and software for text mining from both academic and industrial perspectives, to generate interest and insight into the state of the field. This book will be an indispensable resource for researchers, practitioners, and professionals involved in information retrieval, computational statistics, and data mining. Michael W. Berry is a professor in the Department of Electrical Engineering and Computer Science at the University of Tennessee, Knoxville. Malu Castellanos is a senior researcher at Hewlett-Packard Laboratories in Palo Alto, California
Cataloging source
GW5XE
Dewey number
005.741
Illustrations
illustrations
Index
index present
LC call number
QA76.9.D343
LC item number
S69 2008eb
Literary form
non fiction
Nature of contents
  • dictionaries
  • bibliography
http://library.link/vocab/relatedWorkOrContributorDate
  • 2007
  • 2007
http://library.link/vocab/relatedWorkOrContributorName
  • Berry, Michael W
  • Castellanos, Malu
  • Society for Industrial and Applied Mathematics
  • International Workshop on Text Mining and its Applications
  • SIAM International Conference on Data Mining
http://library.link/vocab/subjectName
  • Data mining
  • Cluster analysis
  • Discriminant analysis
  • Information retrieval
  • COMPUTERS
  • COMPUTERS
  • COMPUTERS
  • Discriminant analysis
  • Information retrieval
  • Data mining
  • Cluster analysis
  • Informatique
  • Cluster analysis
  • Data mining
  • Discriminant analysis
  • Information retrieval
Label
Survey of text mining II : clustering, classification, and retrieval, Michael W. Berry, Malu Castellanos, editors
Instantiates
Publication
Note
Selected conference papers
Bibliography note
Includes bibliographical references and index
Carrier category
online resource
Carrier category code
  • cr
Carrier MARC source
rdacarrier
Color
multicolored
Content category
text
Content type code
  • txt
Content type MARC source
rdacontent
Contents
Clustering -- Cluster-Preserving Dimension Reduction Methods for Document Classification -- Automatic Discovery of SimilarWords -- Principal Direction Divisive Partitioning with Kernels and k-Means Steering -- Hybrid Clustering with Divergences -- Text Clustering with Local Semantic Kernels -- Document Retrieval and Representation -- Vector Space Models for Search and Cluster Mining -- Applications of Semidefinite Programming in XML Document Classification -- Email Surveillance and Filtering -- Discussion Tracking in Enron Email Using PARAFAC -- Spam Filtering Based on Latent Semantic Indexing -- Anomaly Detection -- A Probabilistic Model for Fast and Confident Categorization of Textual Documents -- Anomaly Detection Using Nonnegative Matrix Factorization -- Document Representation and Quality of Text: An Analysis
Control code
233973005
Dimensions
unknown
Extent
1 online resource (xv, 240 pages)
Form of item
online
Isbn
9781848000469
Media category
computer
Media MARC source
rdamedia
Media type code
  • c
Other physical details
illustrations
http://library.link/vocab/ext/overdrive/overdriveId
978-1-84800-045-2
Specific material designation
remote
System control number
(OCoLC)233973005
Label
Survey of text mining II : clustering, classification, and retrieval, Michael W. Berry, Malu Castellanos, editors
Publication
Note
Selected conference papers
Bibliography note
Includes bibliographical references and index
Carrier category
online resource
Carrier category code
  • cr
Carrier MARC source
rdacarrier
Color
multicolored
Content category
text
Content type code
  • txt
Content type MARC source
rdacontent
Contents
Clustering -- Cluster-Preserving Dimension Reduction Methods for Document Classification -- Automatic Discovery of SimilarWords -- Principal Direction Divisive Partitioning with Kernels and k-Means Steering -- Hybrid Clustering with Divergences -- Text Clustering with Local Semantic Kernels -- Document Retrieval and Representation -- Vector Space Models for Search and Cluster Mining -- Applications of Semidefinite Programming in XML Document Classification -- Email Surveillance and Filtering -- Discussion Tracking in Enron Email Using PARAFAC -- Spam Filtering Based on Latent Semantic Indexing -- Anomaly Detection -- A Probabilistic Model for Fast and Confident Categorization of Textual Documents -- Anomaly Detection Using Nonnegative Matrix Factorization -- Document Representation and Quality of Text: An Analysis
Control code
233973005
Dimensions
unknown
Extent
1 online resource (xv, 240 pages)
Form of item
online
Isbn
9781848000469
Media category
computer
Media MARC source
rdamedia
Media type code
  • c
Other physical details
illustrations
http://library.link/vocab/ext/overdrive/overdriveId
978-1-84800-045-2
Specific material designation
remote
System control number
(OCoLC)233973005

Library Locations

    • Ellis LibraryBorrow it
      1020 Lowry Street, Columbia, MO, 65201, US
      38.944491 -92.326012
    • Engineering Library & Technology CommonsBorrow it
      W2001 Lafferre Hall, Columbia, MO, 65211, US
      38.946102 -92.330125
Processing Feedback ...