Text Mining

Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred Damerau

Author:	Sholom M. Weiss, Nitin Indurkhya, Tong Zhang, Fred Damerau
EAN:	9780387345550
eBook format:	PDF
Language:	English
Product type:	eBook
Publication date:	8 January 2010
Subtitle:	Predictive Methods for Analyzing Unstructured Information
Category:	Computers
Keywords:	Active learning; Clustering and matching; Document classification and correction; Extraction; Retrieval; Summarization; classification; clustering; data mining; information retrieval; text mining

149,79 €*

incl. VAT


Data mining is a mature technology. The prediction problem, looking for predictive patterns in data, has been widely studied. Strong methods are available to the practitioner. These methods process structured numerical information, where uniform measurements are taken over a sample of data. Text is often described as unstructured information. So, it would seem, text and numerical data are different, requiring different methods. Or are they? In our view, a prediction problem can be solved by the same methods, whether the data are structured numerical measurements or unstructured text. Text and documents can be transformed into measured values, such as the presence or absence of words, and the same methods that have proven successful for predictive data mining can be applied to text. Yet, there are key differences. Evaluation techniques must be adapted to the chronological order of publication and to alternative measures of error. Because the data are documents, more specialized analytical methods may be preferred for text. Moreover, the methods must be modified to accommodate very high dimensions: tens of thousands of words and documents. Still, the central themes are similar.
