Text Processing in Java by Mitzi Morris

Text Processing in Java by Mitzi Morris

Regular price
Checking stock...
Regular price
Checking stock...
Proud to be B-Corp

Our business meets the highest standards of verified social and environmental performance, public transparency and legal accountability to balance profit and purpose. In short, we care about people and the planet.

The feel-good place to buy books
  • Free delivery in the UK
  • Supporting authors with AuthorSHARE
  • 100% recyclable packaging
  • B Corp - kinder to people and planet
  • Buy-back with World of Books - Sell Your Books

Text Processing in Java by Mitzi Morris

This book teaches you how to master the subtle art of multilingual text processing and prevent text data corruption. It provides an introduction to natural language processing using Lucene and Solr. It gives you tools and techniques to manage large collections of
text data, whether they come from news feeds, databases, or legacy documents. Each chapter contains executable programs that can also be used for text data forensics. Topics covered: -Unicode code points -Character encodings from ASCI and Big5 to UTF-8 and UTF-32LE -Character normalization using International Components for Unicode (ICU) -Java I/O, including working directly with zip, gzip, and tar files -Regular expressions in Java -Transporting text data via HTP -Parsing and generating XML, HTML, and JSON -Using Lucene 4 for natural language search and text classification -Search, spelling correction, and clustering with Solr 4 Other books on text processing presuppose much of the material covered in this book.
They gloss over the details of transforming text from one format to another and assume perfect input data. The messy reality of raw text will have you reaching for this book again and again.
SKU Nicht verfügbar
ISBN 13 9780988208728
ISBN 10 0988208725
Titel Text Processing in Java
Autor Mitzi Morris
Buchzustand Nicht verfügbar
Bindungsart Paperback
Verlag Colloquial Media Corporation
Erscheinungsjahr 2014-01-01
Seitenanzahl 328
Hinweis auf dem Einband Die Abbildung des Buches dient nur Illustrationszwecken, die tatsächliche Bindung, das Cover und die Auflage können sich davon unterscheiden.
Hinweis Nicht verfügbar