“This book is the Bible for anyone who needs to manage large data collections.” Managing Gigabytes: Compressing and Indexing Documents and Images, by Ian H. Witten, Alistair Moffat, and Timothy C. Bell (New York: Van Nostrand Reinhold).


Managing Gigabytes

It also details dozens of powerful techniques supported by mg, the authors’ own system for compressing, storing, and retrieving text, images, and textual images.

It discusses all the algorithms and tradeoffs, and comes with free downloadable source code to experiment with. If you care about search engines, you need this book. The ideas are very well explained, and the problems are solved in a stepwise fashion, leading from a simple, inefficient solution to a more complex, efficient one.


On a personal note, it would be great to see some emphasis on web mining applications in future editions.


Students, researchers, and practitioners will all benefit from reading this book.


This serves as a superior text for students studying document and image indexing and processing, and information and multimedia retrieval. The authors are smart guys who can get things done: google "mg" for their website and "mg4j" for the ported Java implementation. Another book in the same area worth mentioning is “Modern Information Retrieval”. The use of compression in storing the text, integers, lexicon, and inverted lists is detailed beautifully. The book is both detailed and enjoyable; the authors have combined elegant writing with top-grade programming.

Witten is a professor of computer science at the University of Waikato in New Zealand. While this book was published almost two decades ago, it is still the best introductory text on the topic of information retrieval.

The second part covers indexing plus some querying, which I highly recommend because it is “practical”. The ideas on compression and efficiency described in the book and implemented in the software are the best that I know of in the public domain, and I’ve looked!


It has been 8 years since it was published, and I can see that it is still one of the best in the IR field.


This is the only book there is that will teach you how to build an information retrieval system, aka a search engine. In this fully updated second edition of the highly acclaimed Managing Gigabytes, authors Witten, Moffat, and Bell continue to provide unparalleled coverage of state-of-the-art techniques for compressing and indexing data.

Without many long, magic equations, it is not hard for an ordinary reader to pick up. What distinguishes this book from others is that it doesn’t assume any previous knowledge, technical or otherwise, of the topic, and builds all the ideas and concepts it presents from the ground up.

Managing Gigabytes: Compressing and Indexing Documents and Images, Second Edition

That said, I strongly look forward to Managing Terabytes, if it ever appears.
