The Lexbe Uber Index
Is Your eDiscovery Search Finding All of the Critical Evidence?
It’s easy to assume that all search tools included in popular eDiscovery and litigation document management programs are about the same, but that could be a costly mistake. In reality, quality varies greatly. Most document review tools in use today regularly exclude important data from their search indices and, as a result, from your search results. They miss key information for a variety of reasons detailed below. Missing search results matter because if you don’t find the key evidence in your case and the opposition does, you and your client can be at a tremendous disadvantage. As electronically stored information (ESI) collections and the number of custodians increase, the need for complete, comprehensive, accurate search results is more critical than ever.
With the Uber Index feature set, part of the Lexbe eDiscovery Platform, we’ve moved search to the next level in accuracy, quality, and completeness. We index the text extracted from native files, the OCRed text from paginated versions of the same files, and an English translation of foreign-language documents, if available, all in a comprehensive, combined index for fast and easy searching. By enabling the Auto-Translation+ feature, you can return documents in other languages with your English search queries. This means that a search for “Office Building” in English will return a German document containing “Bürogebäude”.
Shortcomings of Traditional ‘Print-Driver’ based Search Approaches
Most search tools in use today index only the text created from a print-driver version of a document (TIFF or PDF), similar to what you would see if you printed the document. This ‘virtual printed’ version can be OCRed, but it often misses a surprising amount of common real-world data, including hidden sheets, filtered cells, and hidden cells in Excel worksheets; speaker notes in PowerPoint presentations; and revisions, notes, and comments in Word documents.
Other search tools, including latest-generation early case analysis (ECA) tools, use the native extracted text for search. This has the advantage of avoiding many or even all of the above problems, but it creates problems of its own. Information in a document that requires OCR, or that cannot be natively extracted, will be lost. Separate or embedded image files that need OCR remain a common and important component of ESI collections.
More and more documents in complex cases today are in multiple languages, sometimes within a single email string! Providing an English translation that you can search and review in-line is a challenge which, if not competently addressed, can result in missing key evidence. There are many translation services available to the legal industry, but these services can be slow and expensive. From a document review perspective, their output is not usually included in timely search results. Even the most advanced review tools on the market today do not have a solution that integrates search results from documents in multiple languages.
The bottom line is that either approach on its own will lose evidence and compromise search quality.
Advantage of ESI Search Multi Index+
Lexbe solves the search problems of traditional eDiscovery review and litigation document management programs with the Uber search index. The Uber index includes:
- Text extracted from the native file. This picks up all text and any metadata linked to the file
- Text from an OCRed version of the same file
- Text from an English translation of foreign-language documents, if available
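The idea of a combined index can be illustrated with a small sketch. This is not Lexbe's implementation; it is a minimal Python example under stated assumptions: each document carries up to three hypothetical text layers named `native`, `ocr`, and `translation`, and the helper names `tokenize`, `build_combined_index`, and `search` are invented for illustration. All layers feed one inverted index, so a query matches a document if its terms appear in any layer.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"\w+", text.lower())


def build_combined_index(documents):
    """Build one inverted index over every text layer of every document.

    `documents` maps a document id to a dict of layer name -> text.
    The layer names used here (native, ocr, translation) are assumptions
    made for this illustration only.
    """
    index = defaultdict(set)
    for doc_id, layers in documents.items():
        for layer_name, text in layers.items():
            # A missing layer (None) simply contributes no tokens.
            for token in tokenize(text or ""):
                index[token].add((doc_id, layer_name))
    return index


def search(index, query):
    """Return ids of documents that contain every query term in any layer."""
    terms = tokenize(query)
    if not terms:
        return set()
    results = None
    for term in terms:
        docs = {doc_id for doc_id, _layer in index.get(term, set())}
        results = docs if results is None else results & docs
    return results


# Hypothetical sample data: a spreadsheet with hidden content, a scanned
# image with no extractable native text, and a German document with an
# English translation.
documents = {
    "doc1": {"native": "Budget notes hidden sheet", "ocr": "Budget", "translation": None},
    "doc2": {"native": "", "ocr": "Scanned lease agreement", "translation": None},
    "doc3": {
        "native": "Das Bürogebäude ist verkauft",
        "ocr": "Das Bürogebäude ist verkauft",
        "translation": "The office building is sold",
    },
}
index = build_combined_index(documents)
```

In this sketch, `search(index, "Office Building")` finds `doc3` only through its translation layer, `search(index, "lease")` finds `doc2` only through its OCR layer, and `search(index, "hidden sheet")` finds `doc1` only through its native layer. An index built from any single layer would miss at least one of the three.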
Our search returns results from both the text extracted from native files and the text from a PDF-paginated version we create, including OCRed images. You get the best of both worlds and better search results. Lexbe’s language translation index rounds out the most robust, multi-layered index on the market. Hidden text, comments, revisions, notes, hidden sheets, foreign-language text, and more will be available to you and your team. Don’t risk your case by missing key evidence or inadvertently releasing privileged documents!