KnowledgeGraph

The PureDiscovery Technology Stack

  • BrainSpace: Companies, organizations, or even the global community have never had the ability to dynamically socialize knowledge. By intimately linking the concepts, thoughts and ideas in documents with their authors, collaborators or researchers PD BrainSpace does just that. BrainSpace is a breakthrough technology that transforms an organizations documents into a collective intelligence that automatically connects people with what they know in ways that have simply never been possible before.
  • Transparent Semantic Search: The stigma of semantic search has always been that it is a Black Box. Users enter queries and receive results, but when asked how the results were achieved the vendors respond with "it's magic" or "it just works." PD Concept Search has completely removed the top off the black box and for the first time ever, users are not only able to see what has been learned by the system, but also use our QueryCloud application to control it.
  • QueryCloud Visual Query Generator: Our innovative QueryCloud interface surfaces to the user all of the words and phrases inferred by the BrainSpace that are semantically related to their query. QueryCloud then allows users to control what terms or phrases are used, not used, emphasized or de-emphasized. All with the simple click of a button. QueryCloud provides users with a window into the BrainSpace.
  • Visual Clustering: The first step into understanding what exactly is contained in a large corpus of documents is to organize them into related sets of documents or clusters. PD Clustering is our very fast and scaleable clustering engine that can bring order to large document collections very, very quickly. PD Clustering dynamically orders similar documents into automatically named clusters enabling users to browse data by semantically related groups rather than looking at each individual document. PD Clustering is fast enough to cluster even the largest of document populations with a benchmark of over 80 million pages clustered in a 48 hr period on a single machine.
  • Near-Dupe Identification PureDiscovery's Near-Dedupe Identification Engine provides instant value to any application by detecting and grouping near duplicate documents. Identifying documents with these slight variances results in dramatic savings in time wasted looking at the same document again and again.